Skip to content

Focoos Models 🧠#

With the Focoos SDK, you can take advantage of a collection of foundational models that are optimized for a range of computer vision tasks. These pre-trained models, covering detection and semantic segmentation across various domains, provide an excellent starting point for your specific use case. Whether you need to fine-tune for custom requirements or adapt them to your application, these models offer a solid foundation to accelerate your development process.


Semantic Segmentation 🖼️#

Model Name Architecture Domain (Classes) Dataset Metric FPS Nvidia-T4
fai-mf-l-ade Mask2Former (Resnet-101) Common Scene (150) ADE20K mIoU: 48.27
mAcc: 62.15
73
fai-mf-m-ade Mask2Former (STDC-2) Common Scene (150) ADE20K mIoU: 45.32
mACC: 57.75
127
bisenetformer-l-ade BisenetFormer (STDC-2) Common Scene (150) ADE20K mIoU: 45.07
mAcc: 58.03
-
bisenetformer-m-ade BisenetFormer (STDC-2) Common Scene (150) ADE20K mIoU: 43.43
mACC: 57.01
-
bisenetformer-s-ade BisenetFormer (STDC-1) Common Scene (150) ADE20K mIoU: 42.91
mACC: 56.55
-

mIoU = Intersection over Union averaged by class
mAcc = Pixel Accuracy averaged by class
FPS = Frames per second computed using TensorRT with resolution 640x640

Object Detection 🕵️‍♂️#

Model Name Architecture Domain (Classes) Dataset Metric FPS Nvidia-T4
fai-detr-l-coco RT-DETR (Resnet-50) Common Objects (80) COCO bbox/AP: 53.06
bbox/AP50: 70.91
87
fai-detr-m-coco RT-DETR (STDC-2) Common Objects (80) COCO bbox/AP: 44.69
bbox/AP50: 61.63
181
fai-detr-l-obj365 RT-DETR (Resnet50) Common Objects (365) Objects365 bbox/AP: 34.60
bbox/AP50: 45.81
87

AP = Average Precision averaged by class
AP50 = Average Precision at IoU threshold 0.50 averaged by class
FPS = Frames per second computed using TensorRT with resolution 640x640

Instance Segmentation 🎭#

Model Name Architecture Domain (Classes) Dataset Metric FPS Nvidia-T4
fai-mf-s-coco-ins Mask2Former (Resnet-50) Common Objects (80) COCO segm/AP: 41.45
segm/AP50: 64.12
86
fai-mf-m-coco-ins Mask2Former (Resnet-101) Common Objects (80) COCO segm/AP: 43.09
segm/AP50: 65.87
70
fai-mf-l-coco-ins Mask2Former (Resnet-101) Common Objects (80) COCO segm/AP: 44.23
segm/AP50: 67.53
55

AP = Average Precision averaged by class
AP50 = Average Precision at IoU threshold 0.50 averaged by class
FPS = Frames per second computed using TensorRT with resolution 640x640