Focoos Models 🧠#

With the Focoos SDK, you can take advantage of a collection of foundational models that are optimized for a range of computer vision tasks. These pre-trained models, covering detection and semantic segmentation across various domains, provide an excellent starting point for your specific use case. Whether you need to fine-tune for custom requirements or adapt them to your application, these models offer a solid foundation to accelerate your development process.

Semantic Segmentation 🖼️#

Model Name	Architecture	Domain (Classes)	Dataset	Metric	FPS Nvidia-T4
fai-mf-l-ade	Mask2Former (Resnet-101)	Common Scene (150)	ADE20K	mIoU: 48.27 mAcc: 62.15	73
fai-mf-m-ade	Mask2Former (STDC-2)	Common Scene (150)	ADE20K	mIoU: 45.32 mACC: 57.75	127
bisenetformer-l-ade	BisenetFormer (STDC-2)	Common Scene (150)	ADE20K	mIoU: 45.07 mAcc: 58.03	-
bisenetformer-m-ade	BisenetFormer (STDC-2)	Common Scene (150)	ADE20K	mIoU: 43.43 mACC: 57.01	-
bisenetformer-s-ade	BisenetFormer (STDC-1)	Common Scene (150)	ADE20K	mIoU: 42.91 mACC: 56.55	-

mIoU = Intersection over Union averaged by class
mAcc = Pixel Accuracy averaged by class
FPS = Frames per second computed using TensorRT with resolution 640x640

Object Detection 🕵️‍♂️#

Model Name	Architecture	Domain (Classes)	Dataset	Metric	FPS Nvidia-T4
fai-detr-l-coco	RT-DETR (Resnet-50)	Common Objects (80)	COCO	bbox/AP: 53.06 bbox/AP50: 70.91	87
fai-detr-m-coco	RT-DETR (STDC-2)	Common Objects (80)	COCO	bbox/AP: 44.69 bbox/AP50: 61.63	181
fai-detr-l-obj365	RT-DETR (Resnet50)	Common Objects (365)	Objects365	bbox/AP: 34.60 bbox/AP50: 45.81	87

AP = Average Precision averaged by class
AP50 = Average Precision at IoU threshold 0.50 averaged by class
FPS = Frames per second computed using TensorRT with resolution 640x640

Instance Segmentation 🎭#

Model Name	Architecture	Domain (Classes)	Dataset	Metric	FPS Nvidia-T4
fai-mf-s-coco-ins	Mask2Former (Resnet-50)	Common Objects (80)	COCO	segm/AP: 41.45 segm/AP50: 64.12	86
fai-mf-m-coco-ins	Mask2Former (Resnet-101)	Common Objects (80)	COCO	segm/AP: 43.09 segm/AP50: 65.87	70
fai-mf-l-coco-ins	Mask2Former (Resnet-101)	Common Objects (80)	COCO	segm/AP: 44.23 segm/AP50: 67.53	55

AP = Average Precision averaged by class
AP50 = Average Precision at IoU threshold 0.50 averaged by class
FPS = Frames per second computed using TensorRT with resolution 640x640