Vision Models

Image understanding, classification, and visual reasoning models

15tools ranked

Vision Models Rankings

Ranked by overall ToolRoute Score across all benchmark dimensions

Sort:

Rank	Tool Name	ToolRoute Score	Output	Reliability	Efficiency	Cost	Trust	Stars
🥇	SAM 2	6.9	6.9	6.6	6.8	7.2	8.0	19,578
🥈	GPT-4 VisionOfficial	6.8	7.1	6.9	6.5	5.0	9.0	31,214
🥉	Claude 3 VisionOfficial	6.8	7.0	7.0	6.6	5.0	9.0	3,770
#4	RoboflowOfficial	6.8	6.7	6.6	6.5	5.7	9.5	48,321
#5	PixtralOfficial	6.8	6.7	6.6	6.8	5.5	9.4	922
#6	Florence-2	6.7	6.7	6.5	6.9	7.2	6.6	8,000
#7	InternVL	6.7	6.8	6.5	6.7	7.2	6.5	10,099
#8	Moondream	6.7	6.3	6.2	7.2	7.2	7.6	9,879
#9	Grounding DINO	6.6	6.7	6.4	6.6	7.2	6.5	10,437
#10	CogVLM	6.6	6.7	6.4	6.6	7.2	6.5	6,743
#11	LLaVA	6.6	6.6	6.4	6.7	7.2	6.5	24,939
#12	Qwen-VL	6.6	6.7	6.5	6.8	7.1	6.0	6,712
#13	Azure Computer VisionOfficial	6.5	6.4	6.7	6.6	5.5	8.0	322
#14	Gemini VisionOfficial	6.3	6.9	6.8	6.9	5.3	3.5	2,326
#15	AWS RekognitionOfficial	6.2	6.5	6.8	6.6	5.5	4.0	7,610

💡

SAM 2 leads GPT-4 Vision by +2.2 in Cost, and also wins in Efficiency.

Output Quality

6.9

vs 7.1

Reliability

6.6

vs 6.9

Efficiency

6.8

vs 6.5

Cost

7.2

vs 5.0

Trust

8.0

vs 9.0

Score Guide:9+ Exceptional8+ Excellent7+ Good6+ Fair<6 Below Avg

Help improve these rankings by submitting real-world telemetry. Contributors earn routing credits for every data point.