LEADERBOARD
Code Generation
AI-powered code completion, generation, and transformation
22 tools ranked
Code Generation Rankings
Ranked by overall ToolRoute Score across all benchmark dimensions
| Rank | Tool Name | ToolRoute Score | Output | Reliability | Efficiency | Cost | Trust | Stars |
|---|---|---|---|---|---|---|---|---|
| #1 | GitLab MCP (Official) | 9.2 | 9.4 | 9.4 | 8.6 | 9.0 | 9.5 | 84,527 |
| #2 | SonarQube MCP Server | 9.2 | 9.5 | 9.8 | 6.6 | 9.6 | 10.0 | 720 |
| #3 | Figma Context MCP | 8.9 | 9.0 | 9.1 | 8.7 | 8.7 | 8.7 | 14,517 |
| #4 | OpenAI Codex (Official) | 8.8 | 7.8 | 9.8 | 8.8 | 9.1 | 9.9 | 30,596 |
| #5 | Context7 (Official) | 8.6 | 8.3 | 8.8 | 8.3 | 8.7 | 9.9 | 53,720 |
| #6 | GitHub MCP Server (Official) | 8.5 | 8.3 | 9.0 | 8.1 | 8.7 | 8.8 | 29,246 |
| #7 | Continue | 6.9 | 6.6 | 6.4 | 6.8 | 7.2 | 9.0 | 32,791 |
| #8 | Codeium (Official) | 6.9 | 6.7 | 6.5 | 6.9 | 7.0 | 8.5 | 5,113 |
| #9 | Aider | 6.9 | 6.9 | 6.5 | 6.6 | 6.5 | 8.8 | 43,936 |
| #10 | Codestral (Official) | 6.8 | 6.8 | 6.6 | 6.7 | 5.7 | 9.0 | 885 |
| #11 | StarCoder2 | 6.7 | 6.6 | 6.4 | 6.8 | 7.2 | 7.0 | 2,061 |
| #12 | Cursor (Official) | 6.7 | 7.0 | 6.7 | 6.8 | 5.0 | 8.0 | 32,751 |
| #13 | Tabnine (Official) | 6.7 | 6.4 | 6.5 | 6.8 | 5.7 | 9.5 | 1,435 |
| #14 | GitHub Copilot (Official) | 6.7 | 7.0 | 6.8 | 6.9 | 5.0 | 7.5 | 11,565 |
| #15 | Qodo (Official) | 6.7 | 6.5 | 6.5 | 6.6 | 6.0 | 9.0 | 11,002 |
| #16 | Bolt.new (Official) | 6.4 | 6.6 | 6.3 | 6.7 | 6.0 | 6.4 | 8,000 |
| #17 | Windsurf (Official) | 6.4 | 6.8 | 6.5 | 6.7 | 5.0 | 6.6 | 6,000 |
| #18 | Sourcegraph Cody (Official) | 6.3 | 6.5 | 6.6 | 6.6 | 5.3 | 6.7 | 4,000 |
| #19 | Replit Agent (Official) | 6.3 | 6.5 | 6.3 | 6.6 | 5.7 | 6.5 | 4,000 |
| #20 | TailwindCSS MCP | 6.2 | 6.0 | 6.0 | 6.5 | 7.0 | 5.7 | 750 |
| #21 | Devin (Official) | 6.1 | 6.8 | 6.3 | 6.1 | 5.0 | 6.5 | 5,000 |
| #22 | Storybook MCP | 6.0 | 6.0 | 5.9 | 6.2 | 6.5 | 5.7 | 600 |
Why GitLab MCP is #1
GitLab MCP leads SonarQube MCP Server by 2.0 points in Efficiency (8.6 vs 6.6).
| Dimension | GitLab MCP | SonarQube MCP Server |
|---|---|---|
| Output Quality | 9.4 | 9.5 |
| Reliability | 9.4 | 9.8 |
| Efficiency | 8.6 | 6.6 |
| Cost | 9.0 | 9.6 |
| Trust | 9.5 | 10.0 |
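
The head-to-head comparison above can be reproduced from the leaderboard rows. A minimal sketch, assuming the five dimension scores are copied by hand from the table; the `dimension_deltas` helper and the dictionary layout are hypothetical and not part of any ToolRoute API.

```python
# Dimension scores copied from the leaderboard rows for the top two tools.
GITLAB_MCP = {"Output": 9.4, "Reliability": 9.4, "Efficiency": 8.6, "Cost": 9.0, "Trust": 9.5}
SONARQUBE_MCP = {"Output": 9.5, "Reliability": 9.8, "Efficiency": 6.6, "Cost": 9.6, "Trust": 10.0}


def dimension_deltas(a, b):
    """Return a[dim] - b[dim] for each dimension, rounded to one decimal.

    Hypothetical helper for illustration only.
    """
    return {dim: round(a[dim] - b[dim], 1) for dim in a}


deltas = dimension_deltas(GITLAB_MCP, SONARQUBE_MCP)
for dim, delta in deltas.items():
    # Positive values mean the first tool (GitLab MCP) scores higher.
    print(f"{dim:<12}{delta:+.1f}")
```

With these inputs, Efficiency is the only dimension where GitLab MCP comes out ahead.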
Score Guide: 9+ Exceptional · 8+ Excellent · 7+ Good · 6+ Fair · <6 Below Avg
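
The Score Guide amounts to a simple threshold lookup. A minimal sketch, assuming that "9+" means a score of 9.0 or higher (and likewise for the other bands); the `score_label` function is a hypothetical name used here for illustration.

```python
def score_label(score: float) -> str:
    """Map a 0-10 score to its Score Guide label.

    Assumes each "N+" band is inclusive of N (e.g. 9.0 counts as Exceptional).
    """
    if score >= 9:
        return "Exceptional"
    if score >= 8:
        return "Excellent"
    if score >= 7:
        return "Good"
    if score >= 6:
        return "Fair"
    return "Below Avg"


# Example: label a few ToolRoute Scores taken from the leaderboard.
for name, score in [("GitLab MCP", 9.2), ("Figma Context MCP", 8.9), ("Continue", 6.9)]:
    print(f"{name}: {score} ({score_label(score)})")
```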
Contribute Benchmark Data
Help improve these rankings by submitting real-world telemetry. Contributors earn routing credits for every data point.