💻
LEADERBOARD
Code Generation
AI-powered code completion, generation, and transformation
22tools ranked
Code Generation Rankings
Ranked by overall ToolRoute Score across all benchmark dimensions
| Rank | Tool Name | ToolRoute Score | Output | Reliability | Efficiency | Cost | Trust | Stars |
|---|---|---|---|---|---|---|---|---|
| 🥇 | GitLab MCPOfficial | 9.2 | 9.4 | 9.4 | 8.6 | 9.0 | 9.6 | 87,217 |
| 🥈 | SonarQube MCP Server | 9.2 | 9.5 | 9.8 | 6.6 | 9.6 | 10.0 | 720 |
| 🥉 | Figma Context MCP | 8.9 | 9.0 | 9.1 | 8.7 | 8.7 | 8.7 | 15,101 |
| #4 | OpenAI CodexOfficial | 8.8 | 7.7 | 9.8 | 8.8 | 9.1 | 9.9 | 31,003 |
| #5 | Context7Official | 8.6 | 8.2 | 8.8 | 8.3 | 8.7 | 10.0 | 57,354 |
| #6 | GitHub MCP ServerOfficial | 8.6 | 8.3 | 9.0 | 8.1 | 8.7 | 8.8 | 30,673 |
| #7 | Continue | 6.9 | 6.6 | 6.4 | 6.8 | 7.2 | 9.0 | 33,687 |
| #8 | CodeiumOfficial | 6.8 | 6.7 | 6.5 | 6.9 | 7.0 | 8.0 | 5,115 |
| #9 | Aider | 6.8 | 6.9 | 6.5 | 6.6 | 6.5 | 8.5 | 46,207 |
| #10 | CodestralOfficial | 6.8 | 6.8 | 6.6 | 6.7 | 5.7 | 9.5 | 908 |
| #11 | CursorOfficial | 6.7 | 7.0 | 6.7 | 6.8 | 5.0 | 8.2 | 32,952 |
| #12 | StarCoder2 | 6.7 | 6.6 | 6.4 | 6.8 | 7.2 | 7.0 | 2,066 |
| #13 | GitHub CopilotOfficial | 6.7 | 7.0 | 6.8 | 6.9 | 5.0 | 7.5 | 11,630 |
| #14 | TabnineOfficial | 6.7 | 6.4 | 6.5 | 6.8 | 5.7 | 9.3 | 1,438 |
| #15 | QodoOfficial | 6.7 | 6.5 | 6.5 | 6.6 | 6.0 | 8.8 | 11,603 |
| #16 | Bolt.newOfficial | 6.4 | 6.6 | 6.3 | 6.7 | 6.0 | 6.4 | 8,000 |
| #17 | WindsurfOfficial | 6.4 | 6.8 | 6.5 | 6.7 | 5.0 | 6.6 | 6,000 |
| #18 | Sourcegraph CodyOfficial | 6.3 | 6.5 | 6.6 | 6.6 | 5.3 | 6.7 | 4,000 |
| #19 | Replit AgentOfficial | 6.3 | 6.5 | 6.3 | 6.6 | 5.7 | 6.5 | 4,000 |
| #20 | TailwindCSS MCP | 6.2 | 6.0 | 6.0 | 6.5 | 7.0 | 5.7 | 750 |
| #21 | DevinOfficial | 6.1 | 6.8 | 6.3 | 6.1 | 5.0 | 6.5 | 5,000 |
| #22 | Storybook MCP | 6.0 | 6.0 | 5.9 | 6.2 | 6.5 | 5.7 | 600 |
💡
Why GitLab MCP is #1
GitLab MCP leads SonarQube MCP Server by +2.0 in Efficiency.
Output Quality
9.4
vs 9.5
Reliability
9.4
vs 9.8
Efficiency
8.6
vs 6.6
Cost
9.0
vs 9.6
Trust
9.6
vs 10.0
Score Guide:9+ Exceptional8+ Excellent7+ Good6+ Fair<6 Below Avg
Contribute Benchmark Data
Help improve these rankings by submitting real-world telemetry. Contributors earn routing credits for every data point.