Gemini 3.1 Pro
LiveCodeBench
Aider Polyglot
Coding ranks the published roster by SWE-bench Verified when coverage exists. LiveCodeBench is shown beside it as the mandatory companion signal, and Aider Polyglot appears when verified.
Buyer-facing table
| Rank | Model | SWE-bench VerifiedRank source | LiveCodeBench | Aider Polyglot |
|---|---|---|---|---|
| 01 | Gemini 3.1 Pro |
LiveCodeBench
Aider Polyglot
Editorial investigation
The coding leaderboard keeps the software engineering benchmark front and center, then exposes companion evidence next to it so buyers can spot harness-driven divergence quickly.
Open top coding model"Choose this when you need the highest reasoning ceiling available and can feed it text, images, audio, or video in the same request."
Gemini 3.1 Pro
| 91.7% | 69.6% |
| 02 | Claude Sonnet 4.6 | 79.6% | 74.1% | 82.4% |
|---|
| 03 | GPT-5.4 Replaces GPT-5.2 | 79.5% | 72.5% | 88.0% |
|---|
| 04 | Gemini 3 Flash | 78.0% | 90.8% | 75.8% |
|---|
| 05 | Claude Opus 4.6 | 62.7% | 87.1% | 68.4% |
|---|
| 06 | DeepSeek V3.2 (Thinking) | 48.0% | 86.2% | 74.2% |
|---|
SWE-bench Verified
Anthropic
LiveCodeBench
Aider Polyglot
SWE-bench Verified
OpenAI
LiveCodeBench
Aider Polyglot
SWE-bench Verified
LiveCodeBench
Aider Polyglot
SWE-bench Verified
Anthropic
LiveCodeBench
Aider Polyglot
SWE-bench Verified
DeepSeek
LiveCodeBench
Aider Polyglot
SWE-bench Verified