Monthly LLM Rankings
Based on performance, usability, and innovation this month
Ranking | Model | LM Arena Score | Features | Model Availability | API Input (USD per 1M tokens) | API Output (USD per 1M tokens) | Context Window | Training Cutoff |
---|---|---|---|---|---|---|---|---|
Large Models | | | | | | | | |
1 | Claude 3.5 Sonnet | 1283 | | api | $3.00 | $15.00 | 200k tokens | Apr 2024 |
2 | GPT-4o | 1285 | | api | $5.00 | $15.00 | 128k tokens | Oct 2023 |
3 | DeepSeek V3 | 1317 | | both | $0.27 | $1.10 | ~128k tokens | Jul 2024 |
4 | Qwen 2.5 Max | 1332 | | api | $1.60 | $6.40 | 128k tokens | Not specified |
5 | Grok 2 | 1288 | | api | $5.00 | $15.00 | 128k tokens | Unknown |
6 | Mistral Large | 1245 | | api | $2.00 | $6.00 | 32k tokens | Unknown |
Thinking Models | | | | | | | | |
1 | OpenAI o3-mini | 1310 | | api | $1.10 | $4.40 | 200k tokens | Oct 2023 |
2 | OpenAI o1 | 1351 | | api | $15.00 | $60.00 | 200k tokens | Oct 2023 |
3 | DeepSeek R1 | 1362 | | both | $0.55 | $2.19 | 64k tokens | Not specified |
Smaller Models | | | | | | | | |
1 | GPT-4o Mini | 1273 | | api | $0.15 | $0.60 | 128k tokens | Oct 2023 |
2 | Claude 3.5 Haiku | 1236 | | api | $0.80 | $4.00 | 200k tokens | Jul 2024 |
3 | Gemini Flash 2.0 | 1357 | | api | $0.10 | $0.40 | 2M tokens | Not specified (latest model, likely 2024) |
Last updated: 8.2.2025
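
To compare the API prices above for a concrete workload, the arithmetic can be sketched as follows. This is a minimal illustration, not part of the ranking method: it assumes the listed prices are in USD per 1 million tokens, and the names `Pricing`, `PRICES`, `request_cost`, and `cheapest_model` are hypothetical helpers introduced only for this example. Only four rows of the table are copied in; the others can be added the same way.

```python
from dataclasses import dataclass

# Assumption: the table's API prices are quoted in USD per 1,000,000 tokens.
TOKENS_PER_PRICE_UNIT = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """Input and output price for one model, in USD per 1M tokens."""
    input_usd: float
    output_usd: float


# Prices copied from the table (a subset, for illustration).
PRICES = {
    "Claude 3.5 Sonnet": Pricing(3.00, 15.00),
    "GPT-4o": Pricing(5.00, 15.00),
    "DeepSeek V3": Pricing(0.27, 1.10),
    "GPT-4o Mini": Pricing(0.15, 0.60),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request for the given model."""
    p = PRICES[model]
    total = input_tokens * p.input_usd + output_tokens * p.output_usd
    return total / TOKENS_PER_PRICE_UNIT


def cheapest_model(input_tokens: int, output_tokens: int) -> str:
    """Return the model in PRICES with the lowest cost for this token mix."""
    return min(PRICES, key=lambda m: request_cost(m, input_tokens, output_tokens))


if __name__ == "__main__":
    # Example workload: 10,000 input tokens and 2,000 output tokens per request.
    for name in PRICES:
        print(f"{name}: ${request_cost(name, 10_000, 2_000):.4f}")
```

Because output tokens cost several times more than input tokens for every model listed, the ratio of output to input in a workload can matter as much as the headline input price.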