| Rank | Model | Score |
|---|---|---|
| #1 | Grok 4.1 Fast | 58.7% |
| #2 | Gemini 3.1 Pro Preview | 55.3% |
| #3 | Grok 4.20 Beta | 53.3% |
| #4 | Gemini 3 Flash Preview | 48.7% |
| #5 | Gemini 3.1 Flash Image Preview | 46.0% |
| #6 | GPT-5.4 | 32.7% |
| #7 | GPT-5.5 | 29.3% |
| #8 | Qwen 3.6 Plus | 28.0% |
| #9 | Qwen 3.6 Plus Preview | 22.0% |
| #10 | Claude Opus 4.7 | 18.7% |
| #11 | Claude Opus 4.6 | 16.7% |
| #12 | GLM-5.1 | 12.7% |
| #13 | GLM-5 | 12.0% |
| #14 | Claude Sonnet 4.6 | 10.7% |
| #15 | Claude Haiku 4.5 | 8.7% |
| #16 | Gemini 2.5 Pro | 5.3% |