
| Rank | Model | ArenaElo | CodingElo | Vision | ArenaHard |
|---|---|---|---|---|---|
| 201 | Claude-2.1 | 1118 | 1132 | 0 | 22.77 |
| 202 | GPT-3.5-Turbo-0613 | 1117 | 1135 | 0 | 24.82 |
| 203 | Mixtral-8x7B-Instruct-v0.1 | 1114 | 1114 | 0 | 23.4 |
| 204 | Claude-Instant-11 | 1111 | 1109 | 0 | 0 |
| 205 | Yi-34B-Chat | 1111 | 1106 | 0 | 23.15 |
| 206 | Gemini Pro | 1111 | 1092 | 0 | 17.8 |
| 207 | Qwen1.5-14B-Chat | 1109 | 1126 | 0 | 0 |
| 208 | GPT-3.5-Turbo-0125 | 1106 | 1124 | 0 | 23.34 |
| 209 | GPT-3.5-Turbo-0314 | 1106 | 1115 | 0 | 18.05 |
| 210 | WizardLM-70B-v1.0 | 1106 | 1071 | 0 | 0 |
| 211 | DBRX-Instruct-Preview | 1103 | 1118 | 0 | 24.63 |
| 212 | Llama-3.2-3B-Instruct | 1103 | 1080 | 0 | 0 |
| 213 | Phi-3-Small-8k-Instruct | 1102 | 1107 | 0 | 29.77 |
| 214 | Tulu-2-DPO-70B | 1099 | 1093 | 0 | 14.99 |
| 215 | Granite-3.0-8B-Instruct | 1093 | 1097 | 0 | 0 |
| 216 | Llama-2-70B-chat | 1093 | 1072 | 0 | 11.55 |
| 217 | OpenChat-3.5-0106 | 1092 | 1102 | 0 | 0 |
| 218 | Vicuna-33B | 1091 | 1067 | 0 | 8.63 |
| 219 | Snowflake Arctic Instruct | 1090 | 1077 | 0 | 17.61 |
| 220 | Starling-LM-7B-alpha | 1088 | 1080 | 0 | 12.8 |
| 221 | Gemma-1.1-7B-it | 1084 | 1084 | 0 | 0 |
| 222 | Nous-Hermes-2-Mixtral-8x7B-DPO | 1084 | 1079 | 0 | 0 |
| 223 | NV-Llama2-70B-SteerLM-Chat | 1081 | 1023 | 0 | 0 |
| 224 | pplx-70B-online | 1078 | 1028 | 0 | 0 |
| 225 | DeepSeek-LLM-67B-Chat | 1077 | 1079 | 0 | 0 |
| 226 | OpenChat-3.5 | 1076 | 1054 | 0 | 0 |
| 227 | MPT-30B-chat | 1076 | 1055 | 0 | 0 |
| 228 | Zephyr-7B-beta | 1076 | 1053 | 0 | 0 |
| 229 | Granite-3.0-2B-Instruct | 1074 | 1088 | 0 | 0 |
| 230 | OpenHermes-2.5-Mistral-7B | 1074 | 1058 | 0 | 0 |
| 231 | Codellama-34B-instruct | 1073 | 1065 | 0 | 0 |
| 232 | Mistral-7B-Instruct-v0.2 | 1072 | 1074 | 0 | 0 |
| 233 | Phi-3-Mini-4K-Instruct-June-24 | 1071 | 1082 | 0 | 0 |
| 234 | Qwen1.5-7B-Chat | 1070 | 1089 | 0 | 0 |
| 235 | GPT-3.5-Turbo-1106 | 1068 | 1095 | 0 | 0 |
| 236 | Phi-3-Mini-4k-Instruct | 1066 | 1086 | 0 | 0 |
| 237 | Llama-2-13b-chat | 1063 | 1051 | 0 | 0 |
| 238 | SOLAR-10.7B-Instruct-v1.0 | 1062 | 1047 | 0 | 0 |
| 239 | Dolphin-2.2.1-Mistral-7B | 1062 | 1024 | 0 | 0 |
| 240 | WizardLM-13b-v1.2 | 1059 | 1026 | 0 | 0 |
| 241 | Llama-3.2-1B-Instruct | 1054 | 1047 | 0 | 0 |
| 242 | Step-1o-Vision-32k (highres) | 0 | 0 | 1180 | 0 |
| 243 | Qwen2.5-VL-72B-Instruct | 0 | 0 | 1164 | 0 |
| 244 | Pixtral-Large-2411 | 0 | 0 | 1153 | 0 |
| 245 | Qwen-VL-Max-1119 | 0 | 0 | 1127 | 0 |
| 246 | Step-1V-32K | 0 | 0 | 1111 | 0 |
| 247 | Qwen2-VL-72b-Instruct | 0 | 0 | 1110 | 0 |
| 248 | Molmo-72B-0924 | 0 | 0 | 1075 | 0 |
| 249 | Pixtral-12B-2409 | 0 | 0 | 1072 | 0 |
| 250 | Llama-3.2-90B-Vision-Instruct | 0 | 0 | 1069 | 0 |
| 251 | InternVL2-26B | 0 | 0 | 1067 | 0 |
| 252 | Qwen2-VL-7B-Instruct | 0 | 0 | 1053 | 0 |
| 253 | Yi-Vision | 0 | 0 | 1045 | 0 |
| 254 | Llama-3.2-11B-Vision-Instruct | 0 | 0 | 1032 | 0 |
| 255 | Hunyuan-standard-Vision-2024-12-31 | 0 | 0 | 1064 | 0 |
| 256 | Aya-Vision-32B | 0 | 0 | 1063 | 0 |
| 257 | Qwen2.5-VL-32B-Instruct | 0 | 0 | 1213 | 0 |
| 258 | Llama-4-Scout-17B-16E-Instruct | 0 | 0 | 1151 | 0 |
| 259 | Molmo-7B-D-0924 | 0 | 0 | 1007 | 0 |
| 260 | gpt-oss-120B | 0 | 0 | 0 | 5.9 |
| 261 | gpt-oss-20B | 0 | 0 | 0 | 4.9 |
| 262 | Gemini-2.5-Flash-Lite | 0 | 0 | 0 | 4.4 |
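Disclaimer: This article comes from 猫目, and copyright belongs to the author. The content represents only the author's independent views and does not represent the position of 数字化转型网 (www.szhzxw.cn); it is reposted to share more information. If there is any infringement, please contact us.
This article was reposted by 数字化转型网 (www.szhzxw.cn) from 猫目; edited and translated by 萍水 of 数字化转型网.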

