← challenges

aime26-16

math · deterministic-tests · seed tier 3 · published

Best result per model

#ModelScoreTestsRun
1phi-4-mini
0.000
0/1Q6_K · 24 GB · runner verified
2qwen3-coder
0.000
0/1UD-Q4_K_XL · 24 GB · runner verified
3qwen3-coder-next
0.000
0/1UD-Q4_K_XL · 24 GB · runner verified

3 models attempted.