py-05-calc
algorithms · deterministic-tests · seed tier 5 · published
Best result per model
| # | Model | Score | Tests | Run |
|---|---|---|---|---|
| 1 | qwen3-coder-next | 1.000 | 10/10 | UD-Q4_K_XL · 24 GB · runner verified |
| 2 | qwen3-coder | 0.900 | 9/10 | UD-Q4_K_XL · 24 GB · runner verified |
| 3 | phi-4-mini | 0.100 | 1/10 | Q6_K · 24 GB · runner verified |
3 models attempted.