py-13-windowed-aggregator
architecture · deterministic-tests · seed tier 5 · published
Best result per model
| # | Model | Score | Tests | Run |
|---|---|---|---|---|
| 1 | qwen3-coder | 1.000 | 13/13 | UD-Q4_K_XL · 24 GB · runner verified |
| 2 | qwen3-coder-next | 1.000 | 13/13 | UD-Q4_K_XL · 24 GB · runner verified |
| 3 | phi-4-mini | 0.846 | 11/13 | Q6_K · 24 GB · runner verified |
3 models attempted.