bcb-0028
lib-knowledge · deterministic-tests · seed tier 2 · published
Best result per model
| # | Model | Score | Tests passed | Run (quantization · memory · status) |
|---|---|---|---|---|
| 1 | qwen3-coder | 1.000 | 6/6 | UD-Q4_K_XL · 24 GB · runner verified |
| 2 | qwen3-coder-next | 1.000 | 6/6 | UD-Q4_K_XL · 24 GB · runner verified |
| 3 | phi-4-mini | 0.833 | 5/6 | Q6_K · 24 GB · runner verified |
3 models attempted.
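
The Score values in the table are consistent with the fraction of tests passed, shown to three decimals (6/6 gives 1.000, 5/6 gives 0.833). The sketch below works under that assumption. The function name `pass_rate` is hypothetical and is not taken from the benchmark runner.

```python
def pass_rate(passed: int, total: int) -> str:
    """Format passed/total as a three-decimal score string.

    Assumption: the leaderboard score is simply tests passed divided by
    tests run. This is inferred from the table, not from the runner itself.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= passed <= total:
        raise ValueError("passed must be between 0 and total")
    return f"{passed / total:.3f}"


# Rows from the table above: (model, tests passed, tests run)
rows = [
    ("qwen3-coder", 6, 6),
    ("qwen3-coder-next", 6, 6),
    ("phi-4-mini", 5, 6),
]

for model, passed, total in rows:
    print(f"{model}: {pass_rate(passed, total)} ({passed}/{total})")
```

If the runner weights tests unequally or awards partial credit, the mapping would differ. The three rows here do not rule that out.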