BackendBench Evals Evaluations on the BackendBench verifiers environment siro1/backendbench-openai-gpt-5.2 Viewer β’ Updated 18 days ago β’ 408 β’ 17 siro1/backendbench-openai-gpt-oss-120b Viewer β’ Updated 18 days ago β’ 408 β’ 11 siro1/backendbench-qwen-qwen3-coder Viewer β’ Updated 18 days ago β’ 408 β’ 15 siro1/backendbench-z-ai-glm-4.5-air Viewer β’ Updated 17 days ago β’ 408 β’ 14
BackendBench Evals Evaluations on the BackendBench verifiers environment siro1/backendbench-openai-gpt-5.2 Viewer β’ Updated 18 days ago β’ 408 β’ 17 siro1/backendbench-openai-gpt-oss-120b Viewer β’ Updated 18 days ago β’ 408 β’ 11 siro1/backendbench-qwen-qwen3-coder Viewer β’ Updated 18 days ago β’ 408 β’ 15 siro1/backendbench-z-ai-glm-4.5-air Viewer β’ Updated 17 days ago β’ 408 β’ 14
siro1/Qwen-1.5B-redistill-32B-36k-upgraded-8bs-50clip-larger Text Generation β’ 2B β’ Updated Feb 6, 2025 β’ 5
siro1/Qwen-1.5B-redistill-32B-30k-upgraded-8bs-50clip Text Generation β’ 2B β’ Updated Feb 6, 2025 β’ 8
siro1/meta-llama-Llama-4-Scout-17B-16E-Instruct-nt4-T0.7-H100 Viewer β’ Updated Oct 13, 2025 β’ 408 β’ 7