Issue 01 ยท Pipeline Inspector
Overview
An LLM benchmark pipeline: generate cases, build rubrics, run models, score responses. Walk the four steps in order, then dig into stats and analysis.
Cases
-
Pipeline progress
- / 4
Up next
GenerateThe pipeline
4 steps
Departments
Analysis & reference