Pipeline Dashboard

Overview

Pipeline

1Generate
2Rubric
3Run
4Score

Lab

Runs
Compare
Costs
Autoimprove
Plotting

Analysis

Stats
Analysis
Experiments

Reference

Skills
Tutorials

Score

LLM scores each response against the rubric

Case

No data for this case. Run step 4:

uv run python scripts/04_score.py

Step 4 of 4