Category: Analytics
Measurement, analysis, and insight generation for product decisions.
Level: Intermediate
People Skill
Choose metrics that reflect model usefulness: task success, quality, safety, and latency.
Why It Matters
Use these metrics to tell whether a change actually improved the product.
Essential: Yes
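As a rough illustration of the four metric families named above (task success, quality, safety, latency), the sketch below aggregates them from a list of evaluation records. Everything here is an assumption made for illustration: the `EvalRecord` fields, the 1-5 quality rubric, the choice of a flag rate for safety, and the nearest-rank 95th percentile for latency are not taken from the original resource.

```python
import math
from dataclasses import dataclass


@dataclass
class EvalRecord:
    """One evaluated model response (hypothetical schema)."""
    task_succeeded: bool   # did the response complete the user's task?
    quality_score: float   # assumed 1-5 rubric score from a reviewer
    safety_flagged: bool   # did a safety check flag the response?
    latency_ms: float      # end-to-end response time in milliseconds


def summarize(records):
    """Aggregate the four metric families over a batch of records."""
    if not records:
        raise ValueError("need at least one record")
    n = len(records)
    latencies = sorted(r.latency_ms for r in records)
    # Nearest-rank percentile: smallest value with at least 95% of data at or below it.
    p95_index = math.ceil(0.95 * n) - 1
    return {
        "task_success_rate": sum(r.task_succeeded for r in records) / n,
        "mean_quality": sum(r.quality_score for r in records) / n,
        "safety_flag_rate": sum(r.safety_flagged for r in records) / n,
        "p95_latency_ms": latencies[p95_index],
    }
```

Running `summarize` on a batch collected before a change and on one collected after it gives two comparable dictionaries, which is one simple way to check whether the product actually improved.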
LLM Evaluation Skill (ProductBuilders)
Product Analytics Skill (ProductBuilders)
Reforge Programs (Reforge)
SQLBolt (SQLBolt)