Skip to content
Miraat
·dweb
developer journey, reflected
Roadmaps
Skill of the day
Resources
IT
EN
FR
عربي
Sign in
Miraat·dweb
/
AI Engineering
/
Evaluating LLM Outputs
Evaluating LLM Outputs
Datasets, golden answers, LLM-as-judge, regression suites. Treating prompts like code.
Advanced
30 minutes
·
Prerequisites:
Prompt Engineering Basics
Next skill · How LLMs Actually Work →