MemotivaLLM Engineer Interview Questions: LLM Evaluation, Hallucinations, Guardrails, Production Monitoring

What is LLM-as-judge?

LLM Engineer Interview Questions: LLM Evaluation, Hallucinations, Guardrails, Production Monitoring

Audio flashcard · 0:21

Nortren·

What is LLM-as-judge?

0:21

LLM-as-judge is an evaluation approach where you use a strong language model to score the outputs of another model against criteria like helpfulness, accuracy, or relevance. It scales much better than human evaluation but inherits the judge's biases and blind spots. Common practice is to use a stronger model than the one being evaluated, and to validate the judge against human ratings on a sample.
arxiv.org