LLM as a Judge

ConceptMentioned in 1 video

A technique where a large language model is used to evaluate the outputs of another AI model, a topic discussed extensively in the context of AI evals.