LLM as a Judge
ConceptMentioned in 1 video
A technique where a large language model is used to evaluate the outputs of another AI model, a topic discussed extensively in the context of AI evals.
A technique where a large language model is used to evaluate the outputs of another AI model, a topic discussed extensively in the context of AI evals.