HealthBench

Study / Research

Mentioned in 1 video

An open-source dataset released by Karan Singhal and other researchers at OpenAI, containing realistic healthcare tasks designed to evaluate AI models beyond traditional medical exams.