Healthbench
Study / Research
An open-source dataset released by OpenAI's Karan and other researchers, containing realistic healthcare tasks designed to evaluate AI models beyond traditional medical exams.
Mentioned in 1 video
An open-source dataset released by OpenAI's Karan and other researchers, containing realistic healthcare tasks designed to evaluate AI models beyond traditional medical exams.