H
Healthbench
Study / ResearchMentioned in 1 video
An open-source dataset released by OpenAI's Karan and other researchers, containing realistic healthcare tasks designed to evaluate AI models beyond traditional medical exams.
An open-source dataset released by OpenAI's Karan and other researchers, containing realistic healthcare tasks designed to evaluate AI models beyond traditional medical exams.