MLE-bench

Tool / Product
Mentioned in 1 video

A machine learning engineering benchmark, cited in the Deep Research or GPT-4o system card, that measures progress toward model self-improvement.