MMMU
Study / Research
Mentioned in 1 video
A benchmark of vision-plus-reasoning tasks on which o1 (preview) scored 78.2%, competitive with human experts.