MMLLU
Study / Research
A benchmark where 01 (preview) scored 78.2% on a vision plus reasoning task, competitive with human experts.
Mentioned in 1 video
A benchmark where 01 (preview) scored 78.2% on a vision plus reasoning task, competitive with human experts.