MMMU

Study / Research

Mentioned in 1 video

A benchmark of combined vision and reasoning tasks on which o1 (preview) scored 78.2%, competitive with human experts.