MMLLU

Study / Research

A benchmark where 01 (preview) scored 78.2% on a vision plus reasoning task, competitive with human experts.

Mentioned in 1 video