DINOv2

Software / AppMentioned in 1 video

A self-supervised foundation model trained purely on image data, which learns fine-grained visual features and is used to identify images difficult for LLMs.