AMv2

Software / App

A model that simplifies combining image and pixel tokens by autoregressively predicting image tokens and training on the mean squared error of their reconstruction.
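As a rough illustration of what an autoregressive mean squared error objective over image tokens can look like, here is a minimal sketch. It is not AMv2's actual implementation: the function names `autoregressive_mse` and `last_patch_predictor` are hypothetical, each image token is assumed to be a plain list of floats, and the predictor is a trivial stand-in for a trained model.

```python
def autoregressive_mse(patches, predict_next):
    """Average squared error when each patch is predicted from the patches before it.

    patches: list of equal-length lists of floats (one per image token).
    predict_next: callable taking the prefix of patches and returning a predicted patch.
    """
    if len(patches) < 2:
        raise ValueError("need at least two patches to form a prediction target")
    total, count = 0.0, 0
    for t in range(1, len(patches)):
        pred = predict_next(patches[:t])   # the model only sees earlier patches
        target = patches[t]                # the patch it has to reconstruct
        for p, y in zip(pred, target):
            total += (p - y) ** 2
            count += 1
    return total / count


def last_patch_predictor(prefix):
    # Hypothetical stand-in for a trained decoder: repeat the most recent patch.
    return prefix[-1]


if __name__ == "__main__":
    patches = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]
    print(autoregressive_mse(patches, last_patch_predictor))
```

In a real system the predictor would be a learned network and this loss would be minimized by gradient descent; the sketch only shows how the prediction targets and the error are formed.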

Mentioned in 1 video