VIMA 0.5

Software / App
Mentioned in 1 video

A vision-language-action (VLA) model used as a baseline for comparison against the Size Zero model.