Pi 0.5 architecture

Software / AppMentioned in 1 video

A common architecture for vision-language action models, featuring a pre-trained backbone and an action expert.