LLaVA 1.5

Software / App

An evolution of LLaVA with improved handling of multiple images and video, using SigLIP as the vision encoder.

Mentioned in 1 video