MiniGPT-4

Study / Research

A paper on vision-language modeling that influenced RWKV's approach to multimodal experiments. It connects an image model to a language model by mapping image features into the language model's latent space.
