Multimodal information

Concept

Information coming from different modalities such as images, audio, and video, which AI systems need to handle for real-world interaction.

Mentioned in 1 video