Multi-modal data embedding (MMDIT)

ConceptMentioned in 1 video

An innovation in action expert training that improves feature mixing between visual and action features, yielding significant performance boosts.