Multimodal Diffusion Transformer

Concept

A variant of the Diffusion Transformer that treats input text as a standalone modality, injected directly rather than as an afterthought via modulation.

Mentioned in 2 videos

Save the 2 videos on Multimodal Diffusion Transformer to your own pod.

Sign up free to keep building your knowledge base on Multimodal Diffusion Transformer as more episodes are added.

Get Started Free