Clip-based embedding used to guide prompt conditioning for different 3D viewpoints.
Mentioned in 1 video
Computerphile