NVIDIA CUTLASS
Software / App
An open-source CUDA C++ template library from NVIDIA that provides primitives for efficient matrix multiplication and memory loading on GPUs. FlashAttention-2 is built on top of it.
Mentioned in 1 video