NVIDIA Cutlass

Software / App

A library from NVIDIA that provides primitives for efficient matrix multiplication and memory loading on GPUs, used as a base for Flash Attention 2.

Mentioned in 1 video