ThunderKid

Software / AppMentioned in 1 video

A CUDA library developed by the speakers that breaks down compute operations into matrix multiplications, aiming for efficient hardware-model co-design on modern GPUs.