f

flash attention

Software / AppMentioned in 1 video

An optimized implementation of transformer attention, which Manifest AI's Vidril framework can match or outperform, especially on non-standard problem shapes.