FireAttention

Software / App

A custom kernel developed by Fireworks AI, primarily for language models, designed to improve performance, particularly under high concurrency.
