Fireworks.ai

Company

An inference service provider that experienced a 7x increase in token speeds (from 700 to 5,000 tokens/second) after updating to NVIDIA's optimized software.

Mentioned in 1 video