Fireworks.ai
Company
An inference service provider whose token throughput rose roughly 7x (from 700 to 5,000 tokens/second) after updating to NVIDIA's optimized software.
Mentioned in 1 video