TensorRT
Software / App
Mentioned as an inference solution that can be integrated with Llama Stack.
Mentioned in 4 videos
Videos Mentioning TensorRT
[Paper Club] Writing in the Margins: Chunked Prefill KV Caching for Long Context Retrieval
Latent Space
Mentioned as an example of an inference engine framework.

A Brief History of the Open Source AI Hacker - with Ben Firshman of Replicate
Latent Space
An NVIDIA library used by Replicate for optimizing and deploying deep learning models, particularly for inference.

Chris Lattner: Compilers, LLVM, Swift, TPU, and ML Accelerators | Lex Fridman Podcast #21
Lex Fridman
NVIDIA's SDK for high-performance deep learning inference, discussed as a hardware-specific compiler that integrates with MLIR.

AI Dev 25 | Amit Sangani: Unlock the Power of Open Source with Llama
DeepLearningAI
Mentioned as an inference solution that can be integrated with Llama Stack.