continuous batching

ConceptMentioned in 1 video

An optimization technique for LLM serving, allowing multiple requests to be processed efficiently.

Videos Mentioning continuous batching

Latent Space

An optimization technique for LLM serving, allowing multiple requests to be processed efficiently.