Sleep Time Compute

Concept

The paper's core concept: scaling LLM compute during idle periods between queries, that is, after training but outside test time.

Mentioned in 1 video