Chinchilla scaling laws

Concept

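The Chinchilla scaling laws (Hoffmann et al., 2022) showed that, for a fixed training compute budget, a language model is compute-optimal when its parameter count and its number of training tokens are scaled up together in roughly equal proportion, rather than by growing the parameter count alone.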
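As a rough illustration of what scaling both quantities together means, the sketch below splits a compute budget between parameters and tokens. It rests on two commonly quoted approximations that are assumptions here, not statements from this page: training compute is about 6 FLOPs per parameter per token (C ≈ 6·N·D), and the compute-optimal ratio is about 20 training tokens per parameter. The function name and constants are hypothetical, chosen only for this example.

```python
import math

# Assumption: training cost is roughly 6 FLOPs per parameter per token (C ~ 6 * N * D).
FLOPS_PER_PARAM_TOKEN = 6.0
# Assumption: rule-of-thumb compute-optimal ratio of about 20 tokens per parameter.
TOKENS_PER_PARAM = 20.0


def compute_optimal_allocation(compute_flops, tokens_per_param=TOKENS_PER_PARAM):
    """Return (n_params, n_tokens) for a given training compute budget in FLOPs.

    Solves C = 6 * N * D together with D = tokens_per_param * N, which gives
    N = sqrt(C / (6 * tokens_per_param)).
    """
    if compute_flops <= 0:
        raise ValueError("compute_flops must be positive")
    n_params = math.sqrt(compute_flops / (FLOPS_PER_PARAM_TOKEN * tokens_per_param))
    n_tokens = tokens_per_param * n_params
    return n_params, n_tokens


if __name__ == "__main__":
    budget = 5.88e23  # FLOPs, an illustrative budget
    params, tokens = compute_optimal_allocation(budget)
    print(f"parameters: {params:.3e}, tokens: {tokens:.3e}")
```

Under these assumptions, a budget of about 5.88e23 FLOPs yields roughly 70 billion parameters and 1.4 trillion tokens, which is close to the configuration of the Chinchilla model itself. Because both quantities grow with the square root of compute, quadrupling the budget doubles each of them.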
