Scaling Laws for Neural Language Models

Book

A paper discussed that explores how model performance scales with compute, data, and parameters, and introduces concepts like inverse scaling.

Mentioned in 2 videos