benchmark scaling laws
Concept · Mentioned in 1 video
A key innovation in the Llama 3 paper: scaling laws that predict downstream benchmark performance from the training compute budget, measured in FLOPs.
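As a rough illustration of the general idea, and not the paper's actual procedure, one can fit a power law to results from small training runs and extrapolate it to a larger compute budget. The power-law form, the function names `fit_power_law` and `predict_loss`, and every data point below are assumptions made up for this sketch.

```python
import numpy as np


def fit_power_law(flops, loss):
    """Fit loss ~= a * flops**(-b) by least squares in log-log space.

    Hypothetical helper for illustration; returns the tuple (a, b).
    """
    # A power law is a straight line in log-log space:
    # log(loss) = log(a) - b * log(flops)
    slope, intercept = np.polyfit(np.log(flops), np.log(loss), deg=1)
    return float(np.exp(intercept)), float(-slope)


def predict_loss(flops, a, b):
    """Evaluate the fitted power law at a new compute budget."""
    return a * np.asarray(flops, dtype=float) ** (-b)


# Synthetic points (not real measurements): compute budgets of four
# small runs and the benchmark loss each one reached.
small_run_flops = np.array([1e18, 1e19, 1e20, 1e21])
small_run_loss = 50.0 * small_run_flops ** (-0.08)

a, b = fit_power_law(small_run_flops, small_run_loss)

# Extrapolate to a budget far larger than any of the fitted runs.
predicted = float(predict_loss(1e24, a, b))
print(f"a={a:.3f}, b={b:.3f}, predicted loss at 1e24 FLOPs: {predicted:.4f}")
```

A fit like this predicts a loss-style quantity; turning that into a benchmark accuracy would need a second fitted mapping, which this sketch leaves out.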