Jeremy Bernstein

Person

A researcher who has worked on Muon and proposed ideas about layer-specific learning rates and optimizers.

Mentioned in 1 video