Lottery Ticket Hypothesis
Concept · Mentioned in 1 video
A hypothesis stating that dense, randomly initialized neural networks contain smaller subnetworks ("winning tickets") that, when trained in isolation from their original initial weights, can reach accuracy comparable to the original dense network. Cerebras leverages this idea for parameter pruning.
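As a rough illustration of the idea, the sketch below runs one round of magnitude pruning with a rewind to the initial weights. It uses a toy linear model in NumPy rather than a real neural network, and the names `magnitude_mask` and `train`, the 50% sparsity level, and the synthetic data are all assumptions made for this example. It is not a description of how Cerebras implements pruning.

```python
import numpy as np

def magnitude_mask(weights, sparsity):
    """Return a 0/1 mask that zeroes the smallest-magnitude weights."""
    k = int(round(sparsity * weights.size))  # number of weights to prune
    mask = np.ones(weights.size)
    if k > 0:
        # Flat indices of the k smallest-magnitude entries
        prune_idx = np.argsort(np.abs(weights), axis=None)[:k]
        mask[prune_idx] = 0.0
    return mask.reshape(weights.shape)

def train(w, X, y, mask, lr=0.1, steps=200):
    """Gradient descent on mean squared error; masked weights stay at zero."""
    w = w * mask
    for _ in range(steps):
        grad = 2.0 * X.T @ (X @ w - y) / len(y)
        w = (w - lr * grad) * mask
    return w

# Synthetic regression problem (assumed for illustration):
# only the first 5 of 20 features actually matter.
rng = np.random.default_rng(0)
X = rng.normal(size=(256, 20))
true_w = np.zeros(20)
true_w[:5] = rng.normal(size=5)
y = X @ true_w

w_init = rng.normal(scale=0.1, size=20)   # saved initialization

# 1. Train the dense model.
w_dense = train(w_init, X, y, np.ones(20))

# 2. Prune: keep the half of the weights with the largest magnitude.
mask = magnitude_mask(w_dense, sparsity=0.5)

# 3. Rewind the surviving weights to their initial values and retrain.
w_ticket = train(w_init, X, y, mask)

dense_loss = float(np.mean((X @ w_dense - y) ** 2))
ticket_loss = float(np.mean((X @ w_ticket - y) ** 2))
print(f"dense loss:  {dense_loss:.6f}")
print(f"ticket loss: {ticket_loss:.6f} using {int(mask.sum())} of {mask.size} weights")
```

Comparing the two printed losses shows whether the sparse subnetwork, retrained from the same starting point, keeps up with the dense model on this toy problem. In practice the pruning step is often repeated over several rounds, removing a fraction of the remaining weights each time.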
![[Paper Club] Weight Streaming on Wafer-Scale Clusters (w/ Sarah Chieng of Cerebras)](https://i.ytimg.com/vi/eNKe04apEaE/maxresdefault.jpg)