GPT-2 Small
Software / AppMentioned in 1 video
A language model that is well-understood by interpretability researchers and has had sparse autoencoders trained on it.
A language model that is well-understood by interpretability researchers and has had sparse autoencoders trained on it.