sparse autoencoder

Software / App

A small model used by Anthropic to isolate and map patterns within the activations of LLM neurons.

Mentioned in 1 video