M

Mechanistic Interpretability

ConceptMentioned in 2 videos

A field of study that involves interpreting the internal workings and decision-making processes of AI models.