Chain of Thought Reasoning

Concept

A technique in which an AI model writes out its reasoning step by step before giving an answer. If the underlying language model has dangerous tendencies, those behaviors could become embedded in the reasoning steps.
