Chain of Thought Reasoning
ConceptMentioned in 1 video
A method where AIs lay out their reasoning process step-by-step, which could potentially embed dangerous behaviors if the underlying language model has such tendencies.
A method where AIs lay out their reasoning process step-by-step, which could potentially embed dangerous behaviors if the underlying language model has such tendencies.