Alignment Tuning

Concept

A tuning process intended to align a model's behavior with human values such as helpfulness, honesty, and harmlessness.
