Alignment
Training models to behave according to human preferences and avoid inappropriate responses also called **Preference Training**
Training models to behave according to human preferences and avoid inappropriate responses also called **Preference Training**