Post-training involves much less compute and faster iteration cycles than pre-training. With post-training, you can adapt the model to user preferences, craft the model personality, or introduce safety behavior, says @barret_zoph
.
Post-Training: Adapting AI Models for User Preferences and Safety
By
–
Leave a Reply