AI Dynamics

Global AI News Aggregator

About

Discussion on RLHF vs post-training model comparisons

You’re not comparing RLHF to no RLHF here, you’re comparing different generations post-RLHF mystery models, likely of different sizes. If you don’t do post-training at all, naive attempts to talk to the model go like this:

→ View original post on X — @goodside