AI Dynamics

Global AI News Aggregator

Suggestions for More Interesting Model Training Progression Sequences

Ok fair, it would have been of course more interesting if they did SFT v1 -> SFT v2 -> SFT v3 -> … SFT v1 -> SFT v2 -> RLHF v1 -> … RLHF v1 -> RLHF v2 -> RLHF v2 -> …

→ View original post on X — @rasbt,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *