AI Dynamics

Global AI News Aggregator

Model Training with SFT Using Claude and GPT-4o-mini

When I published this post I hadn't yet come across Trip's own writeup of the project, which goes into far more detail about how he trained the model, including SFT (supervised fine-tuning) against synthetic chat examples created using Claude Haiku and GPT-4o-mini.

→ View original post on X (@simonw)
