AI Dynamics

Global AI News Aggregator

Pre-training with Feedback: Computational Cost Concerns

This would be sad if pre-training with feedback is actually better, because pre-training is by far the most expensive part of the training, and you wouldn't want to re-train from scratch every time you update your reward model.

→ View original post on X — @guillaumelample,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *