AI Dynamics

Global AI News Aggregator

About

Pre-training with Feedback: Computational Cost Concerns

This would be sad if pre-training with feedback is actually better, because pre-training is by far the most expensive part of the training, and you wouldn't want to re-train from scratch every time you update your reward model.

→ View original post on X — @guillaumelample