SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Chu et al.: https://arxiv.org/abs/2501.17161
#ArtificialIntelligence #DeepLearning #MachineLearning
