People are commenting that it's normal to train on the train set, but somehow I would have expected (or hoped) that, as we near AGI-level capabilities, we would not need to fine-tune or specifically train the model on any particular downstream task. At most, a few few-shot examples should be enough.
AGI-Level Models May Not Need Task-Specific Fine-Tuning