High alpha trick to improve your fine-tuned models: Step 1: fine-tune your model on the data you have, do RLHF, whatever you want to do
Step 2: choose some of the best examples you have, and few-shot your model with those examples to generate more
Step 3: train on those!
Improve Fine-Tuned Models With Few-Shot Data Generation
By
–
Leave a Reply