Step1: They have prepared a dataset of human-written answers to the prompts and used that to train the model which is called Supervised Fine-Tuning (SFT) model. The model used here for fine-tuning is the Instruct GPT(GPT-3.5) 5/9
Supervised Fine-Tuning GPT-3.5 with Human-Written Answers
By
–
Leave a Reply