So fine-tuning the GPT-3 model with RLHF (which we will look at later) results in InstructGPT. InstructGPT is much better at following instructions than GPT-3. Compare the example below to see how GPT-3 and InstructGPT answer a question.
Fine-tuning GPT-3 with RLHF to Create InstructGPT
