AI Dynamics

Global AI News Aggregator

About

Fine-tuning GPT-3 with RLHF to Create InstructGPT

So fine-tuning the GPT-3 model using the RLHF method(which we will look at later) results in Instruct GPT. Instruct GPT is much better at following instructions than GPT-3 Compare the example below on how GPT3 & InstructGPT answer a question. 2/9

→ View original post on X — @sumanth_077