OpenAssistant's shift from RLHF to supervised instruction-finetuning

AI Dynamics

Global AI News Aggregator

OpenAssistant’s shift from RLHF to supervised instruction-finetuning

–

29 June 2023 23h11

So on that note, OpenAssistant's earlier models used RLHF for instruction-finetuning (
https://
huggingface.co/OpenAssistant/
oasst-rlhf-2-llama-30b-7k-steps-xor
…); The later one seem to use supervised instruction-finetuning. Maybe @ykilcher has some insights whether RLHF wasn't worth the effort vs supervised?

→ View original post on X — @rasbt,

29 June 2023

AI GENERATIVE AI INNOVATION LLMS MACHINE LEARNING OPEN SOURCE RESEARCH

AI Dynamics

OpenAssistant’s shift from RLHF to supervised instruction-finetuning

Commentaires

Leave a Reply Cancel reply

MORE ARTICLES

AI Generates Perfect Jokes Using Image Generation Skills

Codex App Transformation: Atlas Integration Reshapes User Experience

AI File Access Limitations: Screenshot vs Disk Storage Issues

Synthetic Aperture Radar: Satellite Tech for Global Monitoring