CarperAI is doing great work lowering the barrier for RLHF training (i.e. training ChatGPT-like models). The latest release of their trlX library includes this great example, showing how to train RLHF models at scale with an open-source dataset! https://
x.com/synth_labs/sta
/synth_labs/status/1628544125120430081
…
CarperAI trlX enables open-source RLHF model training at scale
By
–
Leave a Reply