I highly recommend taking a quick scan of the paper; you would love it. They show some great comparisons between the base model outputs and the RLHF outputs, from generating Shakespeare to demonstrating visual imagination. Here are some examples just from the first 10 pages:
Paper Recommendations: RLHF Model Outputs and Comparisons