Finally got a chance to spend some time with the 77-page Llama 2 paper (https://arxiv.org/abs/2307.09288). Here are some takeaways at a glance. Appreciate that someone finally did a comprehensive supervised finetuning vs RLHF evaluation! (Lower right)
Llama 2 Paper Analysis: Supervised Finetuning vs RLHF Evaluation