LLaMA shows strong performance and is a great contribution. Some gentle feedback: it would be great to see a data contamination analysis, since most of the benchmarks evaluated are more than two years old and BIG-Bench is omitted. I'm curious whether pre-training data contamination played a role.
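One common way to approach a contamination analysis is to look for word-level n-gram overlap between benchmark examples and the pre-training corpus. The following is only a minimal sketch of that idea, not the method used by the LLaMA authors: the function names `ngrams` and `contamination_rate`, the toy data, and the window size `n=8` are all assumptions chosen for illustration.

```python
import re


def ngrams(text, n):
    """Return the set of word-level n-grams in `text` (lowercased, punctuation dropped)."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def contamination_rate(benchmark_examples, corpus_docs, n=8):
    """Return (fraction of flagged examples, list of flagged examples).

    An example is flagged when it shares at least one n-gram with any
    corpus document. The window size n is an illustrative assumption.
    """
    corpus_ngrams = set()
    for doc in corpus_docs:
        corpus_ngrams |= ngrams(doc, n)

    flagged = [ex for ex in benchmark_examples if ngrams(ex, n) & corpus_ngrams]
    rate = len(flagged) / len(benchmark_examples) if benchmark_examples else 0.0
    return rate, flagged


# Toy data (hypothetical): one corpus document and two benchmark questions.
corpus = ["the quick brown fox jumps over the lazy dog near the river bank today"]
benchmark = [
    "Q: the quick brown fox jumps over the lazy dog near what?",
    "What is the capital of France?",
]

rate, flagged = contamination_rate(benchmark, corpus, n=8)
print(rate, flagged)
```

A real analysis would need to stream a corpus far too large to hold as an in-memory set, and would typically report benchmark scores separately on the flagged and clean subsets, so that any gap between them can be attributed to overlap.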
LLaMA Performance Analysis: Data Contamination and Benchmark Concerns