Enjoyed this extremely comprehensive study on predicting language model performance http://
arxiv.org/abs/2405.10938. Found many insightful nuggets:
– In a single model family there usually aren't that many model sizes, which hinders predictive power. However, there are many model
Comprehensive Study on Predicting Language Model Performance
By
–
Leave a Reply