This is the opposite imo: they claimed good scores on three benchmarks and that's it. No grandiose claims in the model card or the paper. The team made a valuable contribution, let's just not overhype it.
Honest Model Evaluation Over Hype in AI Research
By
–