Lack of standardization in the benchmarks. To be fair, MMLU is not that bad compared to many other evals
MMLU Benchmark Standardization Issues in AI Evaluation
By
–
Global AI News Aggregator
By
–
Lack of standardization in the benchmarks. To be fair, MMLU is not that bad compared to many other evals
Leave a Reply