That's a valid point. The team tried hard to decontaminate the data. Also note that there's generalization into more challenging benchmarks such as HiddenMath and IMO-Bench.
Data Decontamination and Generalization in Advanced Math Benchmarks
By
–
Global AI News Aggregator
By
–
That's a valid point. The team tried hard to decontaminate the data. Also note that there's generalization into more challenging benchmarks such as HiddenMath and IMO-Bench.
Leave a Reply