SOTA sweep:
HealthBench 65.1 / Hard 44.4; hallucination rate 3.5% (lower than ChatGPT); ScanBench #1 across all stations: 74.9 / 72.1 / 74.4
Baichuan AI Achieves State-of-the-Art Results Across Multiple Benchmarks