Isn’t GPQA a 4 option multiple choice test? Apple does no better than chance (as do other small models)
Apple AI Models Perform at Chance Level on GPQA Benchmark
By
–
Global AI News Aggregator
By
–
Isn’t GPQA a 4 option multiple choice test? Apple does no better than chance (as do other small models)
Leave a Reply