This is a very low bar, objectively. The claim is obviously not that 100% of humans could solve 100% of the games — that would be silly, and it wouldn't be true either of ARC-AGI1 or 2, nor of any AI benchmark that has ever been used in the field. Not even MNIST can be 100%
ARC-AGI Benchmark Standards and Human Performance Expectations
By
–
Leave a Reply