Note that the v1 semi-private eval set is about the same level of difficulty as the v1 public eval set. The fully private eval set (used for the 2020 and 2024 Kaggle competitions) is estimated to be more difficult. ARC-AGI-2 does not have this issue — all sets were calibrated
ARC-AGI v1 eval sets difficulty calibration and v2 improvements
By
–
Leave a Reply