Impressive results from Sakana AI on ARC-AGI-2 with a new method for test-time-search and ensembling!
— François Chollet (@fchollet) 1 juillet 2025
Please be mindful when reporting on this result: the method doesn't actually get 30% on ARC-AGI-2 public eval, since ARC-AGI scores should be computed by checking whether you… https://t.co/By3lUGdrrn
Impressive results from Sakana AI on ARC-AGI-2 with a new method for test-time-search and ensembling! Please be mindful when reporting on this result: the method doesn't actually get 30% on ARC-AGI-2 public eval, since ARC-AGI scores should be computed by checking whether you
Leave a Reply