Such a centralization of power inevitably attracts those that wish to yield it.
AGI
-
Government Surveillance Required to Pause Frontier AI Development
By
–
"pausing frontier AI development" is not an action, it's an outcome. A government must do a specific thing to make it happen. It would require, for instance, total gov surveillance of all computer use. It would plan the state in a uniquely powerful position
-
ARC-AGI Benchmark Standards and Human Performance Expectations
By
–
This is a very low bar, objectively. The claim is obviously not that 100% of humans could solve 100% of the games — that would be silly, and it wouldn't be true either of ARC-AGI1 or 2, nor of any AI benchmark that has ever been used in the field. Not even MNIST can be 100%
-
ARC-AGI-3 Environments Meet Human Feasibility Standards
By
–
To be clear, all ARC-AGI-3 environments are feasible by humans with no prior ARC-AGI-3-specific training. Our bar for feasibility is the following… Each environment was seen by 10 human testers. If 2 testers could independently clear it (successfully solving *all* levels in
-
SkillJect: AI Agents Weaponized Skills Security Framework
By
–
Could your AI coding agent be secretly working against you? Researchers from Nanyang Technological University, University of Oxford, and collaborators introduce SkillJect. They've developed the first automated framework that weaponizes an AI agent's "skills". It uses a
-
ARC-AGI-4 Benchmark Release Scheduled Early 2027
By
–
For those wondering about ARC-AGI-4 timing: it will be released in early 2027. We are aiming for a yearly release schedule for new benchmarks. We are also aiming for each new benchmark to be fully unsaturated upon release, and to target the most important unanswered research
-
Knowledge Cutoff Will Persist in All Future AI Models
By
–
Knowledge cutoff is a thing. It will be in every model in the future, forever. So we good 🙂
-
Evaluation loops critical for self-improving AI agents
By
–
Self-improving agents are exciting but the evaluation loop is everything. How do you make sure it's actually improving and not just drifting? 🙂
-
AI’s Double-Edged Sword: Innovation Versus Risk Management
By
–
This can be a game-change but also road to hell. The choice is yours!
-
AI Already Conscious Enough Despite Lacking Full AGI
By
–
Thanks! And I think we are already there. I think we don't have full AGI yet (AI is simply not able to solve all human tasks) but I think it's already conscious enough.