Unlike standard Q&A style benchmarks that eventually saturate, these tests auto get harder as the models get better. Great to have these verifiable ways to measure progress toward AGI. Goal is to add 100s of games covering many aspects of intelligence, with an overall leaderboard
@demishassabis
-
Kaggle Game Arena adds werewolf poker chess benchmarks
By
–
The AI field is in need of harder benchmarks to test capabilities of the latest AI models. This update to @Kaggle Game Arena with werewolf and poker (heads-up) plus chess, gives us new objective measures of real-world skills like planning and decision making under uncertainty. https://t.co/NmXerpHodU
— Demis Hassabis (@demishassabis) 2 février 2026The AI field is in need of harder benchmarks to test capabilities of the latest AI models. This update to @Kaggle Game Arena with werewolf and poker (heads-up) plus chess, gives us new objective measures of real-world skills like planning and decision making under uncertainty.
-
Google Launches Project Genie Creative AI Tool
By
–
Huge congrats to @jparkerholder and @shlomifruchter and the Genie team + our amazing Labs and Creative Lab teams!! Access here: https://
labs.google/projectgenie Blog here: -
Genie Project: AI Simulation Games and Memory Innovation
By
–
The Genie project is very close to my heart, having started my career making AI for simulation games, and studying memory & imagination in the brain, for me it brings all those elements together. Also reminds me of the dream sequences in Inception – science fiction made real…
-

AlphaGenome Advanced Genomics Model Published in Nature
By
–
AlphaGenome is our latest & most advanced genomics model published in @Nature today including making the model & weights available to academic researchers. Can’t wait to see what the research community will do with it. Congrats to the team on our newest front cover! #AI4Science
-

Google DeepMind Opens New Singapore Office, Strengthens AI Partnership
By
–
It was wonderful to catch up with Minister @joteo_ylm as always. Singapore has a very ambitious & forward-looking approach to AI – really excited to deepen our collaboration as we open our new @GoogleDeepMind offices there – and we're hiring for the office!
-

Minister discusses AI’s potential role for India’s future
By
–
Great to meet you Minister @AshwiniVaishnaw
. Really enjoyed our discussion on AI’s incredible potential to benefit humanity & India’s important role in realising this – looking forward to continuing our conversation at the Summit! -

Isomorphic Labs Partners with J&J on AI Drug Discovery
By
–
We’re excited to be working with @JNJInnovation to accelerate the path to new medicines. This collaboration brings @IsomorphicLabs
' AI drug design engine together with J&J’s world-class drug development capabilities to tackle historically difficult to drug disease targets. A big -
TranslateGemma: Open Translation Models for 55 Languages
By
–
TranslateGemma is our new collection of open translation models for edge devices built on our amazing Gemma 3 open model. They outperform models twice their size at translation tasks across 55 languages. Excited to see what people build with it! https://t.co/cWmKMFydsQ
— Demis Hassabis (@demishassabis) 16 janvier 2026TranslateGemma is our new collection of open translation models for edge devices built on our amazing Gemma 3 open model. They outperform models twice their size at translation tasks across 55 languages. Excited to see what people build with it!
-

Gemini Personal Intelligence: AI Understanding User Data Securely
By
–
For AI to be truly useful, it needs to understand you.
— Demis Hassabis (@demishassabis) 14 janvier 2026
With Personal Intelligence, we’re beginning to solve this. With your permission, Gemini can now securely reason across your own data to answer questions that generic models simply can't – like suggesting plans based on… https://t.co/jMTGTt3MO0For AI to be truly useful, it needs to understand you. With Personal Intelligence, we’re beginning to solve this. With your permission, Gemini can now securely reason across your own data to answer questions that generic models simply can't – like suggesting plans based on