Also on your thought in the line "Agents are not solved yet, they'll come when AGI comes" in your blog post, 100% agree:
When any system reaches 90% on GAIA benchmark (hard general tasks up to 1hour long, cc @clefourrier @Thom_Wolf @ThomasScialom
), that means that we have really
@aymericroucher
-
Discussion on AI Agents and AGI Progress via GAIA Benchmark
By
–
-

Huggingface’s Smolagents Library Gains Traction for AI Agents
By
–
Since I published it on GitHub a few days ago, @huggingface
's new agentic library smolagents has gathered nearly 4k stars But we are just getting started on agents: so we are hiring an ML Engineer to join me and double down on this effort! The plan is to build GUI agents: -

Code as Agentic Actions for LLMs
By
–
This is a really important part, thank you for highlighting this @sloppenheimer
! Code (indeed many other languages than python could work) is just the better, overlooked version of writing agentic actions for LLMs -

Hugging Face Releases smolagents Library for AI Agent Development
By
–
For months, we've worked on building @huggingface
's new moonshot: agentic systems. So today we're very proud to announce the release of 𝚜𝚖𝚘𝚕𝚊𝚐𝚎𝚗𝚝𝚜! It's the simplest library we could make to let people build powerful agents. The main logic for agents fits in ~1000 -
Definition of AI Agents Involves Tool Use and Multistep Memory
By
–
@minimaxir i'd say agents are tool use + multistep (multistep means a memory that logs past steps ans errors properly)
-

Hugging Face Releases Picotron for Efficient LLM Training Parallelization
By
–
> Hugging Face releases Picotron, a microscopic lib that solves LLM training 4D parallelization Llama-3.1-405B took 39 million GPU-hours to train, which represents 4.5 thousand years. If they had needed all this time, we would have GPU stories from the time of Pharaoh
-

New LLM Research Shows Tokenizers Can Be Removed
By
–
Potential paradigm shift in LLMs: new paper by @AIatMeta shows that we can get rid of tokenizers! Current LLMs process text by first splitting it into tokens. They use a module named "tokenizer", that -spl-it-s- th-e- te-xt- in-to- arbitrary tokens depending on a fixed
-

Google Releases Gemini 2.0 AI Model with Agentic Capabilities
By
–
𝗚𝗼𝗼𝗴𝗹𝗲 𝗿𝗲𝗹𝗲𝗮𝘀𝗲𝘀 𝗚𝗲𝗺𝗶𝗻𝗶 𝟮.𝟬, 𝘀𝘁𝗮𝗿𝘁𝗶𝗻𝗴 𝘄𝗶𝘁𝗵 𝗮 𝗙𝗹𝗮𝘀𝗵 𝗺𝗼𝗱𝗲𝗹 𝘁𝗵𝗮𝘁 𝘀𝘁𝗲𝗮𝗺𝗿𝗼𝗹𝗹𝘀 𝗚𝗣𝗧-𝟰𝗼 𝗮𝗻𝗱 𝗖𝗹𝗮𝘂𝗱𝗲-𝟯.𝟲 𝗦𝗼𝗻𝗻𝗲𝘁! And they start a huge effort on agentic capabilities. Performance improvements:
‣ Gemini -

Speculation on Anthropic’s Opus-3.5 and AI Scaling Laws
By
–
Are scaling laws still alive? New blog post suggests that @AnthropicAI might have an extremely strong Opus-3.5 already available, but is not releasing it to keep their edge over the competition. – Since the release of Opus-3.5 has been delayed indefinitely, there have been
-
New Multilingual AI Benchmark and Model Releases in Open Source AI
By
–
Last week was crazy in OS AI, with important models and datasets releases every day. Here are the most important ones I've pinned: Cohere relased GLobal-MMLU, a multilingual version of MMLU, to evaluate AI models' world knowledge in many languages! Meta released