@aymericroucher - AI Dynamics

Discussion on AI Agents and AGI Progress via GAIA Benchmark

By

–

09 January 2025 16h04

Also on your thought in the line "Agents are not solved yet, they'll come when AGI comes" in your blog post, 100% agree:
When any system reaches 90% on GAIA benchmark (hard general tasks up to 1hour long, cc @clefourrier @Thom_Wolf @ThomasScialom
), that means that we have really

→ View original post on X — @aymericroucher

9 January 2025

Huggingface’s Smolagents Library Gains Traction for AI Agents

By

@aymericroucher

–

07 January 2025 15h01

Since I published it on GitHub a few days ago, @huggingface
's new agentic library smolagents has gathered nearly 4k stars But we are just getting started on agents: so we are hiring an ML Engineer to join me and double down on this effort! The plan is to build GUI agents:

→ View original post on X — @aymericroucher

7 January 2025

Code as Agentic Actions for LLMs

By

@aymericroucher

–

02 January 2025 22h57

This is a really important part, thank you for highlighting this @sloppenheimer
! Code (indeed many other languages than python could work) is just the better, overlooked version of writing agentic actions for LLMs

→ View original post on X — @aymericroucher

2 January 2025

Hugging Face Releases smolagents Library for AI Agent Development

By

@aymericroucher

–

31 December 2024 16h32

For months, we've worked on building @huggingface
's new moonshot: agentic systems. So today we're very proud to announce the release of 𝚜𝚖𝚘𝚕𝚊𝚐𝚎𝚗𝚝𝚜! It's the simplest library we could make to let people build powerful agents. The main logic for agents fits in ~1000

→ View original post on X — @aymericroucher

31 December 2024

Definition of AI Agents Involves Tool Use and Multistep Memory

By

@aymericroucher

–

29 December 2024 22h48

@minimaxir i'd say agents are tool use + multistep (multistep means a memory that logs past steps ans errors properly)

→ View original post on X — @aymericroucher

29 December 2024

Hugging Face Releases Picotron for Efficient LLM Training Parallelization

By

@aymericroucher

–

19 December 2024 10h14

> Hugging Face releases Picotron, a microscopic lib that solves LLM training 4D parallelization Llama-3.1-405B took 39 million GPU-hours to train, which represents 4.5 thousand years. If they had needed all this time, we would have GPU stories from the time of Pharaoh

→ View original post on X — @aymericroucher

19 December 2024

New LLM Research Shows Tokenizers Can Be Removed

By

@aymericroucher

–

13 December 2024 18h08

Potential paradigm shift in LLMs: new paper by @AIatMeta shows that we can get rid of tokenizers! Current LLMs process text by first splitting it into tokens. They use a module named "tokenizer", that -spl-it-s- th-e- te-xt- in-to- arbitrary tokens depending on a fixed

→ View original post on X — @aymericroucher

13 December 2024

Google Releases Gemini 2.0 AI Model with Agentic Capabilities

By

@aymericroucher

–

11 December 2024 17h54

𝗚𝗼𝗼𝗴𝗹𝗲 𝗿𝗲𝗹𝗲𝗮𝘀𝗲𝘀 𝗚𝗲𝗺𝗶𝗻𝗶 𝟮.𝟬, 𝘀𝘁𝗮𝗿𝘁𝗶𝗻𝗴 𝘄𝗶𝘁𝗵 𝗮 𝗙𝗹𝗮𝘀𝗵 𝗺𝗼𝗱𝗲𝗹 𝘁𝗵𝗮𝘁 𝘀𝘁𝗲𝗮𝗺𝗿𝗼𝗹𝗹𝘀 𝗚𝗣𝗧-𝟰𝗼 𝗮𝗻𝗱 𝗖𝗹𝗮𝘂𝗱𝗲-𝟯.𝟲 𝗦𝗼𝗻𝗻𝗲𝘁! And they start a huge effort on agentic capabilities. Performance improvements:
‣ Gemini

→ View original post on X — @aymericroucher

11 December 2024

Speculation on Anthropic’s Opus-3.5 and AI Scaling Laws

By

@aymericroucher

–

11 December 2024 12h24

Are scaling laws still alive? New blog post suggests that @AnthropicAI might have an extremely strong Opus-3.5 already available, but is not releasing it to keep their edge over the competition. – Since the release of Opus-3.5 has been delayed indefinitely, there have been

→ View original post on X — @aymericroucher

11 December 2024

New Multilingual AI Benchmark and Model Releases in Open Source AI

By

@aymericroucher

–

09 December 2024 17h47

Last week was crazy in OS AI, with important models and datasets releases every day. Here are the most important ones I've pinned: Cohere relased GLobal-MMLU, a multilingual version of MMLU, to evaluate AI models' world knowledge in many languages! Meta released

→ View original post on X — @aymericroucher

9 December 2024