@aymericroucher - AI Dynamics

Tencent releases Hunyuan-Large: open MoE model beats LLaMA 3.1-405B

By

–

05 November 2024 10h35

> Hunyuan-Large just released by @TencentGlobal : Largest ever open MoE LLM, only 52B active parameters but beats LLaMA 3.1-405B on most academic benchmarks! Key insights: Mixture of Experts (MoE) architecture: 389 B parameters in total, but only 52B are activated for any

→ View original post on X — @aymericroucher

5 November 2024

CLEAR: First Benchmark for Machine Unlearning

By

@aymericroucher

–

02 November 2024 15h51

> CLEAR: first benchmark to make models forget what we want them to forget! With privacy concerns rising, we sometimes need our models to "forget" specific information – like a person's data – while keeping everything else intact. Researchers just released CLEAR, the first

→ View original post on X — @aymericroucher

2 November 2024

Oasis: First AI-Generated Real-Time Video Game Without Engine

By

@aymericroucher

–

01 November 2024 19h07

> Oasis: First Real-Time Video Game Without a Game Engine! 🎮@DecartAI & @Etched just released Oasis – a fully AI-generated video game running at 20 FPS (frames per second). The model takes keyboard inputs and generates everything – physics, rules, graphics – on the fly,… pic.twitter.com/DU6l3PBfhF
— m_ric (@AymericRoucher) 1 novembre 2024

> Oasis: First Real-Time Game Without a Game Engine! @DecartAI & @Etched just released Oasis – a fully AI-generated video game running at 20 FPS (frames per second). The model takes keyboard inputs and generates everything – physics, rules, graphics – on the fly,

→ View original post on X — @aymericroucher

1 November 2024

Tool use mastered by a Minecraft-generating LLM

By

@aymericroucher

–

31 October 2024 23h29

If I had known that "Tool use" would be first mastered by a Minecraft-generating LLM… (cf the video at 0:32)

→ View original post on X — @aymericroucher

31 October 2024

Highlights from OpenAI DevDay: image model, o1 coding paradigm, Realtime API

By

@aymericroucher

–

31 October 2024 13h04

> Highlights from yesterday’s @OpenAI DevDay in London: image model in the works, o1’s new coding paradigm, Realtime API o1 creates a new paradigm for code. OpenAI devs seem to have massively adopted o1 + Cursor, seems like the experience is vastly improved compared to GH

→ View original post on X — @aymericroucher

31 October 2024

Build for future LLM strengths, don’t plug holes

By

@aymericroucher

–

30 October 2024 17h10

SamA expects most shortcomings of LLMs to progressively disappear across future generations
=> Do not build a tool that plugs a hole, that goes around one shortcoming of the model, but instead build a model that leverages future strengths How can one justify trillions of

→ View original post on X — @aymericroucher

30 October 2024

OpenAI Dev Day Q&A: o1 models, larger models, startup integration

By

@aymericroucher

–

30 October 2024 17h07

Ongoing Q&A with @sama at @OpenAI 's dev day! Should we expect o1 like models or more larger models?=> Wants to make LLMs better across the board but this direction of reasoning is important How far wil the go up the integration? What should AI startup builders on top of OpenAI

→ View original post on X — @aymericroucher

30 October 2024

Tricking Sam Altman into releasing GPT-3.5 at DevDay

By

@aymericroucher

–

30 October 2024 12h38

Trying to trick @sama into releasing GPT-3.5 at OpenAI's DevDay in London!

→ View original post on X — @aymericroucher

30 October 2024

Cohere releases Aya 8B & 32B multilingual models for 23 languages

By

@aymericroucher

–

24 October 2024 16h08

Cohere releases Aya 8B & 32B: SOTA multilingual models for 23 languages ! @cohere just dropped two great models: great on generalist use, they specifically shine on 23 non-english languages. How did they pull that multilingual trick? 𝗧𝗿𝗮𝗶𝗻 𝗼𝗻 𝘀𝘆𝗻𝘁𝗵𝗲𝘁𝗶𝗰

→ View original post on X — @aymericroucher

24 October 2024

New Claude 3.5 models outperform GPT-4o

By

@aymericroucher

–

22 October 2024 17h59

New agentic powerhouse: upgraded Claude 3.5 Sonnet leaves GPT-4o in the dust Anthropic just announced two extremely impressive releases, with an improved Claude 3.5 Sonnet and a new Claude 3.5 Haiku ! (Haiku previously only existed in version 3) Claude 3.5 Sonnet: a

→ View original post on X — @aymericroucher

22 October 2024