AI Dynamics

Global AI News Aggregator

Meta’s Muse Spark: Multimodal AI Model with Impressive Reasoning Benchmarks

Meta Superintelligence Labsjust dropped Muse Spark, their first model after a full nine-month rebuild of their AI stack. the tl;dr (summary) It's a natively multimodal reasoning model that now powers Meta AI. It's competitive on reasoning and multimodal benchmarks, introduces a multi-agent "Contemplating mode," and Meta frames it as step one on a scaling ladder toward "personal superintelligence." Where it's strong: -Multimodal perception and visual reasoning (visual STEM, entity recognition, localization) -Health reasoning, built with input from 1,000+ physicians -Test-time reasoning efficiency, using thinking time penalties to compress reasoning tokens -Contemplating mode hits 58% on Humanity's Last Exam and 38% on FrontierScience Research, putting it in the ballpark of Gemini Deep Think and GPT Pro -Pretraining efficiency: reaches the same capability as Llama 4 Maverick with over 10x less compute Where it's weaker (Meta's own admission): -Long-horizon agentic systems -Coding workflows Key scaling findings: -RL compute scales smoothly with log-linear growth on pass@1 and pass@16 -Multi-agent orchestration scales performance without proportional latency increase -Phase transition behavior on AIME: the model first extends reasoning, then compresses it under length penalties, then extends again for higher accuracy My take: very good model, really surprised what meta offered here. And keep in mind: 99% of all instagram / facebook user dont need an LLM for doing academic reserach but for everyday reasoning. Well done, meta! Chubby♨️ (@kimmonismus) Lol what?! Meta has been cooking! These benchmarks are really freaking good holy!! — https://nitter.net/kimmonismus/status/2041918006779957407#m

→ View original post on X — @kimmonismus, 2026-04-08 16:42 UTC

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *