How to Stop Shipping Low-Quality RL Environments (with Examples) https://
latent.space/p/bad-envs RL env startups are all the rage, but so many are TERRIBLE. We're proud to feature our latest guest post from @aurielws
, who has spent years in every layer of the stack at
@latentspacepod
-
How to Stop Shipping Low-Quality RL Environments (with Examples)
By
–
-
Andon Labs’ real-world AI evals: Claude calls FBI, AI CEOs, price cartels
By
–
Andon Labs' Real-World AI Evals: Claude calls the FBI, AI CEOs, price cartels, Butter-Bench, & Luna https://t.co/KpVP5fw9dM@andonlabs cofounders @lukaspet and @axelbacklund explain why dollar-denominated evals reveal what traditional benchmarks miss, how Claude ended up… pic.twitter.com/Nd11hvIMAo
— Latent.Space (@latentspacepod) 4 juin 2026Andon Labs' Real-World AI Evals: Claude calls the FBI, AI CEOs, price cartels, Butter-Bench, & Luna https://
latent.space/p/andon @andonlabs cofounders @lukaspet and @axelbacklund explain why dollar-denominated evals reveal what traditional benchmarks miss, how Claude ended up -
Scaling Past Informal AI: Math, Lean, and formal proofs for AGI
By
–
Scaling Past Informal AI https://
latent.space/p/axiom @axiommathai founder & CEO @CarinaLHong explains why math may be the missing path from code agents to AGI, why verified AI is about scaling brilliance not just fixing hallucinations, how Lean and formal proofs turn reasoning -
OpenAI’s Sam Altman: token usage scaled 1 million times in 6 years
By
–
OpenAI's @sama on scaling challenges: 6 years ago the top tokenmaxxer in the world was using 100k toks/mo, now that's the world median and top tokenmaxxer is > 100B toks/mo.
— Latent.Space (@latentspacepod) 3 juin 2026
That's a 1,000,000x in 6 years.
We think there's another 1,000,000x and global average usage of 100B… https://t.co/wMhihMvaEv pic.twitter.com/S5cvw3RcXfOpenAI's @sama on scaling challenges: 6 years ago the top tokenmaxxer in the world was using 100k toks/mo, now that's the world median and top tokenmaxxer is > 100B toks/mo. That's a 1,000,000x in 6 years. We think there's another 1,000,000x and global average usage of 100B
-
GitHub’s Agent Era: AI agents drive evolution beyond code hosting
By
–
🆕GitHub's Agent Era: 14x commits, 200M developers, Copilot’s next act https://t.co/4JxxgKmJOi@github COO @kdaigle explains why AI agents are forcing GitHub to evolve beyond code hosting, how Copilot is moving from autocomplete to CLI, desktop, cloud agents, and ambient… pic.twitter.com/B3afrwBuaG
— Latent.Space (@latentspacepod) 3 juin 2026GitHub's Agent Era: 14x commits, 200M developers, Copilot’s next act https://
latent.space/p/github @github COO @kdaigle explains why AI agents are forcing GitHub to evolve beyond code hosting, how Copilot is moving from autocomplete to CLI, desktop, cloud agents, and ambient -
Ethan He discusses AI video’s codex phase and coding agent path
By
–
🆕Grok Imagine’s Video Agent Moment: Cosmos, xAI, World Models, Generative UI, & the Codex Phase for Video https://t.co/UiTGJTIlPQ@EthanHe_42, former @xai world model lead and @nvidia Cosmos researcher, explains why AI video may follow the same path as coding agents, how Grok… pic.twitter.com/sCRaCpa10i
— Latent.Space (@latentspacepod) 1 juin 2026Grok Imagine’s Agent Moment: Cosmos, xAI, World Models, Generative UI, & the Codex Phase for https://
latent.space/p/xai @EthanHe_42
, former @xai world model lead and @nvidia Cosmos researcher, explains why AI video may follow the same path as coding agents, how Grok -
Grok, Cosmos, and World Models: The Video Agent Moment
By
–
Grok Imagine’s Agent Moment: Cosmos, xAI, World Models, Generative UI, & the Codex Phase for Video! https://
latent.space/p/video-agents @EthanHe_42
, former @xai world model lead and @nvidia Cosmos researcher, explains why AI video may follow the same path as coding agents, how Grok -
The Age of Async Agents: Devin’s Growth, AI Commits, and Cloud Engineering
By
–
🆕The Age of Async Agents: Devin’s 7x PR growth, 80% AI commits, background agents, memory, testing, & Open-Inspect https://t.co/x5Hw5S3egc@cognition cofounder + CPO @walden_yan and Open-Inspect creator @_colemurray explain why engineering is moving from local IDEs to cloud… pic.twitter.com/fciT77nJNI
— Latent.Space (@latentspacepod) 28 mai 2026The Age of Async Agents: Devin’s 7x PR growth, 80% AI commits, background agents, memory, testing, & Open-Inspect https://
latent.space/p/cognition @cognition cofounder + CPO @walden_yan and Open-Inspect creator @_colemurray explain why engineering is moving from local IDEs to cloud -
Biohub’s Protein World Model: ESMC-6B and Scaling Laws in Biology
By
–
Biohub’s Protein World Model: ESMC-6B, ESMFold2, 6.8B proteins, 1.1B structures, antibody design, SAEs, & the bitter lesson for biology https://
latent.space/p/esmfold2 @biohub Head of Science @alexrives explains why biology may scale like language modeling, how metagenomics unlocked -
Daytona’s Agent-Native Compute Explained: AI Agents Need Composable Computers
By
–
🆕Daytona’s Agent-Native Compute: 60ms sandboxes, 50K startups in 75 sec, 850K daily runs, RL/evals, CLI > MCP, & the end of localhost https://t.co/3sauItT6oc@daytonaio CEO @ivanburazin explains why AI agents need composable computers, how Daytona pivoted from human dev… pic.twitter.com/WGjlPJwpEr
— Latent.Space (@latentspacepod) 21 mai 2026Daytona’s Agent-Native Compute: 60ms sandboxes, 50K startups in 75 sec, 850K daily runs, RL/evals, CLI > MCP, & the end of localhost https://
latent.space/p/daytona @daytonaio CEO @ivanburazin explains why AI agents need composable computers, how Daytona pivoted from human dev