Evolution Strategies can be applied at scale to fine-tune LLMs, and outperforms PPO and GRPO in many model settings!
— hardmaru (@hardmaru) 7 octobre 2025
Fantastic paper “Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning” by @yule_gan, Risto Miikkulainen and team.https://t.co/CEyX6Z5ulG https://t.co/mPkXABJcQz
Evolution Strategies can be applied at scale to fine-tune LLMs, and outperforms PPO and GRPO in many model settings! Fantastic paper “Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning” by @yule_gan
, Risto Miikkulainen and team. https://
arxiv.org/abs/2509.24372
Leave a Reply