High-Entropy Tokens Drive LLM Reasoning via Reinforcement Learning

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

This paper investigates how a small subset of high-entropy tokens, termed "forking tokens", drives the performance of reinforcement learning with verifiable rewards (RLVR).
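To make the idea of a "high-entropy token" concrete, here is a minimal sketch that computes the Shannon entropy of each position's next-token distribution and picks the most uncertain positions. It is an illustration only, not the paper's implementation: the function names, the toy distributions, and the 20% cutoff (suggested by the "80/20" in the title) are all assumptions.

```python
import math


def token_entropy(probs):
    """Shannon entropy (in nats) of a single next-token distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def high_entropy_positions(distributions, fraction=0.2):
    """Return (positions, entropies).

    `positions` holds the indices of the top `fraction` of positions ranked
    by entropy. `fraction` is a hypothetical parameter for illustration.
    """
    entropies = [token_entropy(d) for d in distributions]
    k = max(1, math.ceil(fraction * len(entropies)))
    ranked = sorted(range(len(entropies)), key=lambda i: entropies[i], reverse=True)
    return sorted(ranked[:k]), entropies


# Toy next-token distributions over a 4-word vocabulary (made-up numbers).
dists = [
    [0.97, 0.01, 0.01, 0.01],     # near-certain continuation
    [0.25, 0.25, 0.25, 0.25],     # maximally uncertain: a candidate "fork"
    [0.90, 0.05, 0.03, 0.02],
    [0.70, 0.10, 0.10, 0.10],
    [0.99, 0.005, 0.003, 0.002],
]

positions, entropies = high_entropy_positions(dists, fraction=0.2)
print(positions)
```

In this toy input, the uniform distribution at index 1 has the highest entropy (ln 4), so it is the only position selected when the cutoff keeps one position in five.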

→ View original post on X — @askalphaxiv