AI Dynamics

Global AI News Aggregator

About

Flow Q-Learning: Offline RL with Flow-Matching Policies

Flow Q-Learning Flow Q-Learning (FQL) is an offline reinforcement learning (RL) method that uses flow-matching policies to model complex action distributions. It avoids the challenges of iterative action generation by training an expressive one-step policy with RL, which

→ View original post on X — @askalphaxiv