Flow Q-Learning Flow Q-Learning (FQL) is an offline reinforcement learning (RL) method that uses flow-matching policies to model complex action distributions. It avoids the challenges of iterative action generation by training an expressive one-step policy with RL, which
Flow Q-Learning: Offline RL with Flow-Matching Policies
By
–
