Thanks for the comment! I tried to keep the book more "foundational" rather than chasing the latest research (that's what I do in my blog). But yeah, maybe I can add some bonus material to the GitHub repo on implementing RL with verifiable rewards. (I don't want to do just
Foundational approach to book with potential RL bonus material
By
–