New Course: Reinforcement Fine-Tuning LLMs with GRPO!
— Andrew Ng (@AndrewYNg) 21 mai 2025
Learn to use reinforcement learning to improve your LLM performance in this short course, built in collaboration with @Predibase, and taught by @TravisAddair, its Co-Founder and CTO, and @grg_arnav, its Senior Engineer and… pic.twitter.com/j5AXn3swAD
New Course: Reinforcement Fine-Tuning LLMs with GRPO! Learn to use reinforcement learning to improve your LLM performance in this short course, built in collaboration with @Predibase
, and taught by @TravisAddair
, its Co-Founder and CTO, and @grg_arnav
, its Senior Engineer and