AI Dynamics

Global AI News Aggregator

About

Problem Complexity and Human Baseline Improvements in RL Research

Cheers for taking a swing at it, but the problem is far from trivial, even if the final solutions are fairly compact code. Note that they upgraded the human baselines later. The Muesli paper has higher numbers than the original DQN paper.

→ View original post on X — @id_aa_carmack