Bigger, Better, Faster: Human-level Atari with human-level efficiency introduce a value-based RL agent, which we call BBF, that achieves super-human performance in the Atari 100K benchmark. BBF relies on scaling the neural networks used for value estimation, as well as a number
BBF: Value-Based RL Agent Achieves Super-Human Atari Performance
By
–
