Why do standard step sizes cause instability when learning from every single experience? Arsalan Sharifnassab, @RichardSSutton, and their team from the Openmind Research Institute and the University of Alberta present intentional updates: instead of picking a step size and hoping for
