This course covers three major algorithmic topics in machine learning. Half of the course is devoted to reinforcement learning with the focus on the policy gradient and deep Q-network algorithms. The ...
We examine recent claims that a particular Q-learning algorithm used by competitors ‘autonomously’ and systematically learns to collude, resulting in supracompetitive prices and extra profits for the ...