Mustafa Celikok, TU Delft - A Unifying View of Optimism in Episodic Reinforcement Learning by Gergely Neu, Ciara Pike-Burne
Alexander Mey, TU Delft - Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes by Yi Tian, Jian Qian, Suvrit Sra
Jack Mayo, University of Amsterdam -
Optimal Algorithms for Stochastic Multi-Armed Bandits with Heavy Tailed Rewards by Kyungjae Lee, Hongjun Yang, Sungbin Lim, Songhwai Oh
For
up to date information and an overview of past meetings, see www.timvanerven.nl/neurips-debriefing/
Best
regards,