Dear researchers,
Centrum Wiskunde & Informatica (CWI) kindly invites you to the first Seminar++ meeting on Machine Learning Theory, taking place on Wednesday March 8 from 15:00 - 17:00. These Seminar++ meetings consist of a one-hour lecture building up to an open problem, followed by an hour of brainstorming time. The meeting is intended for interested researchers including PhD students. These meetings are freely accessible without registration. Cookies and tea will be provided in the half-time break.
The meeting of 8 March will be:
Julia Olkhovskaya (Department of Mathematics of the Vrije Universiteit Amsterdam)
Online reinforcement learning with linear function approximation: role of the choice of policy optimization algorithm and learner’s feedback
*Abstract:* We consider learning in an adversarial MDP, where the loss function can change arbitrarily between episodes, and we assume that the Q-function of any policy is linear in some known features. We discuss two recent works, providing new insights into the solution to this problem ([1], [2]). We will look at the combination of methods proposed in these two papers to achieve better theoretical guarantees on the performance of the algorithms. More precisely, we will check if taking the best from both papers can lead to an improvement: exploration bonuses from [1] and the choice of the regularizer from [2]. If there will be time, we also discuss the variation of this problem when the information available to the learner is only the cumulative loss of the learner accumulated over the episode.
* [1] https://arxiv.org/pdf/2301.13087.pdf * [2] https://arxiv.org/pdf/2301.12942.pdf
The event takes place in room L016 in the CWI building, Science Park 123, Amsterdam.
The Seminar++ Meetings are part of the Machine Learning Theory Semester Programme https://www.cwi.nl/~wmkoolen/MLT_Sem23/index.html, which runs in Spring 2023.
Best regards on behalf of CWI from the program committee,
Wouter Koolen