March 8, 15:00: Machine Learning Theory Seminar++ Meeting with Julia Olkhovskaya - Machine-learning-nederland

3 Mar 2023


Dear researchers,
Centrum Wiskunde & Informatica (CWI) kindly invites you to the first 
Seminar++ meeting on Machine Learning Theory, taking place on Wednesday, 
March 8, from 15:00 to 17:00. These Seminar++ meetings consist of a 
one-hour lecture building up to an open problem, followed by an hour of 
brainstorming time. The meeting is intended for interested researchers 
including PhD students. These meetings are freely accessible without 
registration. Cookies and tea will be provided in the half-time break.
The speaker at the March 8 meeting will be:
Julia Olkhovskaya (Department of Mathematics of the Vrije Universiteit 
Amsterdam)
Online reinforcement learning with linear function approximation: 
role of the choice of policy optimization algorithm and learner’s 
feedback
*Abstract:* We consider learning in an adversarial MDP, where the loss 
function can change arbitrarily between episodes, and we assume that the 
Q-function of any policy is linear in some known features. We discuss 
two recent works ([1], [2]) that provide new insights into this problem. 
We will look at combining the methods proposed 
in these two papers to achieve better theoretical guarantees on the 
performance of the algorithms. More precisely, we will examine whether 
combining the best of both papers, namely the exploration bonuses from 
[1] and the choice of regularizer from [2], can lead to an improvement. 
If time permits, we will also discuss a variant of this problem in which 
the only information available to the learner is its cumulative loss 
over the episode.
* [1] https://arxiv.org/pdf/2301.13087.pdf
* [2] https://arxiv.org/pdf/2301.12942.pdf
The event takes place in room L016 in the CWI building, Science Park 
123, Amsterdam.
The Seminar++ Meetings are part of the Machine Learning Theory Semester 
Programme https://www.cwi.nl/~wmkoolen/MLT_Sem23/index.html, which 
runs in Spring 2023.
Best regards on behalf of CWI from the program committee,
Wouter Koolen