Dear researchers,
Centrum Wiskunde & Informatica (CWI) kindly invites you to
the first Seminar++ meeting on Machine Learning Theory, taking
place on Wednesday March 8 from 15:00 - 17:00. These Seminar++
meetings consist of a one-hour lecture building up to an open
problem, followed by an hour of brainstorming time. The meeting is
intended for interested researchers including PhD students. These
meetings are freely accessible without registration. Cookies and
tea will be provided in the half-time break.
The meeting of 8 March will be:
Julia Olkhovskaya (Department of Mathematics of the Vrije Universiteit Amsterdam)
Abstract: We consider learning in an adversarial MDP, where the loss function can change arbitrarily between episodes, and we assume that the Q-function of any policy is linear in some known features. We discuss two recent works, providing new insights into the solution to this problem ([1], [2]). We will look at the combination of methods proposed in these two papers to achieve better theoretical guarantees on the performance of the algorithms. More precisely, we will check if taking the best from both papers can lead to an improvement: exploration bonuses from [1] and the choice of the regularizer from [2]. If there will be time, we also discuss the variation of this problem when the information available to the learner is only the cumulative loss of the learner accumulated over the episode.
The event takes place in room L016 in the CWI building, Science Park 123, Amsterdam.The Seminar++ Meetings are part of the Machine Learning Theory Semester
Programme, which runs in Spring 2023.
Best regards on behalf of CWI from the program committee,
Wouter Koolen