Dear researchers,
Centrum Wiskunde & Informatica (CWI) kindly invites you seventh Seminar++ meeting on Machine Learning Theory, taking place on Wednesday June 7 from 15:00 - 17:00. These Seminar++ meetings consist of a one-hour lecture building up to an open problem, followed by an hour of brainstorming time. The meeting is intended for interested researchers including PhD students. These meetings are freely accessible without registration. Cookies, coffee and tea will be provided in the half-time break.
The meeting of 7 June will be hosted by:
Odysseas Kanavetas https://www.universiteitleiden.nl/en/staffmembers/odysseas-kanavetas#tab-1 Assistant Professsor at the University of Leiden https://www.universiteitleiden.nl/en.
Asymptotically optimal control for Markov Decision Processes (MDP) under side constraints
*Abstract:* After a brief review of the multi-armed bandit (MAB) problem and its online machine learning applications, we present our work on the model with side constraints. The constraints represent circumstances in which bandit activations are restricted by the availability of certain resources that are replenished at a constant rate.
Then, we consider the problem of adaptive control for Markov Decision Processes (MDP), under side constraints, when there is incomplete information for the transition probabilities and its rewards. Under suitable irreducibility assumptions for the MDP we establish a lower bound for the regret. An open problem is to construct adaptive policies that maximize the rate of convergence of realized rewards to that of the optimal (non adaptive) policy under complete information. We also discuss applications for queuing control problems and reliability models.
The event takes place in room L016 in the CWI building, Science Park 123, Amsterdam.
The Seminar++ Meetings are part of the Machine Learning Theory Semester Programme https://www.cwi.nl/en/events/cwi-research-semester-programs/research-programmes-in-2023/overview-research-semester-programme-machine-learning-theory/, which runs in Spring 2023.
Best regards on behalf of CWI from the program committee,
Wouter Koolen
machine-learning-nederland@list.uva.nl