Dear researchers,
Centrum Wiskunde & Informatica (CWI) kindly invites you
seventh Seminar++ meeting on Machine Learning Theory, taking place
on Wednesday June 7 from 15:00 - 17:00. These Seminar++ meetings
consist of a one-hour lecture building up to an open problem,
followed by an hour of brainstorming time. The meeting is intended
for interested researchers including PhD students. These meetings
are freely accessible without registration. Cookies, coffee and
tea will be provided in the half-time break.
The meeting of 7 June will be hosted by:
Odysseas
Kanavetas Assistant Professsor at the University of Leiden.
Abstract: After a brief review of the multi-armed bandit (MAB) problem and its online machine learning applications, we present our work on the model with side constraints. The constraints represent circumstances in which bandit activations are restricted by the availability of certain resources that are replenished at a constant rate.
Then, we consider the problem of adaptive control for Markov Decision Processes (MDP), under side constraints, when there is incomplete information for the transition probabilities and its rewards. Under suitable irreducibility assumptions for the MDP we establish a lower bound for the regret. An open problem is to construct adaptive policies that maximize the rate of convergence of realized rewards to that of the optimal (non adaptive) policy under complete information. We also discuss applications for queuing control problems and reliability models.
The event takes place in room L016 in the CWI building, Science
Park 123, Amsterdam.
The Seminar++ Meetings are part of the Machine
Learning Theory Semester Programme, which runs in Spring
2023.
Best regards on behalf of CWI from the program committee,
Wouter Koolen