Dear all,
Gentle reminder: the talk by Julia Olkhovskaya in the thematic seminar is today!
*Julia Olkhovskaya* (Vrije Universiteit, https://sites.google.com/view/julia-olkhovskaya/home)
*Friday June 10*, 16h00-17h00, online on Zoom: https://uva-live.zoom.us/j/89796690874 (Meeting ID: 897 9669 0874)
*Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits*
We study the Bayesian regret of the renowned Thompson Sampling algorithm in contextual bandits with binary losses and adversarially selected contexts. We adapt the information-theoretic perspective of Russo and Van Roy [2016] to the contextual setting by introducing a new notion of information ratio based on the mutual information between the unknown model parameter and the observed loss. This allows us to bound the regret in terms of the entropy of the prior distribution through a remarkably simple proof, with no structural assumptions on the likelihood or the prior. We also extend our results to priors with infinite entropy under a Lipschitz assumption on the log-likelihood. An interesting special case is that of logistic bandits with d-dimensional parameters, K actions, and Lipschitz logits.
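For readers unfamiliar with the setting, here is a minimal, illustrative sketch (not the speakers' code) of Thompson Sampling in a contextual bandit with binary losses and a finite prior over model parameters; the model class, prior, and likelihood below are made-up placeholders, and the randomly drawn contexts stand in for adversarially chosen ones.

```python
# Illustrative sketch: Thompson Sampling with Bernoulli losses and a
# finite (hence finite-entropy) prior over candidate model parameters.
import numpy as np

rng = np.random.default_rng(0)
n_models, n_contexts, n_actions, horizon = 20, 5, 3, 1000

# Hypothetical model class: each parameter theta maps (context, action)
# to a probability of incurring a loss of 1 (Bernoulli likelihood).
loss_probs = rng.uniform(0.1, 0.9, size=(n_models, n_contexts, n_actions))
true_model = rng.integers(n_models)            # the unknown parameter
posterior = np.full(n_models, 1.0 / n_models)  # uniform prior over models

total_loss = 0.0
for t in range(horizon):
    context = rng.integers(n_contexts)   # placeholder for an adversarial context
    # Thompson Sampling: draw a model from the posterior, act greedily on it.
    theta = rng.choice(n_models, p=posterior)
    action = int(np.argmin(loss_probs[theta, context]))
    # Observe the binary loss generated by the true (unknown) model.
    loss = rng.random() < loss_probs[true_model, context, action]
    total_loss += loss
    # Bayesian update with the Bernoulli likelihood of the observed loss.
    likelihood = loss_probs[:, context, action] if loss else 1.0 - loss_probs[:, context, action]
    posterior *= likelihood
    posterior /= posterior.sum()

print(f"average loss over {horizon} rounds: {total_loss / horizon:.3f}")
```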
This is joint work with Gergely Neu, Matteo Papini and Ludovic Schwartz.
Seminar organizers: Tim van Erven and Botond Szabo
https://mschauer.github.io/StructuresSeminar/