Dear colleagues,
On Friday 4 November we will have a seminar with the two Van Wijngaarden
award winners: Marta Kwiatkowska (University of Oxford) and Susan Murphy
(Harvard University) at the Centrum Wiskunde & Informatica (CWI) in
Amsterdam.
The meeting will take place between 15:00 and 17:00 in the Euler room
(Amsterdam Science Park Congress Center), with drinks available
afterwards. The program is as follows:
* 15:00-15:45: Marta Kwiatkowska: "Safety and robustness for deep
learning with provable guarantees"
* 15:45-16:00: Questions and small break
* 16:00-16:45: Susan Murphy: "Inference for Longitudinal Data After
Adaptive Sampling"
* 16:45-end: Questions followed by drinks
Abstracts of the talks are copied below,
Best regards,
Felix Lucka, Jannis Teunissen & Peter Grunwald
Marta Kwiatkowska (University of Oxford): Safety and robustness for deep
learning with provable guarantees
Abstract: Computing systems are becoming ever more complex, with
decisions increasingly often based on deep learning components. This
lecture will describe progress with developing automated verification
techniques for deep neural networks to ensure safety and robustness of
their decisions. The lecture will conclude with an overview of the
challenges in this field.
Susan Murphy (Harvard University): Inference for Longitudinal Data After
Adaptive Sampling
Abstract: Adaptive sampling methods, such as reinforcement learning (RL)
and bandit algorithms, are increasingly used for the real-time
personalization of interventions in digital applications like mobile
health and education. As a result, there is a need to be able to use the
resulting adaptively collected user data to address a variety of
inferential questions, including questions about time-varying causal
effects. However, current methods for statistical inference on such data
(a) make strong assumptions regarding the environment dynamics, e.g.,
assume the longitudinal data follows a Markovian process, or (b) require
data to be collected with one adaptive sampling algorithm per user,
which excludes algorithms that learn to select actions using data
collected from multiple users. These are major obstacles preventing the
use of adaptive sampling algorithms more widely in practice. In this
work, we provide theory for common Z-estimators based on adaptively
sampled data. The inference is valid even when observations are
non-stationary and highly dependent over time, and (b) allow the online
adaptive sampling algorithm to learn using the data of all users.
Furthermore, our inference method is robust to miss-specification of the
reward models used by the adaptive sampling algorithm. This work is
motivated by our work in designing the Oralytics oral health clinical
trial in which an RL adaptive sampling algorithm will be used to select
treatments, yet valid statistical inference is essential for conducting
primary data analyses after the trial is over.