Dear colleagues,
On Friday 4 November we will have a seminar with the two Van Wijngaarden award winners: Marta Kwiatkowska (University of Oxford) and Susan Murphy (Harvard University) at the Centrum Wiskunde & Informatica (CWI) in Amsterdam.
The meeting will take place between 15:00 and 17:00 in the Euler room (Amsterdam Science Park Congress Center), with drinks available afterwards. The program is as follows:
* 15:00-15:45: Marta Kwiatkowska: "Safety and robustness for deep learning with provable guarantees" * 15:45-16:00: Questions and small break * 16:00-16:45: Susan Murphy: "Inference for Longitudinal Data After Adaptive Sampling" * 16:45-end: Questions followed by drinks
Abstracts of the talks are copied below,
Best regards, Felix Lucka, Jannis Teunissen & Peter Grunwald
Marta Kwiatkowska (University of Oxford): Safety and robustness for deep learning with provable guarantees
Abstract: Computing systems are becoming ever more complex, with decisions increasingly often based on deep learning components. This lecture will describe progress with developing automated verification techniques for deep neural networks to ensure safety and robustness of their decisions. The lecture will conclude with an overview of the challenges in this field.
Susan Murphy (Harvard University): Inference for Longitudinal Data After Adaptive Sampling
Abstract: Adaptive sampling methods, such as reinforcement learning (RL) and bandit algorithms, are increasingly used for the real-time personalization of interventions in digital applications like mobile health and education. As a result, there is a need to be able to use the resulting adaptively collected user data to address a variety of inferential questions, including questions about time-varying causal effects. However, current methods for statistical inference on such data (a) make strong assumptions regarding the environment dynamics, e.g., assume the longitudinal data follows a Markovian process, or (b) require data to be collected with one adaptive sampling algorithm per user, which excludes algorithms that learn to select actions using data collected from multiple users. These are major obstacles preventing the use of adaptive sampling algorithms more widely in practice. In this work, we provide theory for common Z-estimators based on adaptively sampled data. The inference is valid even when observations are non-stationary and highly dependent over time, and (b) allow the online adaptive sampling algorithm to learn using the data of all users. Furthermore, our inference method is robust to miss-specification of the reward models used by the adaptive sampling algorithm. This work is motivated by our work in designing the Oralytics oral health clinical trial in which an RL adaptive sampling algorithm will be used to select treatments, yet valid statistical inference is essential for conducting primary data analyses after the trial is over.