Dear all,
Our speaker in the thematic seminar this Friday is Samory Kpotufe.
Further below is a list of upcoming talks scheduled for the second
semester.
Samory Kpotufe (Department of Statistics, Columbia
University, http://www.columbia.edu/~skk2175/)
Samory works at the intersection of statistics and machine learning,
with an interest in adaptive methods, and was one of the chairs of
last year's COLT conference on learning theory.
Friday, November 26, 16.00-17.00
Online on Zoom:
https://uva-live.zoom.us/j/85155421740
Meeting ID: 851 5542 1740
Please also join for online drinks after the talk.
Some Recent Insights on Transfer and Multitask Learning
A common situation in Machine Learning is one where training data is
not fully representative of a target population due to bias in the
sampling mechanism or due to prohibitive target sampling costs. In
such situations, we aim to 'transfer' relevant information from the
training data (a.k.a. source data) to the target application. How
much information is in the source data about the target application?
Would some amount of target data improve transfer? These are all
practical questions that depend crucially on 'how far' the source
domain is from the target. However, how to properly measure
'distance' between source and target domains remains largely
unclear. In this talk we will argue that many of the traditional
notions of 'distance' (e.g., KL-divergence, extensions of TV such as
D_A discrepancy, density-ratios, Wasserstein distance) can yield an
over-pessimistic picture of transferability. Instead, we show that
some asymmetric notions of 'relatedness' between source and target
(which we simply term 'transfer-exponents') capture a continuum from
easy to hard transfer. Transfer-exponents uncover a rich set of
situations where transfer is possible even at fast rates; they
encode the relative benefits of source and target samples, and have
interesting implications for related problems such as 'multi-task or
multi-source learning'. In particular, in the case of transfer from
multiple sources, we will discuss (if time permits) a strange
phenomenon: no procedure can guarantee a rate better than that of
having a single data source, even in seemingly mild situations where
multiple sources are informative about the target. The talk is based
on earlier work with Guillaume Martinet, and ongoing work with Steve
Hanneke.
Seminar organizers:
Tim van Erven
Botond Szabo
https://mschauer.github.io/StructuresSeminar/
Upcoming talks:
Mar. 11, 2022, Tomer Koren, Tel Aviv University
Mar. 25, 2022, Nicolò Cesa-Bianchi, Università degli Studi di Milano
Apr. 22, 2022, Tor Lattimore, DeepMind
[date to be confirmed], Julia Olkhovskaya, Vrije Universiteit
--
Tim van Erven <tim@timvanerven.nl>
www.timvanerven.nl