Ranking the information content of distance measures through the information imbalance
- 👤 Speaker: Aldo Glielmo ( International School for Advanced Studies (SISSA)
- 📅 Date & Time: Monday 28 June 2021, 16:30 - 17:30
- 📍 Venue: virtual ZOOM meeting ID: 263 591 6003, Passcode: 000042, https://us02web.zoom.us/j/2635916003?pwd=ZlBEQnRENGwxNmJGMENGMWxjak5nUT09
Abstract
Real-world data typically contain a large number of features that are often heterogeneous in nature, relevance, and also units of measure. When assessing the similarity between data points, one can build various distance measures using subsets of these features. Using the fewest features but still retaining sufficient information about the system is crucial in many statistical learning schemes, particularly when data are sparse. In my talk I will describe the “information imbalance”: a novel statistical concept that quantifies the relative information retained when using two different distance measures, and determine if they are equivalent, independent, or if one is more informative than the other. I will then show how the information imbalance can be used to find the most informative distance measure out of a pool of candidates, and present applications of this idea for the analysis of the Covid-19 epidemic spreading as well as for the construction of optimally informative descriptors of physical systems.
Series This talk is part of the Machine learning in Physics, Chemistry and Materials discussion group (MLDG) series.
Included in Lists
- Hanchen DaDaDash
- Lennard-Jones Centre external
- Machine learning in Physics, Chemistry and Materials discussion group (MLDG)
- virtual ZOOM meeting ID: 263 591 6003, Passcode: 000042, https://us02web.zoom.us/j/2635916003?pwd=ZlBEQnRENGwxNmJGMENGMWxjak5nUT09
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Monday 28 June 2021, 16:30-17:30