Statistical Modelling using Linked Data - Issues and Opportunities
- Speaker: Ray Chambers (University of Wollongong)
- Date & Time: Friday 08 July 2016, 11:30 - 12:30
- Venue: Seminar Room 1, Newton Institute
Abstract
Probabilistic linkage of multiple data sets is now popular and widespread. Unfortunately, there appears to be little corresponding enthusiasm for adjusting standard methods of statistical analysis when they are used with these linked data sets, even though there is plenty of evidence from simulation studies that both incorrect links and informative missed links can lead to biased inference. In this presentation I will describe the key issues that need to be addressed when analysing such linked data and some of the methods that can help. In this context, I will focus in particular on the simple linear regression model as a vehicle for demonstrating how knowledge about the statistical properties of the linkage process, together with summary information about the population distribution of the analysis variables, can be used to correct for (or at least alleviate) these inferential problems. Recent research at the Australian Bureau of Statistics on a potential weighting/imputation approach to implementing these solutions will also be presented.
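As a rough illustration of the bias described in the abstract, the following Python sketch simulates a simple linear regression in which a known fraction of records is linked to the wrong response value. It is not taken from the talk. It assumes an exchangeable linkage error model (each record is correctly linked with probability `lam`, otherwise linked to some other record with roughly equal probability), and the function names, the parameter `lam`, and the particular adjustment (regressing the linked response on the expected linked design matrix) are choices made here for illustration only.

```python
import numpy as np


def simulate_linked_data(n, lam, intercept, slope, rng):
    """Simulate (x, y) pairs, then mislink a fraction 1 - lam of the y values."""
    x = rng.normal(size=n)
    y = intercept + slope * x + rng.normal(size=n)
    n_wrong = int(round((1 - lam) * n))
    # Randomly chosen records that will receive another record's y value.
    wrong = rng.choice(n, size=n_wrong, replace=False)
    y_linked = y.copy()
    # A cyclic shift within the chosen subset guarantees every one is mislinked.
    y_linked[wrong] = y[np.roll(wrong, 1)]
    return x, y_linked


def ols(design, response):
    """Ordinary least squares coefficients for the given design matrix."""
    coef, *_ = np.linalg.lstsq(design, response, rcond=None)
    return coef


def naive_fit(x, y_linked):
    """Regress the linked response on x, ignoring linkage error."""
    X = np.column_stack([np.ones_like(x), x])
    return ols(X, y_linked)


def adjusted_fit(x, y_linked, lam):
    """Regress the linked response on E[A] X, assuming exchangeable errors.

    Under the assumed model, E[A] = (lam - gamma) I + gamma 1 1^T with
    gamma = (1 - lam) / (n - 1), so E[A] X can be formed without
    building an n x n matrix.
    """
    n = len(x)
    X = np.column_stack([np.ones_like(x), x])
    gamma = (1 - lam) / (n - 1)
    expected_X = (lam - gamma) * X + gamma * X.sum(axis=0)
    return ols(expected_X, y_linked)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x, y_linked = simulate_linked_data(5000, 0.8, 1.0, 2.0, rng)
    print("naive (intercept, slope):   ", naive_fit(x, y_linked))
    print("adjusted (intercept, slope):", adjusted_fit(x, y_linked, 0.8))
```

In this setup the naive slope is attenuated towards roughly `lam` times the true slope, while the adjusted fit should recover a value close to the true slope, provided `lam` is known. In practice `lam` would itself have to be estimated, for example from a clerically reviewed audit sample, which this sketch does not attempt.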
Series
This talk is part of the Isaac Newton Institute Seminar Series.