University of Cambridge > > Isaac Newton Institute Seminar Series > Statistical Modelling using Linked Data - Issues and Opportunities

Statistical Modelling using Linked Data - Issues and Opportunities

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact INI IT.

DLAW01 - 'Data linkage and anonymisation: setting the agenda'

Probabilistic linkage of multiple data sets is now popular and widespread. Unfortunately, there appears to be little corresponding enthusiasm for adjusting standard methods of statistical analysis when they are used with these linked data sets, even though there is plenty of evidence from simulation studies that both incorrect links as well as informative missed links can lead to biased inference. In this presentation I will describe the key issues that need to be addressed when analysing such linked data and some of the methods that can help. In this context, I will focus in particular on the simple linear regression model as a vehicle for demonstrating how knowledge about the statistical properties of the linkage process as well as summary information about the population distribution of the analysis variables can be used to correct for (or at least alleviate) these inferential problems. Recent research at the Australian Bureau of Statistics on a potential weighting/imputation approach to implementing these solutions will also be presented.

This talk is part of the Isaac Newton Institute Seminar Series series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.


© 2006-2022, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity