BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Talks.cam//talks.cam.ac.uk//
X-WR-CALNAME:Talks.cam
BEGIN:VEVENT
SUMMARY:Evaluating Data Linkage: Creating longitudinal synthetic data to p
 rovide a gold-standard linked dataset - Tom Dalton (University of St Andre
 ws)
DTSTART:20161020T143000Z
DTEND:20161020T153000Z
UID:TALK68621@talks.cam.ac.uk
CONTACT:INI IT
DESCRIPTION:When performing probabilistic data linkage on real-world
  data\, the very fact that we need to link it means we do not know the
  true linkage. Therefore\, the success of our linkage approach is
  difficult to evaluate. Often small hand-linked datasets are used as a
  'gold-standard' for the linkage approach to be evaluated against.
  However\, errors in the hand-linkage and the limited size and number
  of these datasets do not allow for robust evaluation. The research
  focuses on the creation of longitudinal
  synthetic datasets for the domain of population reconstruction. In this
  talk I will cover the previous and current models we have created to
  achieve this and detail how we: define the desired behaviour in the
  model to avoid clashes between input distributions\, verify the
  statistical correctness of the population\, and initialise the model
  such that
  the starting population meets the temporal requirements of the desired
  behaviour. To conclude\, I will outline the model's intended use for
  linkage evaluation and its other potential uses\, and take questions.
LOCATION:Seminar Room 2\, Newton Institute
END:VEVENT
END:VCALENDAR
