Record linkage with complete-count historical census data
- đ¤ Speaker: Evan Roberts (University of Minnesota)
- đ Date & Time: Tuesday 13 September 2016, 10:30 - 11:00
- đ Venue: Seminar Room 1, Newton Institute
Abstract
Many areas of social science research benefit from being able to follow individuals and families across time, to observe changes in social behavior across at least part of the life course. Since the 1920s, and particularly since World War II longitudinal social surveys have become a common tool of social scientists. Despite their many benefits these surveys only allow us to study a limited number of birth cohorts, and few of these cohorts are entirely deceased. Comparison across multiple cohorts, and across long periods of the life course is not always possible as social scientists must follow their cohorts in real time.
Historical data on past populations allows us to reconstruct life-course panels for past cohorts. In the past few years complete transcriptions of census data from sequential censuses has become available in several countries including Britain, Canada, Iceland, Norway, Sweden, and the United States. The Minnesota Population Center is developing tools to create large datasets of people linked between at least two censuses. There are multiple challenges in creating this form of historical data, centering around the lack of unique identifiers. People must be identified by a combination of characteristics recorded with error including names, birthplaces, date of birth, and ethnic background. Although linkage rates are low by comparison to modern longitudinal surveys it has proved possible to create samples that are reasonably representative of the origin or terminal population. This paper describes the sources used in creating linked census datasets, the domain-specific issues in data linkage, and demonstrates some of the applications of historical longitudinal data in studying social mobility, and mortality in the past.
Related Links
- https://usa.ipums.org/usa/complete_count.shtml – Description of available US data resources
- http://www.nappdata.org – Description of available international data resources
Series This talk is part of the Isaac Newton Institute Seminar Series series.
Included in Lists
- All CMS events
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Chris Davis' list
- dh539
- Featured lists
- INI info aggregator
- Interested Talks
- Isaac Newton Institute Seminar Series
- ndk22's list
- ob366-ai4er
- rp587
- School of Physical Sciences
- Seminar Room 1, Newton Institute
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Evan Roberts (University of Minnesota)
Tuesday 13 September 2016, 10:30-11:00