Your browser doesn't support javascript.
loading
Examining the quality and population representativeness of linked survey and administrative data: guidance and illustration using linked 1958 National Child Development Study and Hospital Episode Statistics data.
Silverwood, Richard J; Rajah, Nasir; Calderwood, Lisa; De Stavola, Bianca L; Harron, Katie; Ploubidis, George B.
Afiliación
  • Silverwood RJ; Centre for Longitudinal Studies, UCL Social Research Institute, 20 Bedford Way, London WC1H 0AL.
  • Rajah N; Centre for Longitudinal Studies, UCL Social Research Institute, 20 Bedford Way, London WC1H 0AL.
  • Calderwood L; Centre for Longitudinal Studies, UCL Social Research Institute, 20 Bedford Way, London WC1H 0AL.
  • De Stavola BL; Population, Policy & Practice Research and Teaching Department, UCL Great Ormond Street Institute of Child Health, 30 Guilford Street, London WC1N 1EH.
  • Harron K; Population, Policy & Practice Research and Teaching Department, UCL Great Ormond Street Institute of Child Health, 30 Guilford Street, London WC1N 1EH.
  • Ploubidis GB; Centre for Longitudinal Studies, UCL Social Research Institute, 20 Bedford Way, London WC1H 0AL.
Int J Popul Data Sci ; 9(1): 2137, 2024.
Article en En | MEDLINE | ID: mdl-38425790
ABSTRACT

Introduction:

Recent years have seen an increase in linkages between survey and administrative data. It is important to evaluate the quality of such data linkages to discern the likely reliability of ensuing research. Evaluation of linkage quality and bias can be conducted using different approaches, but many of these are not possible when there is a separation of processes for linkage and analysis to help preserve privacy, as is typically the case in the UK (and elsewhere).

Objectives:

We aimed to describe a suite of generalisable methods to evaluate linkage quality and population representativeness of linked survey and administrative data which remain tractable when users of the linked data are not party to the linkage process itself. We emphasise issues particular to longitudinal survey data throughout.

Methods:

Our proposed approaches cover several areas i) Linkage rates, ii) Selection into response, linkage consent and successful linkage, iii) Linkage quality, and iv) Linked data population representativeness. We illustrate these methods using a recent linkage between the 1958 National Child Development Study (NCDS; a cohort following an initial 17,415 people born in Great Britain in a single week of 1958) and Hospital Episode Statistics (HES) databases (containing important information regarding admissions, accident and emergency attendances and outpatient appointments at NHS hospitals in England).

Results:

Our illustrative analyses suggest that the linkage quality of the NCDS-HES data is high and that the linked sample maintains an excellent level of population representativeness with respect to the single dimension we assessed.

Conclusions:

Through this work we hope to encourage providers and users of linked data resources to undertake and publish thorough evaluations. We further hope that providing illustrative analyses using linked NCDS-HES data will improve the quality and transparency of research using this particular linked data resource.
Asunto(s)
Palabras clave

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Desarrollo Infantil / Registro Médico Coordinado Idioma: En Revista: Int J Popul Data Sci Año: 2024 Tipo del documento: Article

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Desarrollo Infantil / Registro Médico Coordinado Idioma: En Revista: Int J Popul Data Sci Año: 2024 Tipo del documento: Article