Journal article 73 views 3 downloads
Improving opportunities for data linkage within Children Looked After administrative records in Wales
International Journal of Population Data Science, Volume: 10, Issue: 1
Swansea University Authors:
Grace Bailey , Alexandra Lee, Saira Ahmed, Ieuan Scanlon, Laura Cowley, Ian Farr, Caroline Brooks
, Laura North, Lucy Griffiths
DOI (Published version): 10.23889/ijpds.v10i1.2383
Abstract
IntroductionLinkage of population-based administrative data is a powerful tool for studying important public issues. To overcome confidentiality and disclosure issues, records are de-identified and allocated a unique identifier. Within the Secure Anonymised Information Linkage (SAIL) Databank, these...
Published in: | International Journal of Population Data Science |
---|---|
ISSN: | 2399-4908 |
Published: |
Swansea University
2025
|
Online Access: |
Check full text
|
URI: | https://cronfa.swan.ac.uk/Record/cronfa68946 |
Abstract: |
IntroductionLinkage of population-based administrative data is a powerful tool for studying important public issues. To overcome confidentiality and disclosure issues, records are de-identified and allocated a unique identifier. Within the Secure Anonymised Information Linkage (SAIL) Databank, these are known as Anonymised Linking Fields (ALFs). Assignment of an ALF enables linkage of individuals across multiple routinely collected datasets. Within the Children Looked After (CLA) Wales dataset, only 37% of the children have an ALF, limiting linkage to other datasets and, as a result, potential research. There are also other known data issues, including discrepancies with the week of births, duplicate identifiers and year-on-year changes in identifiers.ObjectivesTo improve accuracy and availability of the ALFs in the CLA dataset, and overall research quality.MethodsUsing several datasets within the SAIL Databank, we developed a six-step CLA matching algorithm to improve the ALF matching rate and correct for data errors. To assess the performance of our algorithm, we benchmarked against routine ALFs already identified via the algorithm currently used by SAIL.ResultsOur algorithm increased ALF matching by 25%, assigning 61% of individuals an ALF. Inconsistent weeks of birth, and incorrect and duplicate identifiers were resolved. When benchmarking against the current ALF-assigning algorithm used by SAIL, our algorithm had an overall sensitivity of 90%.ConclusionWe have developed an algorithm which demonstrates comparable ALF matching performance to the current algorithm used within SAIL, and which greatly improves the ALF matching in the CLA dataset. This algorithm may help to overcome potential bias due to missing data, and increases the potential for linkage to other datasets. Further development and refinement could result in the algorithm being applied to other datasets in SAIL. |
---|---|
Keywords: |
administrative data linkage; children looked after; SAIL Databank |
College: |
Faculty of Medicine, Health and Life Sciences |
Funders: |
This work was supported by Health and Care ResearchWales and Administrative Data Research (ADR) Wales. LJGis a member of the Children’s Social Care Research andDevelopment Centre (CASCADE) partnership, which receivesinfrastructure funding from Health and Care Research Wales(HCRW) (517199). LEC is a research fellow, funded by Healthand Care Research Wales (SCF-22-07). |
Issue: |
1 |