No Cover Image

Journal article 57 views 9 downloads

Automated data collection in cancer care: State of play among registries in the United Kingdom and Europe

MANUELA ROMAN, Stephen Ali, Nader Ibrahim, Thomas Dobbs, Hayley Hutchings Orcid Logo, Iain Whitaker

Health Information Management Journal, Volume: 55, Issue: 1, Pages: 123 - 131

Swansea University Authors: MANUELA ROMAN, Stephen Ali, Thomas Dobbs, Hayley Hutchings Orcid Logo, Iain Whitaker

  • 71234.VOR.pdf

    PDF | Version of Record

    © The Author(s) 2025. This article is distributed under the terms of the Creative Commons Attribution 4.0 License.

    Download (645.71KB)

Abstract

Background: Automated clinical coding can use statistical or artificial intelligence-based technology to transform unstructured clinical data into clinical codes. These processes have the potential to enhance the quality and accuracy of data collections, save resources and accelerate research. Objec...

Full description

Published in: Health Information Management Journal
ISSN: 1833-3583 1833-3575
Published: SAGE Publications 2026
Online Access: Check full text

URI: https://cronfa.swan.ac.uk/Record/cronfa71234
Abstract: Background: Automated clinical coding can use statistical or artificial intelligence-based technology to transform unstructured clinical data into clinical codes. These processes have the potential to enhance the quality and accuracy of data collections, save resources and accelerate research. Objective: To evaluate the use of automated clinical coding in the United Kingdom (UK) and European cancer registries. Method: An online electronic survey was formulated to evaluate the use and user opinion of automation within cancer registries. The survey was distributed to members of the United Kingdom and Ireland Association of Cancer Registry and the European cancer registries. Data analysis was performed using Microsoft Excel 2015 ® version 15.13.3 in order to summarise the results. Results: Twenty-three of the 117 cancer registries responded to the distributed survey; 15 (12.8%) cancer registries used automation within their registry, mainly in the form of natural language processing or machine learning. Most of the sampled registries (73.3%) used these technologies to automate data collection from pathology reports; 87% of respondents reported automation as efficient; and 26.1% reported improved data quality; 12 (52.1%) of cancer registries still manually checked all the automations; and 17 (74%) respondents believed that the algorithms for difficult tasks require further development. Conclusion: Various computer-based algorithms have been used for automated clinical coding in the UK and European cancer registries in the past few decades; however, to date there are no published data to validate its use. Further research and development of these technologies is needed to ensure external validity and maximise the potential use within other cancer registries globally. Implications for health information management practice: It is clear that while automation can be advantageous in areas of clinical coding, the role of the “human” (HIMs and clinical coders) in coding and classifying registry data, and in overseeing the transition, will be required for some time yet.
Keywords: registries; registry data; natural language processing; research; clinical coding; medical record system; automated; health information management; cancer registries; computer assisted coding; automated coding
College: Faculty of Medicine, Health and Life Sciences
Issue: 1
Start Page: 123
End Page: 131