Your browser doesn't support javascript.
loading
[Automatic ICD-10 coding : Natural language processing for German MRI reports]. / Automatische ICD-10-Codierung : Natural Language Processing für deutsche MRT-Befunde.
Mittermeier, Andreas; Aßenmacher, Matthias; Schachtner, Balthasar; Grosu, Sergio; Dakovic, Vladana; Kandratovich, Viktar; Sabel, Bastian; Ingrisch, Michael.
Affiliation
  • Mittermeier A; Klinik und Poliklinik für Radiologie, LMU Klinikum, LMU München, München, Deutschland. Andreas.Mittermeier@med.uni-muenchen.de.
  • Aßenmacher M; Munich Center for Machine Learning (MCML), München, Deutschland. Andreas.Mittermeier@med.uni-muenchen.de.
  • Schachtner B; Institut für Statistik, LMU München, München, Deutschland.
  • Grosu S; Klinik und Poliklinik für Radiologie, LMU Klinikum, LMU München, München, Deutschland.
  • Dakovic V; Munich Center for Machine Learning (MCML), München, Deutschland.
  • Kandratovich V; Klinik und Poliklinik für Radiologie, LMU Klinikum, LMU München, München, Deutschland.
  • Sabel B; Klinik und Poliklinik für Radiologie, LMU Klinikum, LMU München, München, Deutschland.
  • Ingrisch M; Klinik und Poliklinik für Radiologie, LMU Klinikum, LMU München, München, Deutschland.
Radiologie (Heidelb) ; 64(10): 793-800, 2024 Oct.
Article in De | MEDLINE | ID: mdl-39120724
ABSTRACT

BACKGROUND:

The medical coding of radiology reports is essential for a good quality of care and correct billing, but at the same time a complex and error-prone task.

OBJECTIVE:

To assess the performance of natural language processing (NLP) for ICD-10 coding of German radiology reports using fine tuning of suitable language models. MATERIAL AND

METHODS:

This retrospective study included all magnetic resonance imaging (MRI) radiology reports acquired at our institution between 2010 and 2020. The codes on discharge ICD-10 were matched to the corresponding reports to construct a dataset for multiclass classification. Fine tuning of GermanBERT and flanT5 was carried out on the total dataset (dstotal) containing 1035 different ICD-10 codes and 2 reduced subsets containing the 100 (ds100) and 50 (ds50) most frequent codes. The performance of the model was assessed using top­k accuracy for k = 1, 3 and 5. In an ablation study both models were trained on the accompanying metadata and the radiology report alone.

RESULTS:

The total dataset consisted of 100,672 radiology reports, the reduced subsets ds100 of 68,103 and ds50 of 52,293 reports. The performance of the model increased when several of the best predictions of the model were taken into consideration, when the number of target classes was reduced and the metadata were combined with the report. The flanT5 outperformed GermanBERT across all datasets and metrics and was is suited as a medical coding assistant, achieving a top 3 accuracy of nearly 70% in the real-world dataset dstotal.

CONCLUSION:

Finely tuned language models can reliably predict ICD-10 codes of German magnetic resonance imaging (MRI) radiology reports across various settings. As a coding assistant flanT5 can guide medical coders to make informed decisions and potentially reduce the workload.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Natural Language Processing / Magnetic Resonance Imaging / International Classification of Diseases / Clinical Coding Limits: Humans Country/Region as subject: Europa Language: De Journal: Radiologie (Heidelb) Year: 2024 Document type: Article Country of publication: Germany

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Natural Language Processing / Magnetic Resonance Imaging / International Classification of Diseases / Clinical Coding Limits: Humans Country/Region as subject: Europa Language: De Journal: Radiologie (Heidelb) Year: 2024 Document type: Article Country of publication: Germany