[Automatic ICD-10 coding : Natural language processing for German MRI reports].

Mittermeier, Andreas; Aßenmacher, Matthias; Schachtner, Balthasar; Grosu, Sergio; Dakovic, Vladana; Kandratovich, Viktar; Sabel, Bastian; Ingrisch, Michael

Original title (German): Automatische ICD-10-Codierung: Natural Language Processing für deutsche MRT-Befunde.

Affiliation

Mittermeier A; Klinik und Poliklinik für Radiologie, LMU Klinikum, LMU München, München, Deutschland. Andreas.Mittermeier@med.uni-muenchen.de.
Aßenmacher M; Munich Center for Machine Learning (MCML), München, Deutschland.
Schachtner B; Institut für Statistik, LMU München, München, Deutschland.
Grosu S; Klinik und Poliklinik für Radiologie, LMU Klinikum, LMU München, München, Deutschland.
Dakovic V; Munich Center for Machine Learning (MCML), München, Deutschland.
Kandratovich V; Klinik und Poliklinik für Radiologie, LMU Klinikum, LMU München, München, Deutschland.
Sabel B; Klinik und Poliklinik für Radiologie, LMU Klinikum, LMU München, München, Deutschland.
Ingrisch M; Klinik und Poliklinik für Radiologie, LMU Klinikum, LMU München, München, Deutschland.

Radiologie (Heidelb) ; 64(10): 793-800, 2024 Oct.

Article in German | MEDLINE | ID: mdl-39120724

ABSTRACT

BACKGROUND:

The medical coding of radiology reports is essential for a good quality of care and correct billing, but at the same time a complex and error-prone task.

OBJECTIVE:

To assess the performance of natural language processing (NLP) for ICD-10 coding of German radiology reports by fine-tuning suitable language models.

MATERIAL AND METHODS:

This retrospective study included all magnetic resonance imaging (MRI) radiology reports acquired at our institution between 2010 and 2020. The ICD-10 discharge codes were matched to the corresponding reports to construct a dataset for multiclass classification. GermanBERT and flanT5 were fine-tuned on the total dataset (dstotal), containing 1035 different ICD-10 codes, and on two reduced subsets containing the 100 (ds100) and 50 (ds50) most frequent codes. Model performance was assessed using top-k accuracy for k = 1, 3 and 5. In an ablation study, both models were also trained on the accompanying metadata alone and on the radiology report alone.
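
For illustration, the two mechanical steps named in the methods, restricting a dataset to its most frequent codes and scoring predictions with top-k accuracy, can be sketched as follows. This is a minimal sketch and not the authors' implementation: the function names `keep_most_frequent` and `top_k_accuracy`, the data layout, and the toy reports, codes and scores are all assumptions made up for this example.

```python
from collections import Counter


def keep_most_frequent(reports, n):
    """Keep only (text, code) pairs whose code is among the n most frequent codes."""
    counts = Counter(code for _, code in reports)
    frequent = {code for code, _ in counts.most_common(n)}
    return [(text, code) for text, code in reports if code in frequent]


def top_k_accuracy(scores, labels, k):
    """Fraction of samples whose true label is among the k highest-scoring classes.

    scores: one dict per sample, mapping class name -> model score.
    labels: the true class name for each sample.
    """
    hits = 0
    for sample_scores, label in zip(scores, labels):
        # Rank class names from highest to lowest score.
        ranked = sorted(sample_scores, key=sample_scores.get, reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Toy data, invented for illustration only (hypothetical reports and scores).
toy_reports = [("r1", "M51.1"), ("r2", "M51.1"), ("r3", "G35"), ("r4", "I63.9")]
toy_scores = [
    {"M51.1": 0.7, "G35": 0.2, "I63.9": 0.1},
    {"M51.1": 0.3, "G35": 0.5, "I63.9": 0.2},
    {"M51.1": 0.4, "G35": 0.1, "I63.9": 0.5},
    {"M51.1": 0.2, "G35": 0.3, "I63.9": 0.5},
]
toy_labels = ["M51.1", "M51.1", "G35", "I63.9"]

if __name__ == "__main__":
    print(keep_most_frequent(toy_reports, 1))
    for k in (1, 2, 3):
        print(k, top_k_accuracy(toy_scores, toy_labels, k))
```

On this toy data the accuracy rises from 0.5 at k = 1 to 0.75 at k = 2 and 1.0 at k = 3, which illustrates why considering several of the best predictions can only keep or raise the score.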

RESULTS:

The total dataset consisted of 100,672 radiology reports; the reduced subsets ds100 and ds50 comprised 68,103 and 52,293 reports, respectively. Model performance increased when several of the model's best predictions were taken into consideration, when the number of target classes was reduced, and when the metadata were combined with the report. flanT5 outperformed GermanBERT across all datasets and metrics and is suited as a medical coding assistant, achieving a top-3 accuracy of nearly 70% on the real-world dataset dstotal.

CONCLUSION:

Fine-tuned language models can reliably predict ICD-10 codes of German magnetic resonance imaging (MRI) radiology reports across various settings. As a coding assistant, flanT5 can guide medical coders to make informed decisions and potentially reduce the workload.

Subject(s)

Clinical Coding; International Classification of Diseases; Magnetic Resonance Imaging; Natural Language Processing; Germany; Magnetic Resonance Imaging/methods; Humans; Retrospective Studies; Clinical Coding/methods

Key words

Artificial intelligence; Language models; Medical coding; Natural language processing; Radiology reports

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Natural Language Processing / Magnetic Resonance Imaging / International Classification of Diseases / Clinical Coding Limits: Humans Country/Region as subject: Europe Language: German Journal: Radiologie (Heidelb) Year: 2024 Document type: Article Country of publication: Germany
