Your browser doesn't support javascript.
loading
Obtención automática de palabras clave en textos clínicos: una aplicación de procesamiento del lenguaje natural a datos masivos de sospecha diagnóstica en Chile / Automatic keyword retrieval from clinical texts: an application of natural language processing to massive data of Chilean suspected diagnosis
Villena, Fabián; Dunstan, Jocelyn.
  • Villena, Fabián; Universidad de Chile. Facultad de Medicina. Centro de Informática Médica y Telemedicina. Santiago. CL
  • Dunstan, Jocelyn; Universidad de Chile. Facultad de Medicina. Centro de Informática Médica y Telemedicina. Santiago. CL
Rev. méd. Chile ; 147(10): 1229-1238, oct. 2019. tab, graf
Article in Spanish | LILACS | ID: biblio-1058589
ABSTRACT

Background:

Free-text imposes a challenge in health data analysis since the lack of structure makes the extraction and integration of information difficult, particularly in the case of massive data. An appropriate machine-interpretation of electronic health records in Chile can unleash knowledge contained in large volumes of clinical texts, expanding clinical management and national research capabilities.

Aim:

To illustrate the use of a weighted frequency algorithm to find keywords. This finding was carried out in the diagnostic suspicion field of the Chilean specialty consultation waiting list, for diseases not covered by the Chilean Explicit Health Guarantees plan. Material and

Methods:

The waiting lists for a first specialty consultation for the period 2008-2018 were obtained from 17 out of 29 Chilean health services, and total of 2,592,925 diagnostic suspicions were identified. A natural language processing technique called Term Frequency-Inverse Document Frequency was used for the retrieval of diagnostic suspicion keywords.

Results:

For each specialty, four key words with the highest weighted frequency were determined. Word clouds showing words weighted by their importance were created to obtain a visual representation. These are available at cimt.uchile.cl/lechile/.

Conclusions:

The algorithm allowed to summarize unstructured clinical free-text data, improving its usefulness and accessibility.
Subject(s)


Full text: Available Index: LILACS (Americas) Main subject: Natural Language Processing / Electronic Data Processing / Medical Records / Information Storage and Retrieval / Diagnostic Techniques and Procedures / Data Mining Type of study: Diagnostic study / Prognostic study Limits: Humans Country/Region as subject: South America / Chile Language: Spanish Journal: Rev. méd. Chile Journal subject: Medicine Year: 2019 Type: Article Affiliation country: Chile Institution/Affiliation country: Universidad de Chile/CL

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Index: LILACS (Americas) Main subject: Natural Language Processing / Electronic Data Processing / Medical Records / Information Storage and Retrieval / Diagnostic Techniques and Procedures / Data Mining Type of study: Diagnostic study / Prognostic study Limits: Humans Country/Region as subject: South America / Chile Language: Spanish Journal: Rev. méd. Chile Journal subject: Medicine Year: 2019 Type: Article Affiliation country: Chile Institution/Affiliation country: Universidad de Chile/CL