Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
BMC Bioinformatics ; 23(1): 558, 2022 Dec 23.
Artículo en Inglés | MEDLINE | ID: mdl-36564712

RESUMEN

BACKGROUND: In order to detect threats to public health and to be well-prepared for endemic and pandemic illness outbreaks, countries usually rely on event-based surveillance (EBS) and indicator-based surveillance systems. Event-based surveillance systems are key components of early warning systems and focus on fast capturing of data to detect threat signals through channels other than traditional surveillance. In this study, we develop Natural Language Processing tools that can be used within EBS systems. In particular, we focus on information extraction techniques that enable digital surveillance to monitor Internet data and social media. RESULTS: We created an annotated Spanish corpus from ProMED-mail health reports regarding disease outbreaks in Latin America. The corpus has been used to train algorithms for two information extraction tasks: named entity recognition and relation extraction. The algorithms, based on deep learning and rules, have been applied to recognize diseases, hosts, and geographical locations where a disease is occurring, among other entities and relations. In addition, an in-depth analysis of micro-average F1 metrics shows the suitability of our approaches for both tasks. CONCLUSIONS: The annotated corpus and algorithms presented could leverage the development of automated tools for extracting information from news and health reports written in Spanish. Moreover, this framework could be useful within EBS systems to support the early detection of Latin American disease outbreaks.


Asunto(s)
Brotes de Enfermedades , Salud Pública , Humanos , América Latina/epidemiología , Procesamiento de Lenguaje Natural , Minería de Datos/métodos
2.
J Digit Imaging ; 32(1): 19-29, 2019 02.
Artículo en Inglés | MEDLINE | ID: mdl-30097747

RESUMEN

We present a methodology for the automatic recognition of negated findings in radiological reports considering morphological, syntactic, and semantic information. In order to achieve this goal, a series of rules for processing lexical and syntactic information was elaborated. This required development of an electronic dictionary of medical terminology and informatics grammars. Pertinent information for the assembly of the specialized dictionary was extracted from the ontology SNOMED CT and a medical dictionary (RANM, 2012). Likewise, a general language dictionary was also included. Lexicon-Grammar (LG), proposed by Gross (1975; Cahiers de l'institut de linguistique de Louvain, 24. 23-41 1998), was used to set up the database, which allowed an exhaustive description of the argument structure of predicates projected by lexical units. Computational framework was carried out with NooJ, a free software developed by Silberztein (Silberztein and Noo 2018, 2016), which has various utilities for treating natural language, such as morphological and syntactic grammar, as well as dictionaries. This methodology was compared with a Spanish version of NegEx (Chapman et al. Journal of Biomedical Informatics, 34(5):301-310 2001; Stricker 2016). Results show that there are minimal differences in favor of the algorithm developed using NooJ, but the quality and specificity of the data improves if lexical-grammatical information is added.


Asunto(s)
Procesamiento Automatizado de Datos/métodos , Registros Electrónicos de Salud , Lingüística/métodos , Radiología/métodos , Terminología como Asunto , Humanos , Lenguaje
3.
Stud Health Technol Inform ; 216: 634-8, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26262128

RESUMEN

Automatic detection of relevant terms in medical reports is useful for educational purposes and for clinical research. Natural language processing (NLP) techniques can be applied in order to identify them. In this work we present an approach to classify radiology reports written in Spanish into two sets: the ones that indicate pathological findings and the ones that do not. In addition, the entities corresponding to pathological findings are identified in the reports. We use RadLex, a lexicon of English radiology terms, and NLP techniques to identify the occurrence of pathological findings. Reports are classified using a simple algorithm based on the presence of pathological findings, negation and hedge terms. The implemented algorithms were tested with a test set of 248 reports annotated by an expert, obtaining a best result of 0.72 F1 measure. The output of the classification task can be used to look for specific occurrences of pathological findings.


Asunto(s)
Minería de Datos/métodos , Procesamiento de Lenguaje Natural , Interpretación de Imagen Radiográfica Asistida por Computador/métodos , Sistemas de Información Radiológica/clasificación , Terminología como Asunto , Traducción , Algoritmos , Aprendizaje Automático , Semántica , España , Vocabulario Controlado
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...