Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
1.
Appl Clin Inform ; 9(1): 62-71, 2018 01.
Article in English | MEDLINE | ID: mdl-29365341

ABSTRACT

BACKGROUND: Clinical decision support systems (CDSS) for cervical cancer prevention are generally limited to identifying patients who are overdue for their next routine/next screening, and they do not provide recommendations for follow-up of abnormal results. We previously developed a CDSS to automatically provide follow-up recommendations based on the American Society of Colposcopy and Cervical Pathology (ASCCP) guidelines for women with both previously normal and abnormal test results leveraging information available in the electronic medical record (EMR). OBJECTIVE: Enhance the CDSS by improving its accuracy and incorporating changes to reflect the latest revision of the guidelines. METHODS: After making enhancements to the CDSS, we evaluated the performance of the clinical recommendations on 393 patients selected through stratified sampling from a set of 3,704 patients in a nonclinical setting. We performed chart review of individual patient's record to evaluate the performance of the system. An expert clinician assisted by a resident manually reviewed the recommendation made by the system and verified whether the recommendations were as per the ASCCP guidelines. RESULTS: The recommendation accuracy of the enhanced CDSS improved to 93%, which is a substantial improvement over the 84% reported previously. A detailed analysis of errors is presented in this article. We fixed the errors identified in this evaluation that were amenable to correction to further improve the accuracy of the system. The source code of the updated CDSS is available at https://github.com/ohnlp/MayoNlpPapCdss. CONCLUSION: We made substantial enhancements to our earlier prototype CDSS with the updated ASCCP guidelines and performed a thorough evaluation in a nonclinical setting to improve the accuracy of the CDSS. The CDSS will be further refined as it is utilized in the practice.


Subject(s)
Decision Support Systems, Clinical , Early Detection of Cancer , Uterine Cervical Neoplasms/diagnosis , Female , Humans , Population Surveillance , Time Factors
2.
Database (Oxford) ; 2017(1)2017 01 01.
Article in English | MEDLINE | ID: mdl-28365720

ABSTRACT

Extracting meaningful relationships with semantic significance from biomedical literature is often a challenging task. BioCreative V track4 challenge for the first time has organized a comprehensive shared task to test the robustness of the text-mining algorithms in extracting semantically meaningful assertions from the evidence statement in biomedical text. In this work, we tested the ability of a rule-based semantic parser to extract Biological Expression Language (BEL) statements from evidence sentences culled out of biomedical literature as part of BioCreative V Track4 challenge. The system achieved an overall best F-measure of 21.29% in extracting the complete BEL statement. For relation extraction, the system achieved an F-measure of 65.13% on test data set. Our system achieved the best performance in five of the six criteria that was adopted for evaluation by the task organizers. Lack of ability to derive semantic inferences, limitation in the rule sets to map the textual extractions to BEL function were some of the reasons for low performance in extracting the complete BEL statement. Post shared task we also evaluated the impact of differential NER components on the ability to extract BEL statements on the test data sets besides making a single change in the rule sets that translate relation extractions into a BEL statement. There is a marked improvement by over 20% in the overall performance of the BELMiner's capability to extract BEL statement on the test set. The system is available as a REST-API at http://54.146.11.205:8484/BELXtractor/finder/. Database URL: http://54.146.11.205:8484/BELXtractor/finder/.


Subject(s)
Data Mining/methods , Natural Language Processing , Software , Semantics
3.
Pac Symp Biocomput ; : 352-63, 2014.
Article in English | MEDLINE | ID: mdl-24297561

ABSTRACT

The creation of biological pathway knowledge bases is largely driven by manual effort to curate based on evidences from the scientific literature. It is highly challenging for the curators to keep up with the literature. Text mining applications have been developed in the last decade to assist human curators to speed up the curation pace where majority of them aim to identify the most relevant papers for curation with little attempt to directly extract the pathway information from text. In this paper, we describe a rule-based literature mining system to extract pathway information from text. We evaluated the system using curated pharmacokinetic (PK) and pharmacodynamic (PD) pathways in PharmGKB. The system achieved an F-measure of 63.11% and 34.99% for entity extraction and event extraction respectively against all PubMed abstracts cited in PharmGKB. It may be possible to improve the system performance by incorporating using statistical machine learning approaches. This study also helped us gain insights into the barriers towards automated event extraction from text for pathway curation.


Subject(s)
Data Mining/statistics & numerical data , Databases, Pharmaceutical/statistics & numerical data , Knowledge Bases , Artificial Intelligence , Computational Biology , Electrophysiological Phenomena , Humans , Metabolic Networks and Pathways , Natural Language Processing , Pharmacokinetics , Pharmacological Phenomena , Semantics
4.
AMIA Jt Summits Transl Sci Proc ; 2013: 149-53, 2013.
Article in English | MEDLINE | ID: mdl-24303255

ABSTRACT

Information extraction (IE), a natural language processing (NLP) task that automatically extracts structured or semi-structured information from free text, has become popular in the clinical domain for supporting automated systems at point-of-care and enabling secondary use of electronic health records (EHRs) for clinical and translational research. However, a high performance IE system can be very challenging to construct due to the complexity and dynamic nature of human language. In this paper, we report an IE framework for cohort identification using EHRs that is a knowledge-driven framework developed under the Unstructured Information Management Architecture (UIMA). A system to extract specific information can be developed by subject matter experts through expert knowledge engineering of the externalized knowledge resources used in the framework.

5.
Ann Allergy Asthma Immunol ; 111(5): 364-9, 2013 Nov.
Article in English | MEDLINE | ID: mdl-24125142

ABSTRACT

BACKGROUND: A significant proportion of children with asthma have delayed diagnosis of asthma by health care providers. Manual chart review according to established criteria is more accurate than directly using diagnosis codes, which tend to under-identify asthmatics, but chart reviews are more costly and less timely. OBJECTIVE: To evaluate the accuracy of a computational approach to asthma ascertainment, characterizing its utility and feasibility toward large-scale deployment in electronic medical records. METHODS: A natural language processing (NLP) system was developed for extracting predetermined criteria for asthma from unstructured text in electronic medical records and then inferring asthma status based on these criteria. Using manual chart reviews as a gold standard, asthma status (yes vs no) and identification date (first date of a "yes" asthma status) were determined by the NLP system. RESULTS: Patients were a group of children (n = 112, 84% Caucasian, 49% girls) younger than 4 years (mean 2.0 years, standard deviation 1.03 years) who participated in previous studies. The NLP approach to asthma ascertainment showed sensitivity, specificity, positive predictive value, negative predictive value, and median delay in diagnosis of 84.6%, 96.5%, 88.0%, 95.4%, and 0 months, respectively; this compared favorably with diagnosis codes, at 30.8%, 93.2%, 57.1%, 82.2%, and 2.3 months, respectively. CONCLUSION: Automated asthma ascertainment from electronic medical records using NLP is feasible and more accurate than traditional approaches such as diagnosis codes. Considering the difficulty of labor-intensive manual record review, NLP approaches for asthma ascertainment should be considered for improving clinical care and research, especially in large-scale efforts.


Subject(s)
Asthma/diagnosis , Electronic Data Processing , Medical Records Systems, Computerized , Natural Language Processing , Child, Preschool , Cohort Studies , Female , Humans , Male
6.
Bioinformatics ; 22(13): 1668-9, 2006 Jul 01.
Article in English | MEDLINE | ID: mdl-16644790

ABSTRACT

A web-based version of the RLIMS-P literature mining system was developed for online mining of protein phosphorylation information from MEDLINE abstracts. The online tool presents extracted phosphorylation objects (phosphorylated proteins, phosphorylation sites and protein kinases) in summary tables and full reports with evidence-tagged abstracts. The tool further allows mapping of phosphorylated proteins to protein entries in the UniProt Knowledgebase based on PubMed ID and/or protein name. The literature mining, coupled with database association, allows retrieval of rich biological information for the phosphorylated proteins and facilitates database annotation of phosphorylation features.


Subject(s)
Computational Biology/methods , Proteins/chemistry , Animals , Databases, Bibliographic , Databases, Factual , Databases, Protein , Humans , Information Storage and Retrieval , Phosphorylation , PubMed , Software , User-Computer Interface
7.
Bioinformatics ; 21 Suppl 1: i319-27, 2005 Jun.
Article in English | MEDLINE | ID: mdl-15961474

ABSTRACT

MOTIVATION: Phosphorylation is an important biochemical reaction that plays a critical role in signal transduction pathways and cell-cycle processes. A text mining system to extract the phosphorylation relation from the literature is reported. The focus of this paper is on the new methods developed and implemented to connect and merge pieces of information about phosphorylation mentioned in different sentences in the text. The effectiveness and accuracy of the system as a whole as well as that of the methods for extraction beyond a clause/sentence is evaluated using an independently annotated dataset, the Phospho.ELM database. The new methods developed to merge pieces of information from different sentences are shown to be effective in significantly raising the recall without much difference in precision.


Subject(s)
Computational Biology/methods , Proteomics/methods , Artificial Intelligence , Database Management Systems , Databases, Protein , Humans , MEDLINE , Natural Language Processing , Pattern Recognition, Automated , Phosphorylation , Signal Transduction , Software
8.
Bioinformatics ; 21(11): 2759-65, 2005 Jun 01.
Article in English | MEDLINE | ID: mdl-15814565

ABSTRACT

MOTIVATION: A large volume of experimental data on protein phosphorylation is buried in the fast-growing PubMed literature. While of great value, such information is limited in databases owing to the laborious process of literature-based curation. Computational literature mining holds promise to facilitate database curation. RESULTS: A rule-based system, RLIMS-P (Rule-based LIterature Mining System for Protein Phosphorylation), was used to extract protein phosphorylation information from MEDLINE abstracts. An annotation-tagged literature corpus developed at PIR was used to evaluate the system for finding phosphorylation papers and extracting phosphorylation objects (kinases, substrates and sites) from abstracts. RLIMS-P achieved a precision and recall of 91.4 and 96.4% for paper retrieval, and of 97.9 and 88.0% for extraction of substrates and sites. Coupling the high recall for paper retrieval and high precision for information extraction, RLIMS-P facilitates literature mining and database annotation of protein phosphorylation.


Subject(s)
Algorithms , Artificial Intelligence , Information Storage and Retrieval/methods , MEDLINE , Natural Language Processing , Phosphorylation , Proteins/classification , Abstracting and Indexing/methods , Database Management Systems , Periodicals as Topic , Semantics , Vocabulary, Controlled
9.
Pac Symp Biocomput ; : 427-38, 2003.
Article in English | MEDLINE | ID: mdl-12603047

ABSTRACT

In this paper we describe a new named entity extraction system. Our system is based on a manually developed set of rules that rely heavily upon some crucial lexical information, linguistic constraints of English, and contextual information. This system achieves state of art results in the protein name detection task, which is what many of the current name extraction systems do. We discuss the need for detection of chemical names and show that we not only obtain a high degree of success in recognizing chemicals but that this task can help improve the precision of protein name detection as well. We use context and surrounding words for categorization of named entities and find the results obtained are encouraging.


Subject(s)
Computational Biology , Terminology as Topic , Abstracting and Indexing , Algorithms , Chemical Phenomena , Chemistry , Information Storage and Retrieval , MEDLINE , Proteins
SELECTION OF CITATIONS
SEARCH DETAIL
...