Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
BMC Med Inform Decis Mak ; 20(Suppl 4): 283, 2020 12 14.
Article in English | MEDLINE | ID: mdl-33317518

ABSTRACT

BACKGROUND: Semantic web technology has been applied widely in the biomedical informatics field. Large numbers of biomedical datasets are available online in the resource description framework (RDF) format. Semantic relationship mining among genes, disorders, and drugs is widely used in, for example, precision medicine and drug repositioning. However, most of the existing studies focused on a single dataset. It is not easy to find the most current relationships among disorder-gene-drug relationships since the relationships are distributed in heterogeneous datasets. How to mine their semantic relationships from different biomedical datasets is an important issue. METHODS: First, a variety of biomedical datasets were converted into RDF triple data; then, multisource biomedical datasets were integrated into a storage system using a data integration algorithm. Second, nine query patterns among genes, disorders, and drugs from different biomedical datasets were designed. Third, the gene-disorder-drug semantic relationship mining algorithm is presented. This algorithm can query the relationships among various entities from different datasets. RESULTS AND CONCLUSIONS: We focused on mining the putative and the most current disorder-gene-drug relationships about Parkinson's disease (PD). The results demonstrate that our method has significant advantages in mining and integrating multisource heterogeneous biomedical datasets. Twenty-five new relationships among the genes, disorders, and drugs were mined from four different datasets. The query results showed that most of them came from different datasets. The precision of the method increased by 2.51% compared to that of the multisource linked open data fusion method presented in the 4th International Workshop on Semantics-Powered Data Mining and Analytics (SEPDA 2019). Moreover, the number of query results increased by 7.7%, and the number of correct queries increased by 9.5%.


Subject(s)
Pharmaceutical Preparations , Semantics , Algorithms , Data Mining , Humans , Research Design
2.
Healthcare (Basel) ; 8(3)2020 Aug 28.
Article in English | MEDLINE | ID: mdl-32872330

ABSTRACT

The widespread use of social media provides a large amount of data for public sentimentanalysis. Based on social media data, researchers can study public opinions on humanpapillomavirus (HPV) vaccines on social media using machine learning-based approaches that willhelp us understand the reasons behind the low vaccine coverage. However, social media data isusually unannotated, and data annotation is costly. The lack of an abundant annotated dataset limitsthe application of deep learning methods in effectively training models. To tackle this problem, wepropose three transfer learning approaches to analyze the public sentiment on HPV vaccines onTwitter. One was transferring static embeddings and embeddings from language models (ELMo)and then processing by bidirectional gated recurrent unit with attention (BiGRU-Att), called DWEBiGRU-Att. The others were fine-tuning pre-trained models with limited annotated data, called finetuninggenerative pre-training (GPT) and fine-tuning bidirectional encoder representations fromtransformers (BERT). The fine-tuned GPT model was built on the pre-trained generative pretraining(GPT) model. The fine-tuned BERT model was constructed with BERT model. Theexperimental results on the HPV dataset demonstrated the efficacy of the three methods in thesentiment analysis of the HPV vaccination task. The experimental results on the HPV datasetdemonstrated the efficacy of the methods in the sentiment analysis of the HPV vaccination task. Thefine-tuned BERT model outperforms all other methods. It can help to find strategies to improvevaccine uptake.

3.
Healthcare (Basel) ; 8(3)2020 Aug 24.
Article in English | MEDLINE | ID: mdl-32847006

ABSTRACT

To the on-site nursing staff or field management in prehospital emergency care, it seems baffling to conduct more targeted checklist tests for a specific disease. To address this problem, we proposed a decision support method for prehospital emergency care based on ranking the importance of physiological variables. We used multiple logistic regression models to explore the effects of various physiological variables on diseases based on the area under the curve (AUC) value. We implemented the method on the intensive care database (i.e., the Medical Information Mart for Intensive Care (MIMIC-III) database) and explored the importance of 17 physiological variables for 24 diseases, both chronic and acute. We included 33,798 adult patients, using the full physiological dataset as experiment data. We ranked the importance of the physiological variables related to the diseases according to the experiments' AUC value. We discussed which physiological variables should be considered more important in adult intensive care units (ICUs) for prehospital emergency care conditions. We also discussed the relationships among the diseases based on ranking the importance of physiological variables. We used large-scale ICU patient data to obtain a cohort of physiological variables related to specific diseases. Ranking a cohort of physiological variables is a cost-effective means of reducing morbidity and mortality under prehospital emergency care conditions.

4.
J Am Med Inform Assoc ; 27(7): 1046-1056, 2020 07 01.
Article in English | MEDLINE | ID: mdl-32626903

ABSTRACT

OBJECTIVE: The goal of this study is to develop a robust Time Event Ontology (TEO), which can formally represent and reason both structured and unstructured temporal information. MATERIALS AND METHODS: Using our previous Clinical Narrative Temporal Relation Ontology 1.0 and 2.0 as a starting point, we redesigned concept primitives (clinical events and temporal expressions) and enriched temporal relations. Specifically, 2 sets of temporal relations (Allen's interval algebra and a novel suite of basic time relations) were used to specify qualitative temporal order relations, and a Temporal Relation Statement was designed to formalize quantitative temporal relations. Moreover, a variety of data properties were defined to represent diversified temporal expressions in clinical narratives. RESULTS: TEO has a rich set of classes and properties (object, data, and annotation). When evaluated with real electronic health record data from the Mayo Clinic, it could faithfully represent more than 95% of the temporal expressions. Its reasoning ability was further demonstrated on a sample drug adverse event report annotated with respect to TEO. The results showed that our Java-based TEO reasoner could answer a set of frequently asked time-related queries, demonstrating that TEO has a strong capability of reasoning complex temporal relations. CONCLUSION: TEO can support flexible temporal relation representation and reasoning. Our next step will be to apply TEO to the natural language processing field to facilitate automated temporal information annotation, extraction, and timeline reasoning to better support time-based clinical decision-making.


Subject(s)
Biological Ontologies , Electronic Health Records , Time , Decision Support Systems, Clinical , Humans , Natural Language Processing , Semantic Web
5.
Health Informatics J ; 26(2): 726-737, 2020 06.
Article in English | MEDLINE | ID: mdl-30843449

ABSTRACT

The Research Domain Criteria, launched by the National Institute of Mental Health, is a new dimensional and interdisciplinary research framework for mental disorders. The Research Domain Criteria matrix is its core part. Since an ontology has the strengths of supporting semantic inferencing and automatic data processing, we would like to transform the Research Domain Criteria matrix into an ontological structure. In terms of data normalization, which is the essential part of an ontology representation, the Research Domain Criteria elements (mainly in the Units of Analysis) have some limitations. In this article, we propose a series of solutions to improve data normalization of the Research Domain Criteria elements in the Units of Analysis, including leveraging standard terminologies (i.e. the Unified Medical Language System Metathesaurus), context-combining queries, and domain expertise. The evaluation results show the positive (Yes) percentage is more than 80 percent, indicating our work is favorably received by the mental health professionals, and we have formed a good data foundation for the Research Domain Criteria ontological representation in the future work.


Subject(s)
Semantics , Unified Medical Language System , Databases, Factual/statistics & numerical data , Humans , Research
SELECTION OF CITATIONS
SEARCH DETAIL
...