Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 129
Filter
1.
J Biomed Inform ; 100: 103318, 2019 12.
Article in English | MEDLINE | ID: mdl-31655273

ABSTRACT

BACKGROUND: Manually curating standardized phenotypic concepts such as Human Phenotype Ontology (HPO) terms from narrative text in electronic health records (EHRs) is time consuming and error prone. Natural language processing (NLP) techniques can facilitate automated phenotype extraction and thus improve the efficiency of curating clinical phenotypes from clinical texts. While individual NLP systems can perform well for a single cohort, an ensemble-based method might shed light on increasing the portability of NLP pipelines across different cohorts. METHODS: We compared four NLP systems, MetaMapLite, MedLEE, ClinPhen and cTAKES, and four ensemble techniques, including intersection, union, majority-voting and machine learning, for extracting generic phenotypic concepts. We addressed two important research questions regarding automated phenotype recognition. First, we evaluated the performance of different approaches in identifying generic phenotypic concepts. Second, we compared the performance of different methods to identify patient-specific phenotypic concepts. To better quantify the effects caused by concept granularity differences on performance, we developed a novel evaluation metric that considered concept hierarchies and frequencies. Each of the approaches was evaluated on a gold standard set of clinical documents annotated by clinical experts. One dataset containing 1,609 concepts derived from 50 clinical notes from two different institutions was used in both evaluations, and an additional dataset of 608 concepts derived from 50 case report abstracts obtained from PubMed was used for evaluation of identifying generic phenotypic concepts only. RESULTS: For generic phenotypic concept recognition, the top three performers in the NYP/CUIMC dataset are union ensemble (F1, 0.634), training-based ensemble (F1, 0.632), and majority vote-based ensemble (F1, 0.622). In the Mayo dataset, the top three are majority vote-based ensemble (F1, 0.642), cTAKES (F1, 0.615), and MedLEE (F1, 0.559). In the PubMed dataset, the top three are majority vote-based ensemble (F1, 0.719), training-based (F1, 0.696) and MetaMapLite (F1, 0.694). For identifying patient specific phenotypes, the top three performers in the NYP/CUIMC dataset are majority vote-based ensemble (F1, 0.610), MedLEE (F1, 0.609), and training-based ensemble (F1, 0.585). In the Mayo dataset, the top three are majority vote-based ensemble (F1, 0.604), cTAKES (F1, 0.531) and MedLEE (F1, 0.527). CONCLUSIONS: Our study demonstrates that ensembles of natural language processing can improve both generic phenotypic concept recognition and patient specific phenotypic concept identification over individual systems. Among the individual NLP systems, each individual system performed best when they were applied in the dataset that they were primary designed for. However, combining multiple NLP systems to create an ensemble can generally improve the performance. Specifically, the ensemble can increase the results reproducibility across different cohorts and tasks, and thus provide a more portable phenotyping solution compared to individual NLP systems.


Subject(s)
Natural Language Processing , Phenotype , Datasets as Topic , Electronic Health Records , Humans , Reproducibility of Results
2.
Stud Health Technol Inform ; 264: 1393-1397, 2019 Aug 21.
Article in English | MEDLINE | ID: mdl-31438155

ABSTRACT

This study used Amazon Mechanical Turk to crowdsource public opinions about sharing medical records for clinical research. The 1,508 valid respondents comprised 58.7% males, 54% without college degrees, 41.5% students or unemployed, and 84.3% under 40 years old. More than 74% were somewhat willing to share de-identified records. Education level, employment status, and gender were identified as significant predictors of willingness to share one's own or one's family's medical records (partially identifiable, completely identifiable, or de-identified). Thematic analysis applied to respondent comments uncovered barriers to sharing, including the inability to track uses and users of their information, potential harm (such as identity theft or healthcare denial), lack of trust, and worries about information misuse. Our study suggests that implementing reliable medical record de-identification and emphasizing trust development are essential to addressing such concerns. Amazon Mechanical Turk proved cost-effective for collecting public opinions with short surveys.


Subject(s)
Crowdsourcing , Public Opinion , Adult , Female , Humans , Male , Medical Records , Surveys and Questionnaires , Trust
3.
BMC Med Inform Decis Mak ; 19(Suppl 3): 70, 2019 04 04.
Article in English | MEDLINE | ID: mdl-30943963

ABSTRACT

BACKGROUND: A shareable repository of clinical notes is critical for advancing natural language processing (NLP) research, and therefore a goal of many NLP researchers is to create a shareable repository of clinical notes, that has breadth (from multiple institutions) as well as depth (as much individual data as possible). METHODS: We aimed to assess the degree to which individuals would be willing to contribute their health data to such a repository. A compact e-survey probed willingness to share demographic and clinical data categories. Participants were faculty, staff, and students in two geographically diverse major medical centers (Utah and New York). Such a sample could be expected to respond like a typical potential participant from the general public who is given complete and fully informed consent about the pros and cons of participating in a research study. RESULTS: Two thousand one hundred forty respondents completed the surveys. 56% of respondents were "somewhat/definitely willing" to share clinical data with identifiers, while 89% of respondents were "somewhat (17%)/definitely willing (72%)" to share without identifiers. Results were consistent across gender, age, and education, but there were some differences by geographical region. Individuals were most reluctant (50-74%) sharing mental health, substance abuse, and domestic violence data. CONCLUSIONS: We conclude that a substantial fraction of potential patient participants, once educated about risks and benefits, would be willing to donate de-identified clinical data to a shared research repository. A slight majority even would be willing to share absent de-identification, suggesting that perceptions about data misuse are not a major concern. Such a repository of clinical notes should be invaluable for clinical NLP research and advancement.


Subject(s)
Academic Medical Centers , Biomedical Research , Health Personnel , Information Dissemination , Natural Language Processing , Adolescent , Adult , Confidentiality , Female , Humans , Informed Consent , Male , Middle Aged , New York , Patient Participation , Surveys and Questionnaires , Young Adult
4.
Am J Hum Genet ; 103(1): 58-73, 2018 07 05.
Article in English | MEDLINE | ID: mdl-29961570

ABSTRACT

Integration of detailed phenotype information with genetic data is well established to facilitate accurate diagnosis of hereditary disorders. As a rich source of phenotype information, electronic health records (EHRs) promise to empower diagnostic variant interpretation. However, how to accurately and efficiently extract phenotypes from heterogeneous EHR narratives remains a challenge. Here, we present EHR-Phenolyzer, a high-throughput EHR framework for extracting and analyzing phenotypes. EHR-Phenolyzer extracts and normalizes Human Phenotype Ontology (HPO) concepts from EHR narratives and then prioritizes genes with causal variants on the basis of the HPO-coded phenotype manifestations. We assessed EHR-Phenolyzer on 28 pediatric individuals with confirmed diagnoses of monogenic diseases and found that the genes with causal variants were ranked among the top 100 genes selected by EHR-Phenolyzer for 16/28 individuals (p < 2.2 × 10-16), supporting the value of phenotype-driven gene prioritization in diagnostic sequence interpretation. To assess the generalizability, we replicated this finding on an independent EHR dataset of ten individuals with a positive diagnosis from a different institution. We then assessed the broader utility by examining two additional EHR datasets, including 31 individuals who were suspected of having a Mendelian disease and underwent different types of genetic testing and 20 individuals with positive diagnoses of specific Mendelian etiologies of chronic kidney disease from exome sequencing. Finally, through several retrospective case studies, we demonstrated how combined analyses of genotype data and deep phenotype data from EHRs can expedite genetic diagnoses. In summary, EHR-Phenolyzer leverages EHR narratives to automate phenotype-driven analysis of clinical exomes or genomes, facilitating the broader implementation of genomic medicine.


Subject(s)
Exome/genetics , Adolescent , Child , Child, Preschool , Electronic Health Records , Female , Genetic Testing/methods , Genomics/methods , Genotype , Humans , Infant , Infant, Newborn , Male , Phenotype , Renal Insufficiency, Chronic/genetics , Retrospective Studies
5.
AMIA Jt Summits Transl Sci Proc ; 2017: 142-151, 2018.
Article in English | MEDLINE | ID: mdl-29888060

ABSTRACT

Medication regimen may be optimized based on individual drug efficacy identified by pharmacogenomic testing. However, majority of current pharmacogenomic decision support tools provide assessment only of single drug-gene interactions without taking into account complex drug-drug and drug-drug-gene interactions which are prevalent in people with polypharmacy and can result in adverse drug events or insufficient drug efficacy. The main objective of this project was to develop comprehensive pharmacogenomic decision support for medication risk assessment in people with polypharmacy that simultaneously accounts for multiple drug and gene effects. To achieve this goal, the project addressed two aims: (1) development of comprehensive knowledge repository of actionable pharmacogenes; (2) introduction of scoring approaches reflecting potential adverse effect risk levels of complex medication regimens accounting for pharmacogenomic polymorphisms and multiple drug metabolizing pathways. After pharmacogenomic knowledge repository was introduced, a scoring algorithm has been built and pilot-tested using a limited data set. The resulting total risk score for frequently hospitalized older adults with polypharmacy (72.04±17.84) was statistically significantly different (p<0.05) from the total risk score for older adults with polypharmacy with low hospitalization rate (8.98±2.37). An initial prototype assessment demonstrated feasibility of our approach and identified steps for improving risk scoring algorithms.

6.
Brief Bioinform ; 19(5): 863-877, 2018 09 28.
Article in English | MEDLINE | ID: mdl-28334070

ABSTRACT

Drug-drug interactions (DDIs) constitute an important concern in drug development and postmarketing pharmacovigilance. They are considered the cause of many adverse drug effects exposing patients to higher risks and increasing public health system costs. Methods to follow-up and discover possible DDIs causing harm to the population are a primary aim of drug safety researchers. Here, we review different methodologies and recent advances using data mining to detect DDIs with impact on patients. We focus on data mining of different pharmacovigilance sources, such as the US Food and Drug Administration Adverse Event Reporting System and electronic health records from medical institutions, as well as on the diverse data mining studies that use narrative text available in the scientific biomedical literature and social media. We pay attention to the strengths but also further explain challenges related to these methods. Data mining has important applications in the analysis of DDIs showing the impact of the interactions as a cause of adverse effects, extracting interactions to create knowledge data sets and gold standards and in the discovery of novel and dangerous DDIs.


Subject(s)
Data Mining/methods , Drug Interactions , Computational Biology/methods , Drug-Related Side Effects and Adverse Reactions , Electronic Health Records/statistics & numerical data , Humans , Pharmacovigilance , Publications/statistics & numerical data , Social Media/statistics & numerical data , United States , United States Food and Drug Administration
7.
AMIA Annu Symp Proc ; 2018: 827-836, 2018.
Article in English | MEDLINE | ID: mdl-30815125

ABSTRACT

This study investigated the automated detection of antiretroviral toxicities in structured electronic health records data. The evaluation compared responses generated by 5 clinical pharmacists and 1 prototype knowledge-based application for 15 randomly selected test cases. The main outcomes were inter-subject dissimilarity of responses quantified by the Jaccard distance, and the mean proportion of correct responses by each subject. The statistical differences in inter-subject Jaccard distances suggested that the prototype was inferior to clinical pharmacists in the detection of possible antiretroviral toxicity associations from structured data. The reason for dissimilarities was attributable to inadequate domain coverage by the prototype. The differences in the mean proportion of correct responses between the clinical pharmacists and the prototype were statistically indistinguishable. Overall, this study suggests that knowledge-based applications have the potential to support automated detection of antiretroviral toxicities from structured patient records. Furthermore, the study demonstrates a systematic approach for validating such applications quantitatively.


Subject(s)
Anti-Retroviral Agents/adverse effects , Drug Monitoring/methods , Electronic Health Records , Knowledge Bases , Point-of-Care Testing , HIV Infections/drug therapy , Humans , Pharmacists , Point-of-Care Systems , Practice Guidelines as Topic
8.
BMC Med Inform Decis Mak ; 17(1): 175, 2017 Dec 19.
Article in English | MEDLINE | ID: mdl-29258594

ABSTRACT

BACKGROUND: It is beneficial for health care institutions to monitor physician prescribing patterns to ensure that high-quality and cost-effective care is being provided to patients. However, detecting treatment patterns within an institution is challenging, given that medications and conditions are often not explicitly linked in the health record. Here we demonstrate the use of statistical methods together with data from the electronic health care record (EHR) to analyze prescribing patterns at an institution. METHODS: As a demonstration of our method, which is based on regression, we collect EHR data from outpatient notes and use a case/control study design to determine the medications that are associated with hypertension. We also use regression to determine which conditions are associated with a preferential use of one or more classes of hypertension agents. Finally, we compare our method to methods based on tabulation. RESULTS: Our results show that regression methods provide more reasonable and useful results than tabulation, and successfully distinguish between medications that treat hypertension and medications that do not. These methods also provide insight into in which circumstances certain drugs are preferred over others. CONCLUSIONS: Our method can be used by health care institutions to monitor physician prescribing patterns and ensure the appropriateness of treatment.


Subject(s)
Drug Prescriptions/standards , Electronic Health Records , Practice Patterns, Physicians' , Quality of Health Care , Case-Control Studies , Humans , Practice Patterns, Physicians'/standards , Quality of Health Care/standards , Regression Analysis
9.
J Biomed Inform ; 76: 41-49, 2017 Dec.
Article in English | MEDLINE | ID: mdl-29081385

ABSTRACT

OBJECTIVE: Improving mechanisms to detect adverse drug reactions (ADRs) is key to strengthening post-marketing drug safety surveillance. Signal detection is presently unimodal, relying on a single information source. Multimodal signal detection is based on jointly analyzing multiple information sources. Building on, and expanding the work done in prior studies, the aim of the article is to further research on multimodal signal detection, explore its potential benefits, and propose methods for its construction and evaluation. MATERIAL AND METHODS: Four data sources are investigated; FDA's adverse event reporting system, insurance claims, the MEDLINE citation database, and the logs of major Web search engines. Published methods are used to generate and combine signals from each data source. Two distinct reference benchmarks corresponding to well-established and recently labeled ADRs respectively are used to evaluate the performance of multimodal signal detection in terms of area under the ROC curve (AUC) and lead-time-to-detection, with the latter relative to labeling revision dates. RESULTS: Limited to our reference benchmarks, multimodal signal detection provides AUC improvements ranging from 0.04 to 0.09 based on a widely used evaluation benchmark, and a comparative added lead-time of 7-22 months relative to labeling revision dates from a time-indexed benchmark. CONCLUSIONS: The results support the notion that utilizing and jointly analyzing multiple data sources may lead to improved signal detection. Given certain data and benchmark limitations, the early stage of development, and the complexity of ADRs, it is currently not possible to make definitive statements about the ultimate utility of the concept. Continued development of multimodal signal detection requires a deeper understanding the data sources used, additional benchmarks, and further research on methods to generate and synthesize signals.


Subject(s)
Adverse Drug Reaction Reporting Systems , Databases, Factual , Humans , United States , United States Food and Drug Administration
10.
AMIA Annu Symp Proc ; 2017: 535-544, 2017.
Article in English | MEDLINE | ID: mdl-29854118

ABSTRACT

Pharmacogenetics-related publications, which are increasing rapidly, provide important new pharmacogenetics knowledge. Automated approaches to extract information of new alleles and to identify their impact on metabolic phenotypes from publications are urgently needed to facilitate personalized medicine and improve clinical outcomes. Cytochrome polymorphisms, responsible for a wide variation of drug pharmacodynamics, individual efficacy and adverse effects, have significant potential for optimizing drug therapy. A few studies have addressed specialized efforts to automatically extract cytochrome polymorphisms and their characterizations regarding metabolic phenotypes from the literature. In this paper, we present a novel rule-based text-mining system to extract metabolic phenotypes of polymorphisms from PubMed abstracts with a focus on cytochrome P450. This system is promising as it achieved a precision of 85.71% in a preliminary proof-of-concept evaluation and is expected to automatically provide up-to-date metabolic information for cytochrome polymorphisms, which is critical to advance personalized medicine and improve clinical care.


Subject(s)
Cytochrome P-450 Enzyme System/genetics , Data Mining/methods , Phenotype , Polymorphism, Genetic , PubMed , Abstracting and Indexing , Genotype , Genotyping Techniques , Humans , Pharmacogenetics , Precision Medicine
11.
Pharmgenomics Pers Med ; 9: 107-116, 2016.
Article in English | MEDLINE | ID: mdl-27789970

ABSTRACT

Pharmacogenetic testing identifies genetic biomarkers that are predictive of individual sensitivity to particular drugs. A significant proportion of medications that are widely prescribed for older adults are metabolized by enzymes that are encoded by highly polymorphic genes. Pharmacogenetic testing is increasingly used to optimize the medication regimen; however, its potential in older adults with polypharmacy has not been systematically explored. Following the initial case-series study, this study hypothesized that frequently hospitalized older adults with polypharmacy have higher frequency of pharmacogenetic polymorphism as compared to older adults with polypharmacy who are rarely admitted to a hospital. To test this hypothesis, a nested case-control study was conducted with pharmacogenetic polymorphism as an exposure and hospitalization rate as an outcome. In this study, frequently hospitalized older adults (≥65 years of age) with polypharmacy were matched with rarely hospitalized older adults with poly-pharmacy by age, gender, race, ethnicity, and chronic disease score. Average age and number of prescription drugs did not differ in cases and controls (77.2±5.0 and 78.3±5.1 years, 14.3±5.3 and 14.0±2.9 medications, respectively). No statistically significant difference in sociodemographic, clinical, and behavioral characteristics that are known to affect hospitalization risk was found between the cases and controls. Major pharmacogenetic polymorphism defined as presence of at least one allelic combination resulting in poor or rapid metabolizer status was identified in all the cases. No major pharmacogenetic polymorphisms were detected in controls. Based on the exact McNemar's test, the difference in major pharmacogenetic polymorphism frequency between cases and controls was statistically significant (p<0.05). In 50% of cases, more than one major pharmacogenetic polymorphism was found. The frequency of CYP2C19 rapid metabolizer, CYP3A4/5 poor metabolizer, VKORC1 low sensitivity, and CYP2D6 rapid metabolizer status in cases was 67%, 33%, 33%, and 17%, respectively, which significantly exceeded respective prevalence in general population. The mean number of major gene-drug interactions found in cases was 2.8±2.2, whereas no major drug-gene interactions were identified in controls. The difference in the number of major drug-gene interactions between cases and controls was statistically significant (p<0.05). The pilot data supported the hypothesis that pharmacogenetic polymorphism may represent an independent risk factor for frequent hospitalizations in older adults with polypharmacy. Due to small sample size, the results of this proof-of-concept study cannot be conclusive. Further work on the utility of pharmacogenetic testing for optimization of medication regimens in this vulnerable group of older adults is warranted.

12.
PLoS One ; 11(10): e0164304, 2016.
Article in English | MEDLINE | ID: mdl-27716785

ABSTRACT

Recent research has suggested that the case-control study design, unlike the self-controlled study design, performs poorly in controlling confounding in the detection of adverse drug reactions (ADRs) from administrative claims and electronic health record (EHR) data, resulting in biased estimates of the causal effects of drugs on health outcomes of interest (HOI) and inaccurate confidence intervals. Here we show that using rich data on comorbidities and automatic variable selection strategies for selecting confounders can better control confounding within a case-control study design and provide a more solid basis for inference regarding the causal effects of drugs on HOIs. Four HOIs are examined: acute kidney injury, acute liver injury, acute myocardial infarction and gastrointestinal ulcer hospitalization. For each of these HOIs we use a previously published reference set of positive and negative control drugs to evaluate the performance of our methods. Our methods have AUCs that are often substantially higher than the AUCs of a baseline method that only uses demographic characteristics for confounding control. Our methods also give confidence intervals for causal effect parameters that cover the expected no effect value substantially more often than this baseline method. The case-control study design, unlike the self-controlled study design, can be used in the fairly typical setting of EHR databases without longitudinal information on patients. With our variable selection method, these databases can be more effectively used for the detection of ADRs.


Subject(s)
Drug-Related Side Effects and Adverse Reactions/diagnosis , Risk Assessment/methods , Acute Kidney Injury/drug therapy , Adult , Adverse Drug Reaction Reporting Systems , Aged , Area Under Curve , Case-Control Studies , Comorbidity , Databases, Factual , Electronic Health Records , Female , Humans , Male , Middle Aged , Myocardial Infarction/drug therapy , Research Design
13.
Article in English | MEDLINE | ID: mdl-27570654

ABSTRACT

Academic literature provides rich and up-to-date information concerning adverse drug reactions (ADR), but it is time consuming and labor intensive for physicians to obtain information of ADRs from academic literature because they would have to generate queries, review retrieved articles and summarize the results. In this study, a method is developed to automatically detect and summarize ADRs from journal articles, rank them and present them to physicians in a user-friendly interface. The method studied ADRs for 6 drugs and returned on average 4.8 ADRs that were correct. The results demonstrated this method was feasible and effective. This method can be applied in clinical practice for assisting physicians to efficiently obtain information about ADRs associated with specific drugs. Automated summarization of ADR information from recent publications may facilitate translation of academic research into actionable information at point of care.

14.
Article in English | MEDLINE | ID: mdl-27570662

ABSTRACT

An automated, user-friendly and accurate system for retrieving herb-drug interaction (HDIs) related articles in MEDLINE can increase the safety of patients, as well as improve the physicians' article retrieving ability regarding speed and experience. Previous studies show that MeSH based queries associated with negative effects of drugs can be customized, resulting in good performance in retrieving relevant information, but no study has focused on the area of herb-drug interactions (HDI). This paper adapted the characteristics of HDI related papers and created a multilayer HDI article searching system. It achieved a sensitivity of 92% at a precision of 93% in a preliminary evaluation. Instead of requiring physicians to conduct PubMed searches directly, this system applies a more user-friendly approach by employing a customized system that enhances PubMed queries, shielding users from having to write queries, dealing with PubMed, or reading many irrelevant articles. The system provides automated processes and outputs target articles based on the input.

15.
Pharmgenomics Pers Med ; 9: 31-45, 2016.
Article in English | MEDLINE | ID: mdl-27143951

ABSTRACT

Pharmacogenomic (PGx) testing has been increasingly used to optimize drug regimens; however, its potential in older adults with polypharmacy has not been systematically studied. In this hypothesis-generating study, we employed a case series design to explore potential utility of PGx testing in older adults with polypharmacy and to highlight barriers in implementing this methodology in routine clinical practice. Three patients with concurrent chronic heart and lung disease aged 74, 78, and 83 years and whose medication regimen comprised 26, 17, and 18 drugs, correspondingly, served as cases for this study. PGx testing identified major genetic polymorphisms in the first two cases. The first case was identified as "CYP3A4/CYP3A5 poor metabolizer", which affected metabolism of eleven prescribed drugs. The second case had "CYP2D6 rapid metabolizer" status affecting three prescribed medications, two of which were key drugs for managing this patient's chronic conditions. Both these patients also had VKORC1 allele *A, resulting in higher sensitivity to warfarin. All cases demonstrated a significant number of potential drug-drug interactions. Both patients with significant drug-gene interactions had a history of frequent hospitalizations (six and 23, respectively), whereas the person without impaired cytochrome P450 enzyme activity had only two acute episodes in the last 5 years, although he was older and had multiple comorbidities. Since all patients received guideline-concordant therapy from the same providers and were adherent to their drug regimen, we hypothesized that genetic polymorphism may represent an additional risk factor for higher hospitalization rates in older adults with polypharmacy. However, evidence to support or reject this hypothesis is yet to be established. Studies evaluating clinical impact of PGx testing in older adults with polypharmacy are warranted. For practical implementation of pharmacogenomics in routine clinical care, besides providing convincing evidence of its clinical effectiveness, multiple barriers must be addressed. Introduction of intelligent clinical decision support in electronic medical record systems is required to address complexities of simultaneous drug-gene and drug-drug interactions in older adults with polypharmacy. Physician training, clear clinical pathways, evidence-based guidelines, and patient education materials are necessary for unlocking full potential of pharmacogenomics into routine clinical care of older adults.

16.
J Am Med Inform Assoc ; 22(6): 1261-70, 2015 Nov.
Article in English | MEDLINE | ID: mdl-26335981

ABSTRACT

OBJECTIVE: Medication-indication information is a key part of the information needed for providing decision support for and promoting appropriate use of medications. However, this information is not readily available to end users, and a lot of the resources only contain this information in unstructured form (free text). A number of public knowledge bases (KBs) containing structured medication-indication information have been developed over the years, but a direct comparison of these resources has not yet been conducted. MATERIAL AND METHODS: We conducted a systematic review of the literature to identify all medication-indication KBs and critically appraised these resources in terms of their scope as well as their support for complex indication information. RESULTS: We identified 7 KBs containing medication-indication data. They notably differed from each other in terms of their scope, coverage for on- or off-label indications, source of information, and choice of terminologies for representing the knowledge. The majority of KBs had issues with granularity of the indications as well as with representing duration of therapy, primary choice of treatment, and comedications or comorbidities. DISCUSSION AND CONCLUSION: This is the first study directly comparing public KBs of medication indications. We identified several gaps in the existing resources, which can motivate future research.


Subject(s)
Drug Therapy, Computer-Assisted , Knowledge Bases , Humans , Off-Label Use , Systematized Nomenclature of Medicine
17.
Article in English | MEDLINE | ID: mdl-26306227

ABSTRACT

Timely dissemination of up-to-date information concerning adverse drug reactions (ADRs) at the point of care can significantly improve medication safety and prevent ADRs. Automated methods for finding relevant articles in MEDLINE which discuss ADRs for specific medications can facilitate decision making at the point of care. Previous work has focused on other types of clinical queries and on retrieval for specific ADRs or drug-ADR pairs, but little work has been published on finding ADR articles for a specific medication. We have developed a method to generate a PubMED query based on MESH, supplementary concepts, and textual terms for a particular medication. Evaluation was performed on a limited sample, resulting in a sensitivity of 90% and precision of 93%. Results demonstrated that this method is highly effective. Future work will integrate this method within an interface aimed at facilitating access to ADR information for specified drugs at the point of care.

18.
Drug Saf ; 38(10): 895-908, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26153397

ABSTRACT

INTRODUCTION: Observational healthcare data contain information useful for hastening detection of adverse drug reactions (ADRs) that may be missed by using data in spontaneous reporting systems (SRSs) alone. There are only several papers describing methods that integrate evidence from healthcare databases and SRSs. We propose a methodology that combines ADR signals from these two sources. OBJECTIVES: The aim of this study was to investigate whether the proposed method would result in more accurate ADR detection than methods using SRSs or healthcare data alone. RESEARCH DESIGN: We applied the method to four clinically serious ADRs, and evaluated it using three experiments that involve combining an SRS with a single facility small-scale electronic health record (EHR), a larger scale network-based EHR, and a much larger scale healthcare claims database. The evaluation used a reference standard comprising 165 positive and 234 negative drug-ADR pairs. MEASURES: Area under the receiver operator characteristics curve (AUC) was computed to measure performance. RESULTS: There was no improvement in the AUC when the SRS and small-scale HER were combined. The AUC of the combined SRS and large-scale EHR was 0.82 whereas it was 0.76 for each of the individual systems. Similarly, the AUC of the combined SRS and claims system was 0.82 whereas it was 0.76 and 0.78, respectively, for the individual systems. CONCLUSIONS: The proposed method resulted in a significant improvement in the accuracy of ADR detection when the resources used for combining had sufficient amounts of data, demonstrating that the method could integrate evidence from multiple sources and serve as a tool in actual pharmacovigilance practice.


Subject(s)
Delivery of Health Care/methods , Drug-Related Side Effects and Adverse Reactions/epidemiology , Adverse Drug Reaction Reporting Systems , Databases, Factual , Electronic Health Records , Female , Health Personnel , Humans , Male , Pharmacovigilance
19.
J Am Med Inform Assoc ; 22(1): 179-91, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25053577

ABSTRACT

OBJECTIVES: Drug repurposing, which finds new indications for existing drugs, has received great attention recently. The goal of our work is to assess the feasibility of using electronic health records (EHRs) and automated informatics methods to efficiently validate a recent drug repurposing association of metformin with reduced cancer mortality. METHODS: By linking two large EHRs from Vanderbilt University Medical Center and Mayo Clinic to their tumor registries, we constructed a cohort including 32,415 adults with a cancer diagnosis at Vanderbilt and 79,258 cancer patients at Mayo from 1995 to 2010. Using automated informatics methods, we further identified type 2 diabetes patients within the cancer cohort and determined their drug exposure information, as well as other covariates such as smoking status. We then estimated HRs for all-cause mortality and their associated 95% CIs using stratified Cox proportional hazard models. HRs were estimated according to metformin exposure, adjusted for age at diagnosis, sex, race, body mass index, tobacco use, insulin use, cancer type, and non-cancer Charlson comorbidity index. RESULTS: Among all Vanderbilt cancer patients, metformin was associated with a 22% decrease in overall mortality compared to other oral hypoglycemic medications (HR 0.78; 95% CI 0.69 to 0.88) and with a 39% decrease compared to type 2 diabetes patients on insulin only (HR 0.61; 95% CI 0.50 to 0.73). Diabetic patients on metformin also had a 23% improved survival compared with non-diabetic patients (HR 0.77; 95% CI 0.71 to 0.85). These associations were replicated using the Mayo Clinic EHR data. Many site-specific cancers including breast, colorectal, lung, and prostate demonstrated reduced mortality with metformin use in at least one EHR. CONCLUSIONS: EHR data suggested that the use of metformin was associated with decreased mortality after a cancer diagnosis compared with diabetic and non-diabetic cancer patients not on metformin, indicating its potential as a chemotherapeutic regimen. This study serves as a model for robust and inexpensive validation studies for drug repurposing signals using EHR data.


Subject(s)
Drug Repositioning , Electronic Health Records , Hypoglycemic Agents/therapeutic use , Information Storage and Retrieval/methods , Metformin/therapeutic use , Neoplasms/mortality , Administration, Oral , Adult , Diabetes Mellitus, Type 2/complications , Diabetes Mellitus, Type 2/drug therapy , Diabetes Mellitus, Type 2/mortality , Humans , Natural Language Processing , Neoplasms/complications , Neoplasms/prevention & control , Registries , Survival Analysis
20.
Curr Drug Metab ; 15(5): 490-501, 2014.
Article in English | MEDLINE | ID: mdl-25431152

ABSTRACT

Co-administration of drugs is a primary cause of Adverse Drug Reactions (ADRs) and a drain on the health care industry costing billions of dollars and reducing quality of life. Drug-Drug Interactions (DDIs) account for as much as 30% of all ADRs. Unfortunately, DDIs are not systematically explored pre-clinically and are difficult to detect in post-marketing drug surveillance. For this reason, the detection and prediction of DDIs is an important problem in both drug development and pharmacovigilance. The comparison of the 3D drug structures provides a powerful tool for DDI prediction. In this article, we present the first large scale model for predicting DDIs using the drug's 3D molecular structure. In addition to identifying putative drug interactions we can also isolate the pharmacological or clinical effect associated with the predicted interactions. The model has good performance in two different hold-out validations and in external test sets. We found that the top scored drug pairs were significantly enriched for known clinically relevant interactions and that 3D structure data is providing significantly independent information from other approaches, including 2D structure (p=0.003). We demonstrated the usefulness of the proposed methodology to systematically identify pharmacokinetic and pharmacodynamic interactions, provided an exploratory tool that can be used for patient safety and pre-clinical toxicity screening, and reviewed the state of the art methods used to detect DDIs.


Subject(s)
Drug Interactions , Models, Molecular , Molecular Conformation , Databases, Pharmaceutical , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...