Search | VHL Regional Portal

1.

Simulation of undiagnosed patients with novel genetic conditions.

Alsentzer, Emily; Finlayson, Samuel G; Li, Michelle M; Kobren, Shilpa N; Kohane, Isaac S.

Nat Commun ; 14(1): 6403, 2023 10 12.

Article in English | MEDLINE | ID: mdl-37828001

ABSTRACT

Rare Mendelian disorders pose a major diagnostic challenge and collectively affect 300-400 million patients worldwide. Many automated tools aim to uncover causal genes in patients with suspected genetic disorders, but evaluation of these tools is limited due to the lack of comprehensive benchmark datasets that include previously unpublished conditions. Here, we present a computational pipeline that simulates realistic clinical datasets to address this deficit. Our framework jointly simulates complex phenotypes and challenging candidate genes and produces patients with novel genetic conditions. We demonstrate the similarity of our simulated patients to real patients from the Undiagnosed Diseases Network and evaluate common gene prioritization methods on the simulated cohort. These prioritization methods recover known gene-disease associations but perform poorly on diagnosing patients with novel genetic disorders. Our publicly-available dataset and codebase can be utilized by medical genetics researchers to evaluate, compare, and improve tools that aid in the diagnostic process.

Subject(s)

Patients , Rare Diseases , Humans , Computer Simulation , Phenotype , Rare Diseases/diagnosis , Rare Diseases/genetics

2.

Machine Learning and Statistics in Clinical Research Articles-Moving Past the False Dichotomy.

Finlayson, Samuel G; Beam, Andrew L; van Smeden, Maarten.

JAMA Pediatr ; 177(5): 448-450, 2023 05 01.

Article in English | MEDLINE | ID: mdl-36939696

ABSTRACT

This Viewpoint describes the false dichotomy between statistics and machine learning and suggests considerations in building and evaluating clinical prediction models.

Subject(s)

Machine Learning , Statistics as Topic , Humans

3.

The Diagnostic and Triage Accuracy of the GPT-3 Artificial Intelligence Model.

Levine, David M; Tuwani, Rudraksh; Kompa, Benjamin; Varma, Amita; Finlayson, Samuel G; Mehrotra, Ateev; Beam, Andrew.

medRxiv ; 2023 Feb 01.

Article in English | MEDLINE | ID: mdl-36778449

ABSTRACT

Importance: Artificial intelligence (AI) applications in health care have been effective in many areas of medicine, but they are often trained for a single task using labeled data, making deployment and generalizability challenging. Whether a general-purpose AI language model can perform diagnosis and triage is unknown. Objective: Compare the general-purpose Generative Pre-trained Transformer 3 (GPT-3) AI model's diagnostic and triage performance to attending physicians and lay adults who use the Internet. Design: We compared the accuracy of GPT-3's diagnostic and triage ability for 48 validated case vignettes of both common (e.g., viral illness) and severe (e.g., heart attack) conditions to lay people and practicing physicians. Finally, we examined how well calibrated GPT-3's confidence was for diagnosis and triage. Setting and Participants: The GPT-3 model, a nationally representative sample of lay people, and practicing physicians. Exposure: Validated case vignettes (<60 words; <6th grade reading level). Main Outcomes and Measures: Correct diagnosis, correct triage. Results: Among all cases, GPT-3 replied with the correct diagnosis in its top 3 for 88% (95% CI, 75% to 94%) of cases, compared to 54% (95% CI, 53% to 55%) for lay individuals (p<0.001) and 96% (95% CI, 94% to 97%) for physicians (p=0.0354). GPT-3 triaged (71% correct; 95% CI, 57% to 82%) similarly to lay individuals (74%; 95% CI, 73% to 75%; p=0.73); both were significantly worse than physicians (91%; 95% CI, 89% to 93%; p<0.001). As measured by the Brier score, GPT-3 confidence in its top prediction was reasonably well-calibrated for diagnosis (Brier score = 0.18) and triage (Brier score = 0.22). Conclusions and Relevance: A general-purpose AI language model without any content-specific training could perform diagnosis at levels close to, but below physicians and better than lay individuals. The model was performed less well on triage, where its performance was closer to that of lay individuals.

4.

Precision Medicine: Using Artificial Intelligence to Improve Diagnostics and Healthcare.

Daneshjou, Roxana; Brenner, Steven E; Chen, Jonathan H; Crawford, Dana C; Finlayson, Samuel G; Kidzinski, Lukasz; Bulyk, Martha L.

Pac Symp Biocomput ; 27: 223-230, 2022.

Article in English | MEDLINE | ID: mdl-34890151

ABSTRACT

The continued generation of large amounts of data within healthcare-from imaging to electronic medical health records to genomics and multi-omics -necessitates tools and methods to parse and interpret these data to improve healthcare outcomes. Artificial intelligence, and in particular deep learning, has enabled researchers to gain new insights from large scale and multimodal data. At the 2022 Pacific Symposium on Biocomputing (PSB) session entitled "Precision Medicine: Using Artificial Intelligence to Improve Diagnostics and Healthcare", we showcase the latest research, influenced and inspired by the idea of using technology to build a more fair, tailored, and cost-effective healthcare system after the COVID-19 pandemic.

Subject(s)

Artificial Intelligence , COVID-19 , Computational Biology , Delivery of Health Care , Humans , Pandemics , Precision Medicine , SARS-CoV-2

5.

The Clinician and Dataset Shift in Artificial Intelligence.

Finlayson, Samuel G; Subbaswamy, Adarsh; Singh, Karandeep; Bowers, John; Kupke, Annabel; Zittrain, Jonathan; Kohane, Isaac S; Saria, Suchi.

N Engl J Med ; 385(3): 283-286, 2021 07 15.

Article in English | MEDLINE | ID: mdl-34260843

Subject(s)

Artificial Intelligence , Datasets as Topic , Humans , Information Technology , Machine Learning , Medical Informatics

6.

Cross-modal representation alignment of molecular structure and perturbation-induced transcriptional profiles.

Finlayson, Samuel G; McDermott, Matthew B A; Pickering, Alex V; Lipnick, Scott L; Kohane, Isaac S.

Pac Symp Biocomput ; 26: 273-284, 2021.

Article in English | MEDLINE | ID: mdl-33691024

ABSTRACT

Modeling the relationship between chemical structure and molecular activity is a key goal in drug development. Many benchmark tasks have been proposed for molecular property prediction, but these tasks are generally aimed at specific, isolated biomedical properties. In this work, we propose a new cross-modal small molecule retrieval task, designed to force a model to learn to associate the structure of a small molecule with the transcriptional change it induces. We develop this task formally as multi-view alignment problem, and present a coordinated deep learning approach that jointly optimizes representations of both chemical structure and perturbational gene expression profiles. We benchmark our results against oracle models and principled baselines, and find that cell line variability markedly influences performance in this domain. Our work establishes the feasibility of this new task, elucidates the limitations of current data and systems, and may serve to catalyze future research in small molecule representation learning.

Subject(s)

Benchmarking , Computational Biology , Molecular Structure

7.

Examining the Use of Real-World Evidence in the Regulatory Process.

Beaulieu-Jones, Brett K; Finlayson, Samuel G; Yuan, William; Altman, Russ B; Kohane, Isaac S; Prasad, Vinay; Yu, Kun-Hsing.

Clin Pharmacol Ther ; 107(4): 843-852, 2020 04.

Article in English | MEDLINE | ID: mdl-31562770

ABSTRACT

The 21st Century Cures Act passed by the United States Congress mandates the US Food and Drug Administration to develop guidance to evaluate the use of real-world evidence (RWE) to support the regulatory process. RWE has generated important medical discoveries, especially in areas where traditional clinical trials would be unethical or infeasible. However, RWE suffers from several issues that hinder its ability to provide proof of treatment efficacy at a level comparable to randomized controlled trials. In this review article, we summarized the advantages and limitations of RWE, identified the key opportunities for RWE, and pointed the way forward to maximize the potential of RWE for regulatory purposes.

Subject(s)

Clinical Trials as Topic/legislation & jurisprudence , Evidence-Based Medicine/legislation & jurisprudence , United States Food and Drug Administration/legislation & jurisprudence , Clinical Trials as Topic/methods , Clinical Trials as Topic/statistics & numerical data , Decision Making , Evidence-Based Medicine/methods , Evidence-Based Medicine/statistics & numerical data , Humans , United States

8.

Trends and Focus of Machine Learning Applications for Health Research.

Beaulieu-Jones, Brett; Finlayson, Samuel G; Chivers, Corey; Chen, Irene; McDermott, Matthew; Kandola, Jaz; Dalca, Adrian V; Beam, Andrew; Fiterau, Madalina; Naumann, Tristan.

JAMA Netw Open ; 2(10): e1914051, 2019 10 02.

Article in English | MEDLINE | ID: mdl-31651969

ABSTRACT

Importance: The use of machine learning applications related to health is rapidly increasing and may have the potential to profoundly affect the field of health care. Objective: To analyze submissions to a popular machine learning for health venue to assess the current state of research, including areas of methodologic and clinical focus, limitations, and underexplored areas. Design, Setting, and Participants: In this data-driven qualitative analysis, 166 accepted manuscript submissions to the Third Annual Machine Learning for Health workshop at the 32nd Conference on Neural Information Processing Systems on December 8, 2018, were analyzed to understand research focus, progress, and trends. Experts reviewed each submission against a rubric to identify key data points, statistical modeling and analysis of submitting authors was performed, and research topics were quantitatively modeled. Finally, an iterative discussion of topics common in submissions and invited speakers at the workshop was held to identify key trends. Main Outcomes and Measures: Frequency and statistical measures of methods, topics, goals, and author attributes were derived from an expert review of submissions guided by a rubric. Results: Of the 166 accepted submissions, 58 (34.9%) had clinician involvement and 83 submissions (50.0%) that focused on clinical practice included clinical collaborators. A total of 97 data sets (58.4%) used in submissions were publicly available or required a standard registration process. Clinical practice was the most common application area (70 manuscripts [42.2%]), with brain and mental health (25 [15.1%]), oncology (21 [12.7%]), and cardiovascular (19 [11.4%]) being the most common specialties. Conclusions and Relevance: Trends in machine learning for health research indicate the importance of well-annotated, easily accessed data and the benefit from greater clinician involvement in the development of translational applications.

Subject(s)

Biomedical Research , Delivery of Health Care , Health Services Research , Machine Learning/trends , Congresses as Topic , Humans

9.

Systemic nature of spinal muscular atrophy revealed by studying insurance claims.

Lipnick, Scott L; Agniel, Denis M; Aggarwal, Rahul; Makhortova, Nina R; Finlayson, Samuel G; Brocato, Alexandra; Palmer, Nathan; Darras, Basil T; Kohane, Isaac; Rubin, Lee L.

PLoS One ; 14(3): e0213680, 2019.

Article in English | MEDLINE | ID: mdl-30870495

ABSTRACT

OBJECTIVE: We investigated the presence of non-neuromuscular phenotypes in patients affected by Spinal Muscular Atrophy (SMA), a disorder caused by a mutation in the Survival of Motor Neuron (SMN) gene, and whether these phenotypes may be clinically detectable prior to clinical signs of neuromuscular degeneration and therefore independent of muscle weakness. METHODS: We utilized a de-identified database of insurance claims to explore the health of 1,038 SMA patients compared to controls. Two analyses were performed: (1) claims from the entire insurance coverage window; and (2) for SMA patients, claims prior to diagnosis of any neuromuscular disease or evidence of major neuromuscular degeneration to increase the chance that phenotypes could be attributed directly to reduced SMN levels. Logistic regression was used to determine whether phenotypes were diagnosed at significantly different rates between SMA patients and controls and to obtain covariate-adjusted odds ratios. RESULTS: Results from the entire coverage window revealed a broad spectrum of phenotypes that are differentially diagnosed in SMA subjects compared to controls. Moreover, data from SMA patients prior to their first clinical signs of neuromuscular degeneration revealed numerous non-neuromuscular phenotypes including defects within the cardiovascular, gastrointestinal, metabolic, reproductive, and skeletal systems. Furthermore, our data provide evidence of a potential ordering of disease progression beginning with these non-neuromuscular phenotypes. CONCLUSIONS: Our data point to a direct relationship between early, detectable non-neuromuscular symptoms and SMN deficiency. Our findings are particularly important for evaluating the efficacy of SMN-increasing therapies for SMA, comparing the effectiveness of local versus systemically delivered therapeutics, and determining the optimal therapeutic treatment window prior to irreversible neuromuscular damage.

Subject(s)

Databases, Factual , Insurance, Health/statistics & numerical data , Muscular Atrophy, Spinal/diagnosis , Muscular Atrophy, Spinal/epidemiology , Neuromuscular Diseases/diagnosis , Adolescent , Adult , Age Factors , Aged , Child , Child, Preschool , Disease Progression , Female , Humans , Infant , Infant, Newborn , Male , Middle Aged , Muscular Atrophy, Spinal/physiopathology , Mutation , Neuromuscular Diseases/epidemiology , Neuromuscular Diseases/physiopathology , Odds Ratio , Phenotype , Regression Analysis , Survival of Motor Neuron 1 Protein/genetics , Time Factors , Young Adult

10.

Adversarial attacks on medical machine learning.

Finlayson, Samuel G; Bowers, John D; Ito, Joichi; Zittrain, Jonathan L; Beam, Andrew L; Kohane, Isaac S.

Science ; 363(6433): 1287-1289, 2019 03 22.

Article in English | MEDLINE | ID: mdl-30898923

Subject(s)

Fraud , Insurance Claim Review , Machine Learning , Humans

11.

Automatic Categorization and Scoring of Solid, Part-Solid and Non-Solid Pulmonary Nodules in CT Images with Convolutional Neural Network.

Tu, Xiaoguang; Xie, Mei; Gao, Jingjing; Ma, Zheng; Chen, Daiqiang; Wang, Qingfeng; Finlayson, Samuel G; Ou, Yangming; Cheng, Jie-Zhi.

Sci Rep ; 7(1): 8533, 2017 09 01.

Article in English | MEDLINE | ID: mdl-28864824

ABSTRACT

We present a computer-aided diagnosis system (CADx) for the automatic categorization of solid, part-solid and non-solid nodules in pulmonary computerized tomography images using a Convolutional Neural Network (CNN). Provided with only a two-dimensional region of interest (ROI) surrounding each nodule, our CNN automatically reasons from image context to discover informative computational features. As a result, no image segmentation processing is needed for further analysis of nodule attenuation, allowing our system to avoid potential errors caused by inaccurate image processing. We implemented two computerized texture analysis schemes, classification and regression, to automatically categorize solid, part-solid and non-solid nodules in CT scans, with hierarchical features in each case learned directly by the CNN model. To show the effectiveness of our CNN-based CADx, an established method based on histogram analysis (HIST) was implemented for comparison. The experimental results show significant performance improvement by the CNN model over HIST in both classification and regression tasks, yielding nodule classification and rating performance concordant with those of practicing radiologists. Adoption of CNN-based CADx systems may reduce the inter-observer variation among screening radiologists and provide a quantitative reference for further nodule analysis.

Subject(s)

Diagnosis, Computer-Assisted/methods , Lung/diagnostic imaging , Neural Networks, Computer , Solitary Pulmonary Nodule/diagnostic imaging , Tomography, X-Ray Computed/methods , Databases, Factual , Diagnosis, Differential , Humans , Lung/pathology , Lung Neoplasms/diagnostic imaging

12.

Toward rapid learning in cancer treatment selection: An analytical engine for practice-based clinical data.

Finlayson, Samuel G; Levy, Mia; Reddy, Sunil; Rubin, Daniel L.

J Biomed Inform ; 60: 104-13, 2016 Apr.

Article in English | MEDLINE | ID: mdl-26836975

ABSTRACT

OBJECTIVE: Wide-scale adoption of electronic medical records (EMRs) has created an unprecedented opportunity for the implementation of Rapid Learning Systems (RLSs) that leverage primary clinical data for real-time decision support. In cancer, where large variations among patient features leave gaps in traditional forms of medical evidence, the potential impact of a RLS is particularly promising. We developed the Melanoma Rapid Learning Utility (MRLU), a component of the RLS, providing an analytical engine and user interface that enables physicians to gain clinical insights by rapidly identifying and analyzing cohorts of patients similar to their own. MATERIALS AND METHODS: A new approach for clinical decision support in Melanoma was developed and implemented, in which patient-centered cohorts are generated from practice-based evidence and used to power on-the-fly stratified survival analyses. A database to underlie the system was generated from clinical, pharmaceutical, and molecular data from 237 patients with metastatic melanoma from two academic medical centers. The system was assessed in two ways: (1) ability to rediscover known knowledge and (2) potential clinical utility and usability through a user study of 13 practicing oncologists. RESULTS: The MRLU enables physician-driven cohort selection and stratified survival analysis. The system successfully identified several known clinical trends in melanoma, including frequency of BRAF mutations, survival rate of patients with BRAF mutant tumors in response to BRAF inhibitor therapy, and sex-based trends in prevalence and survival. Surveyed physician users expressed great interest in using such on-the-fly evidence systems in practice (mean response from relevant survey questions 4.54/5.0), and generally found the MRLU in particular to be both useful (mean score 4.2/5.0) and useable (4.42/5.0). DISCUSSION: The MRLU is an RLS analytical engine and user interface for Melanoma treatment planning that presents design principles useful in building RLSs. Further research is necessary to evaluate when and how to best use this functionality within the EMR clinical workflow for guiding clinical decision making. CONCLUSION: The MRLU is an important component in building a RLS for data driven precision medicine in Melanoma treatment that could be generalized to other clinical disorders.

Subject(s)

Decision Support Systems, Clinical , Electronic Health Records , Machine Learning , Melanoma/therapy , Software , Humans , Patient Selection , Proto-Oncogene Proteins B-raf/antagonists & inhibitors , Proto-Oncogene Proteins B-raf/genetics , User-Computer Interface

13.

Detecting unplanned care from clinician notes in electronic health records.

Tamang, Suzanne; Patel, Manali I; Blayney, Douglas W; Kuznetsov, Julie; Finlayson, Samuel G; Vetteth, Yohan; Shah, Nigam.

J Oncol Pract ; 11(3): e313-9, 2015 May.

Article in English | MEDLINE | ID: mdl-25980019

ABSTRACT

PURPOSE: Reduction in unplanned episodes of care, such as emergency department visits and unplanned hospitalizations, are important quality outcome measures. However, many events are only documented in free-text clinician notes and are labor intensive to detect by manual medical record review. METHODS: We studied 308,096 free-text machine-readable documents linked to individual entries in our electronic health records, representing care for patients with breast, GI, or thoracic cancer, whose treatment was initiated at one academic medical center, Stanford Health Care (SHC). Using a clinical text-mining tool, we detected unplanned episodes documented in clinician notes (for non-SHC visits) or in coded encounter data for SHC-delivered care and the most frequent symptoms documented in emergency department (ED) notes. RESULTS: Combined reporting increased the identification of patients with one or more unplanned care visits by 32% (15% using coded data; 20% using all the data) among patients with 3 months of follow-up and by 21% (23% using coded data; 28% using all the data) among those with 1 year of follow-up. Based on the textual analysis of SHC ED notes, pain (75%), followed by nausea (54%), vomiting (47%), infection (36%), fever (28%), and anemia (27%), were the most frequent symptoms mentioned. Pain, nausea, and vomiting co-occur in 35% of all ED encounter notes. CONCLUSION: The text-mining methods we describe can be applied to automatically review free-text clinician notes to detect unplanned episodes of care mentioned in these notes. These methods have broad application for quality improvement efforts in which events of interest occur outside of a network that allows for patient data sharing.

Subject(s)

Delivery of Health Care , Electronic Health Records , Medical Oncology , Neoplasms/therapy , Patient Care Planning , Academic Medical Centers , California , Data Mining , Emergency Service, Hospital , Hospitalization , Humans , Neoplasms/complications , Neoplasms/diagnosis , Risk Assessment , Risk Factors , Time Factors

14.

Building the graph of medicine from millions of clinical narratives.

Finlayson, Samuel G; LePendu, Paea; Shah, Nigam H.

Sci Data ; 1: 140032, 2014.

Article in English | MEDLINE | ID: mdl-25977789

ABSTRACT

Electronic health records (EHR) represent a rich and relatively untapped resource for characterizing the true nature of clinical practice and for quantifying the degree of inter-relatedness of medical entities such as drugs, diseases, procedures and devices. We provide a unique set of co-occurrence matrices, quantifying the pairwise mentions of 3 million terms mapped onto 1 million clinical concepts, calculated from the raw text of 20 million clinical notes spanning 19 years of data. Co-frequencies were computed by means of a parallelized annotation, hashing, and counting pipeline that was applied over clinical notes from Stanford Hospitals and Clinics. The co-occurrence matrix quantifies the relatedness among medical concepts which can serve as the basis for many statistical tests, and can be used to directly compute Bayesian conditional probabilities, association rules, as well as a range of test statistics such as relative risks and odds ratios. This dataset can be leveraged to quantitatively assess comorbidity, drug-drug, and drug-disease patterns for a range of clinical, epidemiological, and financial applications.

Subject(s)

Electronic Health Records , Medicine , Comorbidity , Drug Interactions , Drug Therapy , Humans , Medicine/trends , Risk

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL