Your browser doesn't support javascript.
Deep learning models for COVID-19 chest x-ray classification: Preventing shortcut learning using feature disentanglement.
Trivedi, Anusua; Robinson, Caleb; Blazes, Marian; Ortiz, Anthony; Desbiens, Jocelyn; Gupta, Sunil; Dodhia, Rahul; Bhatraju, Pavan K; Liles, W Conrad; Kalpathy-Cramer, Jayashree; Lee, Aaron Y; Lavista Ferres, Juan M.
  • Trivedi A; AI for Good Research Lab, Microsoft, Redmond, WA, United States of America.
  • Robinson C; Flipkart US R&D, Seattle, WA, United States of America.
  • Blazes M; AI for Good Research Lab, Microsoft, Redmond, WA, United States of America.
  • Ortiz A; Department of Ophthalmology, University of Washington, Seattle, WA, United States of America.
  • Desbiens J; AI for Good Research Lab, Microsoft, Redmond, WA, United States of America.
  • Gupta S; Intelligent Retinal Imaging Systems, Pensacola, FL, United States of America.
  • Dodhia R; Intelligent Retinal Imaging Systems, Pensacola, FL, United States of America.
  • Bhatraju PK; AI for Good Research Lab, Microsoft, Redmond, WA, United States of America.
  • Liles WC; Department of Medicine and Sepsis Center of Research Excellence, University of Washington (SCORE-UW), Seattle, WA, United States of America.
  • Kalpathy-Cramer J; Division of Pulmonary, Critical Care and Sleep Medicine, University of Washington, Seattle, WA, United States of America.
  • Lee AY; Department of Medicine and Sepsis Center of Research Excellence, University of Washington (SCORE-UW), Seattle, WA, United States of America.
  • Lavista Ferres JM; Neuroscience, Massachusetts General Hospital, Boston, MA, United States of America.
PLoS One ; 17(10): e0274098, 2022.
Article in English | MEDLINE | ID: covidwho-2054336
ABSTRACT
In response to the COVID-19 global pandemic, recent research has proposed creating deep learning based models that use chest radiographs (CXRs) in a variety of clinical tasks to help manage the crisis. However, the size of existing datasets of CXRs from COVID-19+ patients are relatively small, and researchers often pool CXR data from multiple sources, for example, using different x-ray machines in various patient populations under different clinical scenarios. Deep learning models trained on such datasets have been shown to overfit to erroneous features instead of learning pulmonary characteristics in a phenomenon known as shortcut learning. We propose adding feature disentanglement to the training process. This technique forces the models to identify pulmonary features from the images and penalizes them for learning features that can discriminate between the original datasets that the images come from. We find that models trained in this way indeed have better generalization performance on unseen data; in the best case we found that it improved AUC by 0.13 on held out data. We further find that this outperforms masking out non-lung parts of the CXRs and performing histogram equalization, both of which are recently proposed methods for removing biases in CXR datasets.
Subject(s)

Full text: Available Collection: International databases Database: MEDLINE Main subject: Deep Learning / COVID-19 Type of study: Prognostic study Limits: Humans Language: English Journal: PLoS One Journal subject: Science / Medicine Year: 2022 Document Type: Article Affiliation country: Journal.pone.0274098

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Deep Learning / COVID-19 Type of study: Prognostic study Limits: Humans Language: English Journal: PLoS One Journal subject: Science / Medicine Year: 2022 Document Type: Article Affiliation country: Journal.pone.0274098