Your browser doesn't support javascript.
Accurate virus identification with interpretable Raman signatures by machine learning.
Ye, Jiarong; Yeh, Yin-Ting; Xue, Yuan; Wang, Ziyang; Zhang, Na; Liu, He; Zhang, Kunyan; Ricker, RyeAnne; Yu, Zhuohang; Roder, Allison; Perea Lopez, Nestor; Organtini, Lindsey; Greene, Wallace; Hafenstein, Susan; Lu, Huaguang; Ghedin, Elodie; Terrones, Mauricio; Huang, Shengxi; Huang, Sharon Xiaolei.
  • Ye J; College of Information Sciences and Technology, The Pennsylvania State University, University Park, PA 16802.
  • Yeh YT; Department of Physics, The Pennsylvania State University, University Park, PA 16802.
  • Xue Y; Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218.
  • Wang Z; Department of Electrical Engineering, The Pennsylvania State University, University Park, PA 16802.
  • Zhang N; Department of Physics, The Pennsylvania State University, University Park, PA 16802.
  • Liu H; Department of Physics, The Pennsylvania State University, University Park, PA 16802.
  • Zhang K; Department of Electrical Engineering, The Pennsylvania State University, University Park, PA 16802.
  • Ricker R; Department of Biomedical Engineering, George Washington University, Washington, DC 20052.
  • Yu Z; Systems Genomics Section, Laboratory of Parasitic Diseases, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20894.
  • Roder A; Department of Physics, The Pennsylvania State University, University Park, PA 16802.
  • Perea Lopez N; Systems Genomics Section, Laboratory of Parasitic Diseases, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20894.
  • Organtini L; Department of Physics, The Pennsylvania State University, University Park, PA 16802.
  • Greene W; Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802.
  • Hafenstein S; Department of Pathology and Laboratory Medicine, Division of Clinical Pathology, The Pennsylvania State University College of Medicine, Hershey, PA 17033.
  • Lu H; Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802.
  • Ghedin E; Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802.
  • Terrones M; Systems Genomics Section, Laboratory of Parasitic Diseases, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20894.
  • Huang S; Department of Physics, The Pennsylvania State University, University Park, PA 16802.
  • Huang SX; Department of Electrical Engineering, The Pennsylvania State University, University Park, PA 16802.
Proc Natl Acad Sci U S A ; 119(23): e2118836119, 2022 06 07.
Article in English | MEDLINE | ID: covidwho-1890407
ABSTRACT
Rapid identification of newly emerging or circulating viruses is an important first step toward managing the public health response to potential outbreaks. A portable virus capture device, coupled with label-free Raman spectroscopy, holds the promise of fast detection by rapidly obtaining the Raman signature of a virus followed by a machine learning (ML) approach applied to recognize the virus based on its Raman spectrum, which is used as a fingerprint. We present such an ML approach for analyzing Raman spectra of human and avian viruses. A convolutional neural network (CNN) classifier specifically designed for spectral data achieves very high accuracy for a variety of virus type or subtype identification tasks. In particular, it achieves 99% accuracy for classifying influenza virus type A versus type B, 96% accuracy for classifying four subtypes of influenza A, 95% accuracy for differentiating enveloped and nonenveloped viruses, and 99% accuracy for differentiating avian coronavirus (infectious bronchitis virus [IBV]) from other avian viruses. Furthermore, interpretation of neural net responses in the trained CNN model using a full-gradient algorithm highlights Raman spectral ranges that are most important to virus identification. By correlating ML-selected salient Raman ranges with the signature ranges of known biomolecules and chemical functional groups­for example, amide, amino acid, and carboxylic acid­we verify that our ML model effectively recognizes the Raman signatures of proteins, lipids, and other vital functional groups present in different viruses and uses a weighted combination of these signatures to identify viruses.
Subject(s)
Keywords

Full text: Available Collection: International databases Database: MEDLINE Main subject: Viruses / Neural Networks, Computer / Machine Learning Type of study: Experimental Studies / Randomized controlled trials Language: English Journal: Proc Natl Acad Sci U S A Year: 2022 Document Type: Article

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Viruses / Neural Networks, Computer / Machine Learning Type of study: Experimental Studies / Randomized controlled trials Language: English Journal: Proc Natl Acad Sci U S A Year: 2022 Document Type: Article