Your browser doesn't support javascript.
Automated machine learning optimizes and accelerates predictive modeling from COVID-19 high throughput datasets.
Papoutsoglou, Georgios; Karaglani, Makrina; Lagani, Vincenzo; Thomson, Naomi; Røe, Oluf Dimitri; Tsamardinos, Ioannis; Chatzaki, Ekaterini.
  • Papoutsoglou G; JADBio, Gnosis Data Analysis PC, Science and Technology Park of Crete, N. Plastira 100, Vassilika Vouton, 70013, Heraklion, Crete, Greece.
  • Karaglani M; Computer Science Department, University of Crete, Voutes Campus, 70013, Heraklion, Crete, Greece.
  • Lagani V; JADBio, Gnosis Data Analysis PC, Science and Technology Park of Crete, N. Plastira 100, Vassilika Vouton, 70013, Heraklion, Crete, Greece.
  • Thomson N; Laboratory of Pharmacology, Medical School, Democritus University of Thrace, 68100, Alexandroupolis, Greece.
  • Røe OD; JADBio, Gnosis Data Analysis PC, Science and Technology Park of Crete, N. Plastira 100, Vassilika Vouton, 70013, Heraklion, Crete, Greece.
  • Tsamardinos I; Institute of Chemical Biology, Ilia State University, Kakutsa Cholokashvili Ave 3/5, 0162, Tbilisi, Georgia.
  • Chatzaki E; JADBio, Gnosis Data Analysis PC, Science and Technology Park of Crete, N. Plastira 100, Vassilika Vouton, 70013, Heraklion, Crete, Greece.
Sci Rep ; 11(1): 15107, 2021 07 23.
Article in English | MEDLINE | ID: covidwho-1322506
ABSTRACT
COVID-19 outbreak brings intense pressure on healthcare systems, with an urgent demand for effective diagnostic, prognostic and therapeutic procedures. Here, we employed Automated Machine Learning (AutoML) to analyze three publicly available high throughput COVID-19 datasets, including proteomic, metabolomic and transcriptomic measurements. Pathway analysis of the selected features was also performed. Analysis of a combined proteomic and metabolomic dataset led to 10 equivalent signatures of two features each, with AUC 0.840 (CI 0.723-0.941) in discriminating severe from non-severe COVID-19 patients. A transcriptomic dataset led to two equivalent signatures of eight features each, with AUC 0.914 (CI 0.865-0.955) in identifying COVID-19 patients from those with a different acute respiratory illness. Another transcriptomic dataset led to two equivalent signatures of nine features each, with AUC 0.967 (CI 0.899-0.996) in identifying COVID-19 patients from virus-free individuals. Signature predictive performance remained high upon validation. Multiple new features emerged and pathway analysis revealed biological relevance by implication in Viral mRNA Translation, Interferon gamma signaling and Innate Immune System pathways. In conclusion, AutoML analysis led to multiple biosignatures of high predictive performance, with reduced features and large choice of alternative predictors. These favorable characteristics are eminent for development of cost-effective assays to contribute to better disease management.
Subject(s)

Full text: Available Collection: International databases Database: MEDLINE Main subject: Machine Learning / SARS-CoV-2 / COVID-19 / Immunity, Innate Type of study: Diagnostic study / Prognostic study Limits: Humans Language: English Journal: Sci Rep Year: 2021 Document Type: Article Affiliation country: S41598-021-94501-0

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Machine Learning / SARS-CoV-2 / COVID-19 / Immunity, Innate Type of study: Diagnostic study / Prognostic study Limits: Humans Language: English Journal: Sci Rep Year: 2021 Document Type: Article Affiliation country: S41598-021-94501-0