Integrated COVID-19 Predictor: Differential expression analysis to reveal potential biomarkers and prediction of coronavirus using RNA-Seq profile data.
Iqbal, Naiyar; Kumar, Pradeep.
  • Iqbal N; Department of Computer Science and Information Technology, Maulana Azad National Urdu University, Hyderabad, Telangana, India. Electronic address: naiyariqbal.rs@manuu.edu.in.
  • Kumar P; Department of Computer Science and Information Technology, Maulana Azad National Urdu University, Hyderabad, Telangana, India. Electronic address: drpkumar1402@gmail.com.
Comput Biol Med ; 147: 105684, 2022 08.
Article in English | MEDLINE | ID: covidwho-1930823
ABSTRACT

BACKGROUND:

The world has been battling the ongoing COVID-19 pandemic, caused by the SARS-CoV-2 virus, for the last two years. Predicting viral disease has long been a matter of interest in virology and the study of disease transmission.

OBJECTIVE:

In this study, we aimed to carry out genome association studies using COVID-19 RNA-Seq data, to reveal highly expressed gene biomarkers, and to build a machine learning model that predicts COVID-19, in order to help combat this pandemic.

METHOD:

We collected RNA-Seq gene count data for both healthy (Control) and non-healthy (Treated) COVID-19 cases. A sequence of bioinformatics strategies and statistical techniques, such as fold-change and adjusted p-value thresholds, was applied to identify differentially expressed genes (DEGs). We filtered biomarker sets of high, moderate, and low DEGs using the DESeq2, Limma Trend, and Limma Voom methods, combined through intersection and union operations, and applied machine learning techniques to predict COVID-19.
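The intersection and union step can be sketched as follows. This is only an illustration of the set-theory idea, assuming each method has already produced its own set of DEG identifiers. The function name `consensus_sets`, the dictionary keys, and the gene identifiers are hypothetical, and the abstract does not say exactly how the operations were combined to build each biomarker set.

```python
from typing import Dict, Set


def consensus_sets(calls: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """Combine per-method DEG calls.

    "intersection" holds genes called by every method,
    "union" holds genes called by at least one method.
    """
    gene_sets = list(calls.values())
    if not gene_sets:
        return {"intersection": set(), "union": set()}
    return {
        "intersection": set.intersection(*gene_sets),
        "union": set.union(*gene_sets),
    }


# Hypothetical gene identifiers, for illustration only (not from the study).
deg_calls = {
    "DESeq2": {"GENE_A", "GENE_B", "GENE_C"},
    "limma_trend": {"GENE_A", "GENE_B", "GENE_D"},
    "limma_voom": {"GENE_A", "GENE_C", "GENE_D"},
}

result = consensus_sets(deg_calls)
print(sorted(result["intersection"]))
print(sorted(result["union"]))
```

The intersection gives a stricter set on which all three methods agree, while the union gives a broader candidate set.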

RESULT:

Through experimental analysis, 67 potential biomarkers were extracted, comprising 49 up-regulated and 18 down-regulated genes, using statistical techniques and a set-theory consensus strategy. We trained the machine learning models on 12 different biomarker sets and found that the SVM model outperformed the other classifiers, reaching 99.07% classification accuracy on the moderate DEGs set.

CONCLUSION:

Our study revealed that the differentially expressed genes in the moderate DEGs biomarker set (|log2FC| ≥ 2 with adjusted p-value < 0.05) work effectively as input features for a machine learning model that uses a kernel-based SVM technique to predict COVID-19.
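The moderate DEGs criterion can be expressed as a simple filter. The sketch below assumes per-gene results are already available as a log2 fold change and an adjusted p-value. The `GeneStat` record, the `filter_degs` function, and the sample values are hypothetical. The cutoffs for the high and low DEG sets are not given in the abstract, so only the moderate thresholds appear as defaults.

```python
from typing import Iterable, List, NamedTuple, Tuple


class GeneStat(NamedTuple):
    gene: str
    log2fc: float  # log2 fold change, Treated vs. Control
    padj: float    # adjusted p-value


def filter_degs(
    stats: Iterable[GeneStat],
    min_abs_log2fc: float = 2.0,  # moderate DEGs: |log2FC| >= 2
    max_padj: float = 0.05,       # moderate DEGs: adjusted p-value < 0.05
) -> Tuple[List[str], List[str]]:
    """Return (up_regulated, down_regulated) gene names passing both cutoffs."""
    up: List[str] = []
    down: List[str] = []
    for s in stats:
        if s.padj < max_padj and abs(s.log2fc) >= min_abs_log2fc:
            (up if s.log2fc > 0 else down).append(s.gene)
    return up, down


# Made-up values for illustration only (not from the study).
sample = [
    GeneStat("GENE_A", 3.1, 0.001),   # passes, up-regulated
    GeneStat("GENE_B", -2.0, 0.01),   # passes, down-regulated (|log2FC| == 2)
    GeneStat("GENE_C", 1.5, 0.001),   # fails the fold-change cutoff
    GeneStat("GENE_D", 4.0, 0.2),     # fails the adjusted p-value cutoff
]

up_genes, down_genes = filter_degs(sample)
print(up_genes, down_genes)
```

Genes that pass both cutoffs would then serve as the input features of the classifier.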

Full text: Available Collection: International databases Database: MEDLINE Main subject: COVID-19 Type of study: Diagnostic study / Observational study / Prognostic study Topics: Long Covid Limits: Humans Language: English Journal: Comput Biol Med Year: 2022 Document Type: Article
