Your browser doesn't support javascript.
Persistent minimal sequences of SARS-CoV-2.
Pratas, Diogo; Silva, Jorge M.
  • Pratas D; Institute of Electronics and Informatics Engineering of Aveiro, 3810-193 Aveiro, Portugal.
  • Silva JM; Department of Electronics, Telecommunications and Informatics, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal.
Bioinformatics ; 36(21): 5129-5132, 2021 01 29.
Article in English | MEDLINE | ID: covidwho-1343669
ABSTRACT
MOTIVATION Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused more than 14 million cases and more than half million deaths. Given the absence of implemented therapies, new analysis, diagnosis and therapeutics are of great importance.

RESULTS:

Analysis of SARS-CoV-2 genomes from the current outbreak reveals the presence of short persistent DNA/RNA sequences that are absent from the human genome and transcriptome (PmRAWs). For the PmRAWs with length 12, only four exist at the same location in all SARS-CoV-2. At the gene level, we found one PmRAW of size 13 at the Spike glycoprotein coding sequence. This protein is fundamental for binding in human ACE2 and further use as an entry receptor to invade target cells. Applying protein structural prediction, we localized this PmRAW at the surface of the Spike protein, providing a potential targeted vector for diagnostics and therapeutics. In addition, we show a new pattern of relative absent words (RAWs), characterized by the progressive increase of GC content (Guanine and Cytosine) according to the decrease of RAWs length, contrarily to the virus and host genome distributions. New analysis shows the same property during the Ebola virus outbreak. At a computational level, we improved the alignment-free method to identify pathogen-specific signatures in balance with GC measures and removed previous size limitations. AVAILABILITY AND IMPLEMENTATION https//github.com/cobilab/eagle. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Subject(s)

Full text: Available Collection: International databases Database: MEDLINE Main subject: Spike Glycoprotein, Coronavirus / COVID-19 Type of study: Prognostic study Limits: Humans Language: English Journal: Bioinformatics Journal subject: Medical Informatics Year: 2021 Document Type: Article Affiliation country: Bioinformatics

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Spike Glycoprotein, Coronavirus / COVID-19 Type of study: Prognostic study Limits: Humans Language: English Journal: Bioinformatics Journal subject: Medical Informatics Year: 2021 Document Type: Article Affiliation country: Bioinformatics