This article is a Preprint
Preprints are preliminary research reports that have not been certified by peer review. They should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Preprints posted online allow authors to receive rapid feedback and the entire scientific community can appraise the work for themselves and respond appropriately. Those comments are posted alongside the preprints for anyone to read them and serve as a post publication assessment.
RNA genome conservation and secondary structure in SARS-CoV-2 and SARS-related viruses
Preprint
in English
| bioRxiv
| ID: ppbiorxiv-012906
ABSTRACT
As the COVID-19 outbreak spreads, there is a growing need for a compilation of conserved RNA genome regions in the SARS-CoV-2 virus along with their structural propensities to guide development of antivirals and diagnostics. Using sequence alignments spanning a range of betacoronaviruses, we rank genomic regions by RNA sequence conservation, identifying 79 regions of length at least 15 nucleotides as exactly conserved over SARS-related complete genome sequences available near the beginning of the COVID-19 outbreak. We then confirm the conservation of the majority of these genome regions across 739 SARS-CoV-2 sequences reported to date from the current COVID-19 outbreak, and we present a curated list of 30 SARS-related-conserved regions. We find that known RNA structured elements curated as Rfam families and in prior literature are enriched in these conserved genome regions, and we predict additional conserved, stable secondary structures across the viral genome. We provide 106 SARS-CoV-2-conserved-structured regions as potential targets for antivirals that bind to structured RNA. We further provide detailed secondary structure models for the 5 UTR, frame-shifting element, and 3 UTR. Last, we predict regions of the SARS-CoV-2 viral genome have low propensity for RNA secondary structure and are conserved within SARS-CoV-2 strains. These 59 SARS-CoV-2-conserved-unstructured genomic regions may be most easily targeted in primer-based diagnostic and oligonucleotide-based therapeutic strategies.
cc_by
Full text:
Available
Collection:
Preprints
Database:
bioRxiv
Type of study:
Prognostic study
Language:
English
Year:
2020
Document type:
Preprint