SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes.
Jungreis, Irwin; Sealfon, Rachel; Kellis, Manolis.
  • Jungreis I; MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, USA. iljungr@csail.mit.edu.
  • Sealfon R; Broad Institute of MIT and Harvard, Cambridge, MA, USA. iljungr@csail.mit.edu.
  • Kellis M; Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA.
Nat Commun ; 12(1): 2642, 2021 05 11.
Article in English | MEDLINE | ID: covidwho-1225505
Preprint
This scientific journal article is probably based on a previously available preprint. It was identified by a machine matching algorithm; human confirmation is still pending.
ABSTRACT
Despite its clinical importance, the SARS-CoV-2 gene set remains unresolved, hindering dissection of COVID-19 biology. We use comparative genomics to provide a high-confidence protein-coding gene set, characterize evolutionary constraint, and prioritize functional mutations. We select 44 Sarbecovirus genomes at ideally-suited evolutionary distances, and quantify protein-coding evolutionary signatures and overlapping constraint. We find strong protein-coding signatures for ORFs 3a, 6, 7a, 7b, 8, 9b, and a novel alternate-frame gene, ORF3c, whereas ORFs 2b, 3d/3d-2, 3b, 9c, and 10 lack protein-coding signatures or convincing experimental evidence of protein-coding function. Furthermore, we show no other conserved protein-coding genes remain to be discovered. Mutation analysis suggests ORF8 contributes to within-individual fitness but not person-to-person transmission. Cross-strain and within-strain evolutionary pressures agree, except for fewer-than-expected within-strain mutations in nsp3 and S1, and more-than-expected in nucleocapsid, which shows a cluster of mutations in a predicted B-cell epitope, suggesting immune-avoidance selection. Evolutionary histories of residues disrupted by spike-protein substitutions D614G, N501Y, E484K, and K417N/T provide clues about their biology, and we catalog likely-functional co-inherited mutations. Previously reported RNA-modification sites show no enrichment for conservation. Here we report a high-confidence gene set and evolutionary-history annotations providing valuable resources and insights on SARS-CoV-2 biology, mutations, and evolution.
Full text: Available
Collection: International databases
Database: MEDLINE
Main subject: Genome, Viral / SARS-CoV-2 / COVID-19 / Mutation
Type of study: Experimental Studies / Prognostic study / Randomized controlled trials
Language: English
Journal: Nat Commun
Journal subject: Biology / Science
Year: 2021
Document Type: Article
Affiliation country: S41467-021-22905-7
