Your browser doesn't support javascript.
Analysis of genomic distributions of SARS-CoV-2 reveals a dominant strain type with strong allelic associations.
Yang, Hsin-Chou; Chen, Chun-Houh; Wang, Jen-Hung; Liao, Hsiao-Chi; Yang, Chih-Ting; Chen, Chia-Wei; Lin, Yin-Chun; Kao, Chiun-How; Lu, Mei-Yeh Jade; Liao, James C.
  • Yang HC; Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan; hsinchou@stat.sinica.edu.tw liaoj@gate.sinica.edu.tw.
  • Chen CH; Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan.
  • Wang JH; Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan.
  • Liao HC; Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan.
  • Yang CT; Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan.
  • Chen CW; Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan.
  • Lin YC; Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan.
  • Kao CH; Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan.
  • Lu MJ; Department of Statistics, Tamkang University, New Taipei City 251301, Taiwan.
  • Liao JC; Biodiversity Research Center, Academia Sinica, Taipei 11529, Taiwan.
Proc Natl Acad Sci U S A ; 117(48): 30679-30686, 2020 12 01.
Article in English | MEDLINE | ID: covidwho-922310
ABSTRACT
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causal agent of COVID 19, continues to evolve since its first emergence in December 2019. Using the complete sequences of 1,932 SARS-CoV-2 genomes, various clustering analyses consistently identified six types of the strains. Independent of the dendrogram construction, 13 signature variations in the form of single nucleotide variations (SNVs) in protein coding regions and one SNV in the 5' untranslated region (UTR) were identified and provided a direct interpretation for the six types (types I to VI). The six types of the strains and their underlying signature SNVs were validated in two subsequent analyses of 6,228 and 38,248 SARS-CoV-2 genomes which became available later. To date, type VI, characterized by the four signature SNVs C241T (5'UTR), C3037T (nsp3 F924F), C14408T (nsp12 P4715L), and A23403G (Spike D614G), with strong allelic associations, has become the dominant type. Since C241T is in the 5' UTR with uncertain significance and the characteristics can be captured by the other three strongly associated SNVs, we focus on the other three. The increasing frequency of the type VI haplotype 3037T-14408T-23403G in the majority of the submitted samples in various countries suggests a possible fitness gain conferred by the type VI signature SNVs. The fact that strains missing one or two of these signature SNVs fail to persist implies possible interactions among these SNVs. Later SNVs such as G28881A, G28882A, and G28883C have emerged with strong allelic associations, forming new subtypes. This study suggests that SNVs may become an important consideration in SARS-CoV-2 classification and surveillance.
Subject(s)
Keywords

Full text: Available Collection: International databases Database: MEDLINE Main subject: Genome, Viral / Genomics / Alleles / SARS-CoV-2 Type of study: Prognostic study Limits: Humans Language: English Journal: Proc Natl Acad Sci U S A Year: 2020 Document Type: Article

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Genome, Viral / Genomics / Alleles / SARS-CoV-2 Type of study: Prognostic study Limits: Humans Language: English Journal: Proc Natl Acad Sci U S A Year: 2020 Document Type: Article