Search | VHL Regional Portal

Evolutionary insights into a non-coding deletion of SARS-CoV-2 B.1.1.7

Jianing Yang; Guoqing Zhang; Dalang Yu; Ruifang Cao; Xiaoxian Wu; Yunchao Ling; Yi-Hsuan Pan; Chunyan Yi; Xiaoyu Sun; Bing Sun; Yu Zhang; Guo-Ping Zhao; Yixue Li; Haipeng Li.

Preprint in English | bioRxiv | ID: ppbiorxiv-442029

ABSTRACT

Three prevalent SARS-CoV-2 Variants of Concern (VOCs) were emerged and caused epidemic waves. It is essential to uncover the key genetic changes that cause the high transmissibility of VOCs. However, different viral mutations are generally tightly linked so traditional population genetic methods may not reliably detect beneficial mutation. In this study, we proposed a new pandemic-scale phylogenomic approach to detect mutations crucial to transmissibility. We analyzed 3,646,973 high-quality SARS-CoV-2 genomic sequences and the epidemiology metadata. Based on the sequential occurrence order of mutations and the instantaneously accelerated furcation rate, the analysis revealed that two non-coding mutations at the position of 28271 (g.a28271-/t) might be crucial for the high transmissibility of Alpha, Delta and Omicron VOCs. Both two mutations cause an A-to-T change at the core Kozak site of the N gene. The analysis also revealed that the non-coding mutations (g.a28271-/t) alone are unlikely to cause high viral transmissibility, indicating epistasis or multilocus interaction in viral transmissibility. A convergent evolutionary analysis revealed that g.a28271-/t, S:P681H/R and N:R203K/M occur independently in the three-VOC lineages, suggesting a potential interaction among these mutations. Therefore, this study unveils that non-synonymous and non-coding mutations could affect the transmissibility synergistically.

Coronavirus GenBrowser for monitoring adaptive evolution and transmission of SARS-CoV-2

Dalang Yu; Xiao Yang; Bixia Tang; Yi-Hsuan Pan; Jianing Yang; Guangya Duan; Junwei Zhu; Zi-Qian Hao; Hailong Mu; Long Dai; Wangjie Hu; Mochen Zhang; Ying Cui; Tong Jin; Cuiping Li; Lina Ma; - Language translation team; Xiao Su; Guo-Qing Zhang; Wenming Zhao; Haipeng Li.

Preprint in English | medRxiv | ID: ppmedrxiv-20248612

ABSTRACT

Genomic epidemiology is important to study the COVID-19 pandemic and more than two million SARS-CoV-2 genomic sequences were deposited into public databases. However, the exponential increase of sequences invokes unprecedented bioinformatic challenges. Here, we present the Coronavirus GenBrowser (CGB) based on a highly efficient analysis framework and a movie maker strategy. In total, 1,002,739 high quality genomic sequences with the transmission-related metadata were analyzed and visualized. The size of the core data file is only 12.20 MB, efficient for clean data sharing. Quick visualization modules and rich interactive operations are provided to explore the annotated SARS-CoV-2 evolutionary tree. CGB binary nomenclature is proposed to name each internal lineage. The pre-analyzed data can be filtered out according to the user-defined criteria to explore the transmission of SARS-CoV-2. Different evolutionary analyses can also be easily performed, such as the detection of accelerated evolution and on-going positive selection. Moreover, the 75 genomic spots conserved in SARS-CoV-2 but non-conserved in other coronaviruses were identified, which may indicate the functional elements specifically important for SARS-CoV-2. The CGB not only enables users who have no programming skills to analyze millions of genomic sequences, but also offers a panoramic vision of the transmission and evolution of SARS-CoV-2.

ABSTRACT

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL