Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 2 de 2
Filter
Add more filters










Language
Publication year range
1.
Preprint in English | bioRxiv | ID: ppbiorxiv-442029

ABSTRACT

Three prevalent SARS-CoV-2 Variants of Concern (VOCs) were emerged and caused epidemic waves. It is essential to uncover the key genetic changes that cause the high transmissibility of VOCs. However, different viral mutations are generally tightly linked so traditional population genetic methods may not reliably detect beneficial mutation. In this study, we proposed a new pandemic-scale phylogenomic approach to detect mutations crucial to transmissibility. We analyzed 3,646,973 high-quality SARS-CoV-2 genomic sequences and the epidemiology metadata. Based on the sequential occurrence order of mutations and the instantaneously accelerated furcation rate, the analysis revealed that two non-coding mutations at the position of 28271 (g.a28271-/t) might be crucial for the high transmissibility of Alpha, Delta and Omicron VOCs. Both two mutations cause an A-to-T change at the core Kozak site of the N gene. The analysis also revealed that the non-coding mutations (g.a28271-/t) alone are unlikely to cause high viral transmissibility, indicating epistasis or multilocus interaction in viral transmissibility. A convergent evolutionary analysis revealed that g.a28271-/t, S:P681H/R and N:R203K/M occur independently in the three-VOC lineages, suggesting a potential interaction among these mutations. Therefore, this study unveils that non-synonymous and non-coding mutations could affect the transmissibility synergistically.

2.
Preprint in English | medRxiv | ID: ppmedrxiv-20248612

ABSTRACT

Genomic epidemiology is important to study the COVID-19 pandemic and more than two million SARS-CoV-2 genomic sequences were deposited into public databases. However, the exponential increase of sequences invokes unprecedented bioinformatic challenges. Here, we present the Coronavirus GenBrowser (CGB) based on a highly efficient analysis framework and a movie maker strategy. In total, 1,002,739 high quality genomic sequences with the transmission-related metadata were analyzed and visualized. The size of the core data file is only 12.20 MB, efficient for clean data sharing. Quick visualization modules and rich interactive operations are provided to explore the annotated SARS-CoV-2 evolutionary tree. CGB binary nomenclature is proposed to name each internal lineage. The pre-analyzed data can be filtered out according to the user-defined criteria to explore the transmission of SARS-CoV-2. Different evolutionary analyses can also be easily performed, such as the detection of accelerated evolution and on-going positive selection. Moreover, the 75 genomic spots conserved in SARS-CoV-2 but non-conserved in other coronaviruses were identified, which may indicate the functional elements specifically important for SARS-CoV-2. The CGB not only enables users who have no programming skills to analyze millions of genomic sequences, but also offers a panoramic vision of the transmission and evolution of SARS-CoV-2.

SELECTION OF CITATIONS
SEARCH DETAIL
...