Your browser doesn't support javascript.
Data-driven analysis of amino acid change dynamics timely reveals SARS-CoV-2 variant emergence.
Bernasconi, Anna; Mari, Lorenzo; Casagrandi, Renato; Ceri, Stefano.
  • Bernasconi A; Departement of Electronics, Information, and Bioengineering, Politecnico di Milano, 20133, Milan, Italy. anna.bernasconi@polimi.it.
  • Mari L; Departement of Electronics, Information, and Bioengineering, Politecnico di Milano, 20133, Milan, Italy.
  • Casagrandi R; Departement of Electronics, Information, and Bioengineering, Politecnico di Milano, 20133, Milan, Italy.
  • Ceri S; Departement of Electronics, Information, and Bioengineering, Politecnico di Milano, 20133, Milan, Italy.
Sci Rep ; 11(1): 21068, 2021 10 26.
Article in English | MEDLINE | ID: covidwho-1493208
Preprint
This scientific journal article is probably based on a previously available preprint. It has been identified through a machine matching algorithm, human confirmation is still pending.
See preprint
ABSTRACT
Since its emergence in late 2019, the diffusion of SARS-CoV-2 is associated with the evolution of its viral genome. The co-occurrence of specific amino acid changes, collectively named 'virus variant', requires scrutiny (as variants may hugely impact the agent's transmission, pathogenesis, or antigenicity); variant evolution is studied using phylogenetics. Yet, never has this problem been tackled by digging into data with ad hoc analysis techniques. Here we show that the emergence of variants can in fact be traced through data-driven methods, further capitalizing on the value of large collections of SARS-CoV-2 sequences. For all countries with sufficient data, we compute weekly counts of amino acid changes, unveil time-varying clusters of changes with similar-rapidly growing-dynamics, and then follow their evolution. Our method succeeds in timely associating clusters to variants of interest/concern, provided their change composition is well characterized. This allows us to detect variants' emergence, rise, peak, and eventual decline under competitive pressure of another variant. Our early warning system, exclusively relying on deposited sequences, shows the power of big data in this context, and concurs to calling for the wide spreading of public SARS-CoV-2 genome sequencing for improved surveillance and control of the COVID-19 pandemic.
Subject(s)

Full text: Available Collection: International databases Database: MEDLINE Main subject: SARS-CoV-2 / COVID-19 Type of study: Observational study / Randomized controlled trials / Reviews Topics: Variants Limits: Humans Country/Region as subject: North America / Asia / Europa Language: English Journal: Sci Rep Year: 2021 Document Type: Article Affiliation country: S41598-021-00496-z

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: SARS-CoV-2 / COVID-19 Type of study: Observational study / Randomized controlled trials / Reviews Topics: Variants Limits: Humans Country/Region as subject: North America / Asia / Europa Language: English Journal: Sci Rep Year: 2021 Document Type: Article Affiliation country: S41598-021-00496-z