Your browser doesn't support javascript.
Phylogenetic Analysis of SARS-CoV-2 Data Is Difficult.
Morel, Benoit; Barbera, Pierre; Czech, Lucas; Bettisworth, Ben; Hübner, Lukas; Lutteropp, Sarah; Serdari, Dora; Kostaki, Evangelia-Georgia; Mamais, Ioannis; Kozlov, Alexey M; Pavlidis, Pavlos; Paraskevis, Dimitrios; Stamatakis, Alexandros.
  • Morel B; Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
  • Barbera P; Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
  • Czech L; Department of Plant Biology, Carnegie Institution for Science, Stanford, CA, USA.
  • Bettisworth B; Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
  • Hübner L; Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
  • Lutteropp S; Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany.
  • Serdari D; Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
  • Kostaki EG; Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
  • Mamais I; Department of Hygiene Epidemiology and Medical Statistics, School of Medicine, National and Kapodistrian University of Athens, Athens, Greece.
  • Kozlov AM; Department of Health Sciences, European University Cyprus, Nicosia, Cyprus.
  • Pavlidis P; Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
  • Paraskevis D; Institute of Computer Science, Foundation for Research and Technology-Hellas, Crete, Greece.
  • Stamatakis A; Department of Hygiene Epidemiology and Medical Statistics, School of Medicine, National and Kapodistrian University of Athens, Athens, Greece.
Mol Biol Evol ; 38(5): 1777-1791, 2021 05 04.
Article in English | MEDLINE | ID: covidwho-975301
Preprint
This scientific journal article is probably based on a previously available preprint. It has been identified through a machine matching algorithm, human confirmation is still pending.
See preprint
ABSTRACT
Numerous studies covering some aspects of SARS-CoV-2 data analyses are being published on a daily basis, including a regularly updated phylogeny on nextstrain.org. Here, we review the difficulties of inferring reliable phylogenies by example of a data snapshot comprising a quality-filtered subset of 8,736 out of all 16,453 virus sequences available on May 5, 2020 from gisaid.org. We find that it is difficult to infer a reliable phylogeny on these data due to the large number of sequences in conjunction with the low number of mutations. We further find that rooting the inferred phylogeny with some degree of confidence either via the bat and pangolin outgroups or by applying novel computational methods on the ingroup phylogeny does not appear to be credible. Finally, an automatic classification of the current sequences into subclasses using the mPTP tool for molecular species delimitation is also, as might be expected, not possible, as the sequences are too closely related. We conclude that, although the application of phylogenetic methods to disentangle the evolution and spread of COVID-19 provides some insight, results of phylogenetic analyses, in particular those conducted under the default settings of current phylogenetic inference tools, as well as downstream analyses on the inferred phylogenies, should be considered and interpreted with extreme caution.
Subject(s)
Keywords

Full text: Available Collection: International databases Database: MEDLINE Main subject: Phylogeny / Genome, Viral / Evolution, Molecular / SARS-CoV-2 / COVID-19 / Mutation Type of study: Randomized controlled trials Limits: Humans Language: English Journal: Mol Biol Evol Journal subject: Molecular Biology Year: 2021 Document Type: Article Affiliation country: Molbev

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Phylogeny / Genome, Viral / Evolution, Molecular / SARS-CoV-2 / COVID-19 / Mutation Type of study: Randomized controlled trials Limits: Humans Language: English Journal: Mol Biol Evol Journal subject: Molecular Biology Year: 2021 Document Type: Article Affiliation country: Molbev