ABSTRACT
Since the latter part of 2020, SARS-CoV-2 evolution has been characterised by the emergence of viral variants associated with distinct biological characteristics. While the main research focus has centred on the ability of new variants to increase in frequency and impact the effective reproductive number of the virus, less attention has been placed on their relative ability to establish transmission chains and to spread through a geographic area. Here, we describe a phylogeographic approach to estimate and compare the introduction and dispersal dynamics of the main SARS-CoV-2 variants - Alpha, Iota, Delta, and Omicron - that circulated in the New York City area between 2020 and 2022. Notably, our results indicate that Delta had a lower ability to establish sustained transmission chains in the NYC area and that Omicron (BA.1) was the variant fastest to disseminate across the study area. The analytical approach presented here complements non-spatially-explicit analytical approaches that seek a better understanding of the epidemiological differences that exist among successive SARS-CoV-2 variants of concern.
Subject(s)
COVID-19 , Humans , COVID-19/epidemiology , New York City/epidemiology , SARS-CoV-2/genetics , Fasting
ABSTRACT
The COVID-19 pandemic galvanized the field of virus genomic surveillance, demonstrating its utility for public health. Now, we must harness the momentum that led to increased infrastructure, training, and political will to build a sustainable global genomic surveillance network for other epidemic and endemic viruses. We suggest a generalizable modular sequencing framework wherein users can easily switch between virus targets to maximize cost-effectiveness and maintain readiness for new threats. We also highlight challenges associated with genomic surveillance, particularly where global inequalities persist. We propose solutions to mitigate some of these issues, including training and multilateral partnerships. Exploring alternatives to clinical sequencing can also reduce the cost of surveillance programs. Finally, we discuss how establishing genomic surveillance would aid control programs and potentially provide a warning system for outbreaks, using a global respiratory virus (RSV), an arbovirus (dengue virus), and a regional zoonotic virus (Lassa virus) as examples.
ABSTRACT
The scale of data produced during the SARS-CoV-2 pandemic has been unprecedented, with more than 13 million sequences shared publicly at the time of writing. This wealth of sequence data provides important context for interpreting local outbreaks. However, placing sequences of interest into national and international context is difficult given the size of the global dataset. Often outbreak investigations and genomic surveillance efforts require running similar analyses again and again on the latest dataset and producing reports. We developed civet (cluster investigation and virus epidemiology tool) to aid these routine analyses and facilitate virus outbreak investigation and surveillance. Civet can place sequences of interest in the local context of background diversity, resolving the query into different 'catchments' and presenting the phylogenetic results alongside metadata in an interactive, distributable report. Civet can be used on a fine scale for clinical outbreak investigation, for local surveillance and cluster discovery, and to routinely summarise the virus diversity circulating on a national level. Civet reports have helped researchers and public health bodies feed back genomic information in the appropriate context within a timeframe that is useful for public health.
ABSTRACT
The chronic infection hypothesis for novel SARS-CoV-2 variant emergence is increasingly gaining credence following the appearance of Omicron. Here we investigate intrahost evolution and genetic diversity of lineage B.1.517 during a SARS-CoV-2 chronic infection lasting for 471 days (and still ongoing) with consistently recovered infectious virus and high viral genome copies. During the infection, we find an accelerated virus evolutionary rate translating to 35 nucleotide substitutions per year, approximately two-fold higher than the global SARS-CoV-2 evolutionary rate. This intrahost evolution results in the emergence and persistence of at least three genetically distinct genotypes, suggesting the establishment of spatially structured viral populations continually reseeding different genotypes into the nasopharynx. Finally, we track the temporal dynamics of genetic diversity to identify advantageous mutations and highlight hallmark changes for chronic infection. Our findings demonstrate that untreated chronic infections accelerate SARS-CoV-2 evolution, providing an opportunity for the emergence of genetically divergent variants. Graphical abstract: To understand the intrahost evolution of SARS-CoV-2 from a single patient chronically infected for at least 471 days, Chaguza et al. use whole genome sequencing to estimate the evolutionary rate, the genetic divergence of viral lineages, relative mutation rates, and frequency of mutational variants during the course of the infection.
ABSTRACT
The chronic infection hypothesis for novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variant emergence is increasingly gaining credence following the appearance of Omicron. Here, we investigate intrahost evolution and genetic diversity of lineage B.1.517 during a SARS-CoV-2 chronic infection lasting for 471 days (and still ongoing) with consistently recovered infectious virus and high viral genome copies. During the infection, we find an accelerated virus evolutionary rate translating to 35 nucleotide substitutions per year, approximately 2-fold higher than the global SARS-CoV-2 evolutionary rate. This intrahost evolution results in the emergence and persistence of at least three genetically distinct genotypes, suggesting the establishment of spatially structured viral populations continually reseeding different genotypes into the nasopharynx. Finally, we track the temporal dynamics of genetic diversity to identify advantageous mutations and highlight hallmark changes for chronic infection. Our findings demonstrate that untreated chronic infections accelerate SARS-CoV-2 evolution, providing an opportunity for the emergence of genetically divergent variants.
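The rate comparison in this abstract can be restated per site as a rough worked example. The sketch below assumes a genome length of 29,903 nt (the Wuhan-Hu-1 reference; the abstract does not give a length) and reads "approximately 2-fold" as exactly 2, so the implied global figure is only illustrative and is not a value reported by the authors.

```python
# Back-of-the-envelope conversion of the abstract's rate to a per-site rate.
# ASSUMPTION: genome length of 29,903 nt (Wuhan-Hu-1 reference), not stated in the abstract.
GENOME_LENGTH_NT = 29_903


def per_site_rate(subs_per_year: float, genome_length: int = GENOME_LENGTH_NT) -> float:
    """Convert whole-genome substitutions per year to substitutions per site per year."""
    return subs_per_year / genome_length


# From the abstract: 35 nucleotide substitutions per year during the chronic infection.
intrahost_subs_per_year = 35.0

# ASSUMPTION: "approximately 2-fold higher" is treated as exactly 2 for illustration.
fold_increase = 2.0
implied_global_subs_per_year = intrahost_subs_per_year / fold_increase

intrahost_rate = per_site_rate(intrahost_subs_per_year)
implied_global_rate = per_site_rate(implied_global_subs_per_year)

print(f"intrahost: {intrahost_rate:.2e} substitutions/site/year")
print(f"implied global: {implied_global_rate:.2e} substitutions/site/year")
```

Under these assumptions the intrahost rate works out to roughly 1.2 x 10^-3 substitutions per site per year, about twice the implied global rate by construction.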
Subject(s)
COVID-19 , SARS-CoV-2 , Humans , Persistent Infection , Genome, Viral , Genotype
ABSTRACT
The first SARS-CoV-2 variant of concern (VOC) to be designated was lineage B.1.1.7, later labelled by the World Health Organization as Alpha. Originating in early autumn but discovered in December 2020, it spread rapidly and caused large waves of infections worldwide. The Alpha variant is notable for being defined by a long ancestral phylogenetic branch with an increased evolutionary rate, along which only two sequences have been sampled. Alpha genomes comprise a well-supported monophyletic clade within which the evolutionary rate is typical of SARS-CoV-2. The Alpha epidemic continued to grow despite the continued restrictions on social mixing across the UK and the imposition of new restrictions, in particular, the English national lockdown in November 2020. While these interventions succeeded in reducing the absolute number of cases, the impact of these non-pharmaceutical interventions was predominantly to drive the decline of the SARS-CoV-2 lineages that preceded Alpha. We investigate the only two sampled sequences that fall on the branch ancestral to Alpha. We find that one is likely to be a true intermediate sequence, providing information about the order of mutational events that led to Alpha. We explore alternate hypotheses that can explain how Alpha acquired a large number of mutations yet remained largely unobserved in a region of high genomic surveillance: an under-sampled geographical location, a non-human animal population, or a chronically infected individual. We conclude that the latter provides the best explanation of the observed behaviour and dynamics of the variant, although the individual need not be immunocompromised, as persistently infected immunocompetent hosts also display a higher within-host rate of evolution. 
Finally, we compare the ancestral branches and mutation profiles of other VOCs and find that Delta appears to be an outlier both in terms of the genomic locations of its defining mutations and a lack of the rapid evolutionary rate on its ancestral branch. As new variants, such as Omicron, continue to evolve (potentially through similar mechanisms), it remains important to investigate the origins of other variants to identify ways to potentially disrupt their evolution and emergence.
ABSTRACT
The SARS-CoV-2 Delta (Pango lineage B.1.617.2) variant of concern spread globally, causing resurgences of COVID-19 worldwide [1,2]. The emergence of the Delta variant in the UK occurred on the background of a heterogeneous landscape of immunity and relaxation of non-pharmaceutical interventions. Here we analyse 52,992 SARS-CoV-2 genomes from England together with 93,649 genomes from the rest of the world to reconstruct the emergence of Delta and quantify its introduction to and regional dissemination across England in the context of changing travel and social restrictions. Using analysis of human movement, contact tracing and virus genomic data, we find that the geographic focus of the expansion of Delta shifted from India to a more global pattern in early May 2021. In England, Delta lineages were introduced more than 1,000 times and spread nationally as non-pharmaceutical interventions were relaxed. We find that hotel quarantine for travellers reduced onward transmission from importations; however, the transmission chains that later dominated the Delta wave in England were seeded before travel restrictions were introduced. Increasing inter-regional travel within England drove the nationwide dissemination of Delta, with some cities receiving more than 2,000 observable lineage introductions from elsewhere. Subsequently, increased levels of local population mixing, and not the number of importations, were associated with the faster relative spread of Delta. The invasion dynamics of Delta depended on spatial heterogeneity in contact patterns, and our findings will inform optimal spatial interventions to reduce the transmission of current and future variants of concern, such as Omicron (Pango lineage B.1.1.529).
Subject(s)
COVID-19 , SARS-CoV-2 , COVID-19/epidemiology , COVID-19/prevention & control , COVID-19/transmission , COVID-19/virology , Cities/epidemiology , Contact Tracing , England/epidemiology , Genome, Viral/genetics , Humans , Quarantine/legislation & jurisprudence , SARS-CoV-2/genetics , SARS-CoV-2/growth & development , SARS-CoV-2/isolation & purification , Travel/legislation & jurisprudence
ABSTRACT
The availability of pathogen sequence data and use of genomic surveillance is rapidly increasing. Genomic tools and classification systems need updating to reflect this. Here, rabies virus is used as an example to showcase the potential value of updated genomic tools to enhance surveillance to better understand epidemiological dynamics and improve disease control. Previous studies have described the evolutionary history of rabies virus; however, the resulting taxonomy lacks the definition necessary to identify incursions, lineage turnover and transmission routes at high resolution. Here we propose a lineage classification system based on the dynamic nomenclature used for SARS-CoV-2, defining a lineage by phylogenetic methods for tracking virus spread and comparing sequences across geographic areas. We demonstrate this system through application to the globally distributed Cosmopolitan clade of rabies virus, defining 96 total lineages within the clade, beyond the 22 previously reported. We further show how integration of this tool with a new rabies virus sequence data resource (RABV-GLUE) enables rapid application, for example, highlighting lineage dynamics relevant to control and elimination programmes, such as identifying importations and their sources, as well as areas of persistence and routes of virus movement, including transboundary incursions. This system and the tools developed should be useful for coordinating and targeting control programmes and monitoring progress as countries work towards eliminating dog-mediated rabies, as well as having potential for broader application to the surveillance of other viruses.
Subject(s)
Phylogeny , Rabies virus , Rabies , Animals , Dogs , Genomics , Rabies/virology , Rabies virus/genetics
ABSTRACT
Understanding SARS-CoV-2 transmission in higher education settings is important to limit spread between students, and into at-risk populations. In this study, we sequenced 482 SARS-CoV-2 isolates from the University of Cambridge from 5 October to 6 December 2020. We perform a detailed phylogenetic comparison with 972 isolates from the surrounding community, complemented with epidemiological and contact tracing data, to determine transmission dynamics. We observe limited viral introductions into the university; the majority of student cases were linked to a single genetic cluster, likely following social gatherings at a venue outside the university. We identify considerable onward transmission associated with student accommodation and courses; this was effectively contained using local infection control measures and following a national lockdown. Transmission clusters were largely segregated within the university or the community. Our study highlights key determinants of SARS-CoV-2 transmission and effective interventions in a higher education setting that will inform public health policy during pandemics.
Subject(s)
COVID-19/epidemiology , COVID-19/transmission , SARS-CoV-2/genetics , Universities , COVID-19/prevention & control , COVID-19/virology , Contact Tracing , Genome, Viral/genetics , Genomics , Humans , Phylogeny , RNA, Viral/genetics , Risk Factors , SARS-CoV-2/classification , SARS-CoV-2/isolation & purification , Students , United Kingdom/epidemiology , Universities/statistics & numerical data
ABSTRACT
The SARS-CoV-2 epidemic in southern Africa has been characterized by three distinct waves. The first was associated with a mix of SARS-CoV-2 lineages, while the second and third waves were driven by the Beta (B.1.351) and Delta (B.1.617.2) variants, respectively [1-3]. In November 2021, genomic surveillance teams in South Africa and Botswana detected a new SARS-CoV-2 variant associated with a rapid resurgence of infections in Gauteng province, South Africa. Within three days of the first genome being uploaded, it was designated a variant of concern (Omicron, B.1.1.529) by the World Health Organization and, within three weeks, had been identified in 87 countries. The Omicron variant is exceptional for carrying over 30 mutations in the spike glycoprotein, which are predicted to influence antibody neutralization and spike function [4]. Here we describe the genomic profile and early transmission dynamics of Omicron, highlighting the rapid spread in regions with high levels of population immunity.
Subject(s)
COVID-19/epidemiology , COVID-19/virology , Immune Evasion , SARS-CoV-2/isolation & purification , Antibodies, Neutralizing/immunology , Botswana/epidemiology , COVID-19/immunology , COVID-19/transmission , Humans , Models, Molecular , Mutation , Phylogeny , Recombination, Genetic , SARS-CoV-2/classification , SARS-CoV-2/immunology , South Africa/epidemiology , Spike Glycoprotein, Coronavirus/genetics , Spike Glycoprotein, Coronavirus/immunology
ABSTRACT
Late in 2020, two genetically distinct clusters of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) with mutations of biological concern were reported, one in the United Kingdom and one in South Africa. Using a combination of data from routine surveillance, genomic sequencing and international travel, we track the international dispersal of lineages B.1.1.7 and B.1.351 (variant 501Y-V2). We account for potential biases in genomic surveillance efforts by including passenger volumes from the locations where each lineage was first reported, London and South Africa respectively. Using the software tool grinch (global report investigating novel coronavirus haplotypes), we track the international spread of lineages of concern with automated daily reports. Further, we have built a custom tracking website (cov-lineages.org/global_report.html) which hosts this daily report and will continue to include novel SARS-CoV-2 lineages of concern as they are detected.
ABSTRACT
Genomic epidemiology, which links pathogen genomes with associated metadata to understand disease transmission, has become a key component of outbreak response. Decreasing costs of genome sequencing and increasing computational power provide opportunities to generate and analyse large viral genomic datasets that aim to uncover the spatial scales of transmission, the demographics contributing to transmission patterns, and to forecast epidemic trends. Emerging sources of genomic data and associated metadata provide new opportunities to further unravel transmission patterns. Key challenges include how to integrate genomic data with metadata from multiple sources, how to generate efficient computational algorithms to cope with large datasets, and how to establish sampling frameworks to enable robust conclusions.
Subject(s)
Disease Outbreaks , Genome, Viral , Genome, Viral/genetics , Genomics
ABSTRACT
COVID-19 transmission rates are often linked to locally circulating strains of SARS-CoV-2. Here we describe 203 SARS-CoV-2 whole genome sequences analyzed from strains circulating in Rwanda from May 2020 to February 2021. In particular, we report a shift in variant distribution towards the emerging sub-lineage A.23.1 that is currently dominating. Furthermore, we report the detection of the first Rwandan cases of the B.1.1.7 and B.1.351 variants of concern among incoming travelers tested at Kigali International Airport. To assess the importance of viral introductions from neighboring countries and local transmission, we exploit available individual travel history metadata to inform spatio-temporal phylogeographic inference, enabling us to take into account infections from unsampled locations. We uncover an important role of neighboring countries in seeding introductions into Rwanda, including those from which no genomic sequences were available. Our results highlight the importance of systematic genomic surveillance and regional collaborations for a durable response towards combating COVID-19.
Subject(s)
COVID-19/virology , Genome, Viral/genetics , SARS-CoV-2/genetics , Travel-Related Illness , Adult , COVID-19/diagnosis , COVID-19/epidemiology , COVID-19/transmission , Epidemiological Monitoring , Female , Humans , Male , Phylogeny , Phylogeography , RNA, Viral/genetics , RNA, Viral/isolation & purification , Rwanda/epidemiology , SARS-CoV-2/isolation & purification , SARS-CoV-2/pathogenicity , Whole Genome Sequencing
ABSTRACT
The response of the global virus genomics community to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has been unprecedented, with significant advances made towards the 'real-time' generation and sharing of SARS-CoV-2 genomic data. The rapid growth in virus genome data production has necessitated the development of new analytical methods that can deal with orders of magnitude more genomes than previously available. Here, we present and describe Phylogenetic Assignment of Named Global Outbreak Lineages (pangolin), a computational tool that has been developed to assign the most likely lineage to a given SARS-CoV-2 genome sequence according to the Pango dynamic lineage nomenclature scheme. To date, nearly two million virus genomes have been submitted to the web-application implementation of pangolin, which has facilitated SARS-CoV-2 genomic epidemiology and provided researchers with access to actionable information about the pandemic's transmission lineages.
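As an illustration of how lineage assignments of this kind might be summarised downstream, the following minimal sketch tallies lineages from a CSV report. It assumes the report has `taxon` and `lineage` columns, as pangolin's lineage report is understood to; the sample names and the `count_lineages` helper are hypothetical and are not part of pangolin itself.

```python
import csv
import io
from collections import Counter


def count_lineages(report_csv: str) -> Counter:
    """Count how many sequences were assigned to each lineage.

    ASSUMPTION: the report is CSV text with a 'lineage' column.
    """
    reader = csv.DictReader(io.StringIO(report_csv))
    return Counter(row["lineage"] for row in reader)


# Hypothetical report content; real reports carry additional columns.
example_report = """taxon,lineage
sample_1,B.1.1.7
sample_2,B.1.351
sample_3,B.1.1.7
"""

lineage_counts = count_lineages(example_report)

# Print lineages from most to least frequent.
for lineage, n_sequences in lineage_counts.most_common():
    print(lineage, n_sequences)
```

Reading the report with `csv.DictReader` keeps the sketch independent of column order, so extra columns in a real report would not affect the tally.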
ABSTRACT
We present evidence for multiple independent origins of recombinant SARS-CoV-2 viruses sampled from late 2020 and early 2021 in the United Kingdom. Their genomes carry single-nucleotide polymorphisms and deletions that are characteristic of the B.1.1.7 variant of concern but lack the full complement of lineage-defining mutations. Instead, the remainder of their genomes share contiguous genetic variation with non-B.1.1.7 viruses circulating in the same geographic area at the same time as the recombinants. In four instances, there was evidence for onward transmission of a recombinant-origin virus, including one transmission cluster of 45 sequenced cases over the course of 2 months. The inferred genomic locations of recombination breakpoints suggest that every community-transmitted recombinant virus inherited its spike region from a B.1.1.7 parental virus, consistent with a transmission advantage for B.1.1.7's set of mutations.
Subject(s)
COVID-19/epidemiology , COVID-19/transmission , Pandemics , Recombination, Genetic , SARS-CoV-2/genetics , Base Sequence/genetics , COVID-19/virology , Computational Biology/methods , Gene Frequency , Genome, Viral , Genotype , Humans , Mutation , Phylogeny , Polymorphism, Single Nucleotide , United Kingdom/epidemiology , Whole Genome Sequencing/methods
ABSTRACT
An Addendum to this paper has been published: https://doi.org/10.1038/s41564-021-00872-5.