ABSTRACT
With the rapid spread and evolution of SARS-CoV-2, the ability to monitor its transmission and distinguish among viral lineages is critical for pandemic response efforts. The most commonly used software for the lineage assignment of newly isolated SARS-CoV-2 genomes is pangolin, which offers two methods of assignment, pangoLEARN and pUShER. PangoLEARN rapidly assigns lineages using a machine-learning algorithm, while pUShER performs a phylogenetic placement to identify the lineage corresponding to a newly sequenced genome. In a preliminary study, we observed that pangoLEARN (decision tree model), while substantially faster than pUShER, offered less consistency across different versions of pangolin v3. Here, we expand upon this analysis to include v3 and v4 of pangolin, which moved the default algorithm for lineage assignment from pangoLEARN in v3 to pUShER in v4, and perform a thorough analysis confirming that pUShER is not only more stable across versions but also more accurate. Our findings suggest that future lineage assignment algorithms for various pathogens should consider the value of phylogenetic placement.
Subject(s)
COVID-19 , SARS-CoV-2 , Humans , SARS-CoV-2/genetics , New York City/epidemiology , COVID-19/epidemiologyABSTRACT
BACKGROUND: Comparing disease severity between SARS-CoV-2 variants among populations with varied vaccination and infection histories can help characterize emerging variants and support healthcare system preparedness. METHODS: We compared COVID-19 hospitalization risk among New York City residents with positive laboratory-based SARS-CoV-2 tests when ≥98% of sequencing results were Delta (August-November 2021) or Omicron (BA.1 and sublineages, January 2022). A secondary analysis defined variant exposure using patient-level sequencing results during July 2021-January 2022, comprising 1-18% of weekly confirmed cases. RESULTS: Hospitalization risk was lower among patients testing positive when Omicron (16,025/488,053, 3.3%) than when Delta predominated (8268/158,799, 5.2%). In multivariable analysis adjusting for demographic characteristics and prior diagnosis and vaccination status, patients testing positive when Omicron predominated, compared with Delta, had 0.72 (95% CI: 0.63, 0.82) times the hospitalization risk. In a secondary analysis of patients with sequencing results, hospitalization risk was similar among patients infected with Omicron (2042/29,866, 6.8%), compared with Delta (1780/25,272, 7.0%), and higher among the subset who received two mRNA vaccine doses (adjusted relative risk 1.64; 95% CI: 1.44, 1.87). CONCLUSIONS: Hospitalization risk was lower among patients testing positive when Omicron predominated, compared with Delta. This finding persisted after adjusting for prior diagnosis and vaccination status, suggesting intrinsic virologic properties, not population-based immunity, explained the lower severity. Secondary analyses demonstrated collider bias from the sequencing sampling frame changing over time in ways associated with disease severity. Representative data collection is necessary to avoid bias when comparing disease severity between previously dominant and newly emerging variants.
Subject(s)
COVID-19 , SARS-CoV-2 , Humans , SARS-CoV-2/genetics , COVID-19/diagnosis , COVID-19/epidemiology , New York City/epidemiology , HospitalizationABSTRACT
Recombination is an evolutionary process by which many pathogens generate diversity and acquire novel functions. Although a common occurrence during coronavirus replication, detection of recombination is only feasible when genetically distinct viruses contemporaneously infect the same host. Here, we identify an instance of SARS-CoV-2 superinfection, whereby an individual was infected with two distinct viral variants: Alpha (B.1.1.7) and Epsilon (B.1.429). This superinfection was first noted when an Alpha genome sequence failed to exhibit the classic S gene target failure behavior used to track this variant. Full genome sequencing from four independent extracts reveals that Alpha variant alleles comprise around 75% of the genomes, whereas the Epsilon variant alleles comprise around 20% of the sample. Further investigation reveals the presence of numerous recombinant haplotypes spanning the genome, specifically in the spike, nucleocapsid, and ORF 8 coding regions. These findings support the potential for recombination to reshape SARS-CoV-2 genetic diversity.