Your browser doesn't support javascript.
loading
Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies
Billy T Lau; Dmitri Pavlichin; Anna C Hooker; Alison F Almeda; GiWon Shin; Jiamin Chen; Malaya K Sahoo; ChunHong Huang; Benjamin A Pinsky; HoJoon Lee; Hanlee Ji.
Affiliation
  • Billy T Lau; Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, United States
  • Dmitri Pavlichin; Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, United States
  • Anna C Hooker; Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, United States
  • Alison F Almeda; Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, United States
  • GiWon Shin; Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, United States
  • Jiamin Chen; Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, United States
  • Malaya K Sahoo; Department of Pathology, Stanford University School of Medicine, Stanford, CA, 94305, United States
  • ChunHong Huang; Department of Pathology, Stanford University School of Medicine, Stanford, CA, 94305, United States
  • Benjamin A Pinsky; Department of Pathology, Stanford University School of Medicine, Stanford, CA, 94305, United States
  • HoJoon Lee; Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, United States
  • Hanlee Ji; Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, United States
Preprint in English | medRxiv | ID: ppmedrxiv-20224816
Journal article
A scientific journal published article is available and is probably based on this preprint. It has been identified through a machine matching algorithm, human confirmation is still pending.
See journal article
ABSTRACT
BackgroundThe genome of SARS-CoV-2 is susceptible to mutations during viral replication due to the errors generated by RNA-dependent RNA polymerases. These mutations enable the SARS-CoV-2 to evolve into new strains. Viral quasispecies emerge from de novo mutations that occur in individual patients. In combination, these sets of viral mutations provide distinct genetic fingerprints that reveal the patterns of transmission and have utility in contract tracing. MethodsLeveraging thousands of sequenced SARS-CoV-2 genomes, we performed a viral pangenome analysis to identify conserved genomic sequences. We used a rapid and highly efficient computational approach that relies on k-mers, short tracts of sequence, instead of conventional sequence alignment. Using this method, we annotated viral mutation signatures that were associated with specific strains. Based on these highly conserved viral sequences, we developed a rapid and highly scalable targeted sequencing assay to identify mutations, detect quasispecies and identify mutation signatures from patients. These results were compared to the pangenome genetic fingerprints. ResultsWe built a k-mer index for thousands of SARS-CoV-2 genomes and identified conserved genomics regions and landscape of mutations across thousands of virus genomes. We delineated mutation profiles spanning common genetic fingerprints (the combination of mutations in a viral assembly) and rare ones that occur in only small fraction of patients. We developed a targeted sequencing assay by selecting primers from the conserved viral genome regions to flank frequent mutations. Using a cohort of SARS-CoV-2 clinical samples, we identified genetic fingerprints consisting of strain-specific mutations seen across populations and de novo quasispecies mutations localized to individual infections. We compared the mutation profiles of viral samples undergoing analysis with the features of the pangenome. ConclusionsWe conducted an analysis for viral mutation profiles that provide the basis of genetic fingerprints. Our study linked pangenome analysis with targeted deep sequenced SARS-CoV-2 clinical samples. We identified quasispecies mutations occurring within individual patients, mutations demarcating dominant species and the prevalence of mutation signatures, of which a significant number were relatively unique. Analysis of these genetic fingerprints may provide a way of conducting molecular contact tracing.
License
cc_by_nc_nd
Full text: Available Collection: Preprints Database: medRxiv Type of study: Cohort_studies / Observational study / Prognostic study Language: English Year: 2020 Document type: Preprint
Full text: Available Collection: Preprints Database: medRxiv Type of study: Cohort_studies / Observational study / Prognostic study Language: English Year: 2020 Document type: Preprint
...