Your browser doesn't support javascript.
Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples.
Bassano, Irene; Ramachandran, Vinoy K; Khalifa, Mohammad S; Lilley, Chris J; Brown, Mathew R; van Aerle, Ronny; Denise, Hubert; Rowe, William; George, Airey; Cairns, Edward; Wierzbicki, Claudia; Pickwell, Natalie D; Carlile, Matthew; Holmes, Nadine; Payne, Alexander; Loose, Matthew; Burke, Terry A; Paterson, Steve; Wade, Matthew J; Grimsley, Jasmine M S.
  • Bassano I; Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
  • Ramachandran VK; Department of Infectious Disease, Imperial College London, London SW7 2AZ, UK.
  • Khalifa MS; Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
  • Lilley CJ; Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
  • Brown MR; Division of Biosciences, College of Health, Medicine and Life Sciences, Brunel University, London UB8 3PH, UK.
  • van Aerle R; Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
  • Denise H; School of Engineering, Newcastle University, Newcastle-upon-Tyne NE1 7RU, UK.
  • Rowe W; Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
  • George A; International Centre of Excellence for Aquatic Animal Health, Centre for Environment, Fisheries and Aquaculture Science (Cefas), Clyst Honiton EX5 2FN, UK.
  • Cairns E; Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
  • Wierzbicki C; Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
  • Pickwell ND; Centre for Genomic Research and NERC Environmental Omics Facility, Institute of Infection, Veterinary and Ecological Sciences (IVES), University of Liverpool, Liverpool L69 7ZB, UK.
  • Carlile M; Centre for Genomic Research and NERC Environmental Omics Facility, Institute of Infection, Veterinary and Ecological Sciences (IVES), University of Liverpool, Liverpool L69 7ZB, UK.
  • Holmes N; Centre for Genomic Research and NERC Environmental Omics Facility, Institute of Infection, Veterinary and Ecological Sciences (IVES), University of Liverpool, Liverpool L69 7ZB, UK.
  • Payne A; DeepSeq, Centre for Genetics and Genomics, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK.
  • Loose M; DeepSeq, Centre for Genetics and Genomics, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK.
  • Burke TA; DeepSeq, Centre for Genetics and Genomics, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK.
  • Paterson S; DeepSeq, Centre for Genetics and Genomics, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK.
  • Wade MJ; DeepSeq, Centre for Genetics and Genomics, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK.
  • Grimsley JMS; NERC Environmental Omics Facility, Ecology and Evolutionary Biology, School of Biosciences, University of Sheffield, Sheffield S10 2TN, UK.
Microb Genom ; 9(4)2023 04.
Article in English | MEDLINE | ID: covidwho-2291995
ABSTRACT
Wastewater-based epidemiology has been used extensively throughout the COVID-19 (coronavirus disease 19) pandemic to detect and monitor the spread and prevalence of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) and its variants. It has proven an excellent, complementary tool to clinical sequencing, supporting the insights gained and helping to make informed public-health decisions. Consequently, many groups globally have developed bioinformatics pipelines to analyse sequencing data from wastewater. Accurate calling of mutations is critical in this process and in the assignment of circulating variants; yet, to date, the performance of variant-calling algorithms in wastewater samples has not been investigated. To address this, we compared the performance of six variant callers (VarScan, iVar, GATK, FreeBayes, LoFreq and BCFtools), used widely in bioinformatics pipelines, on 19 synthetic samples with known ratios of three different SARS-CoV-2 variants of concern (VOCs) (Alpha, Beta and Delta), as well as 13 wastewater samples collected in London between the 15th and 18th December 2021. We used the fundamental parameters of recall (sensitivity) and precision (specificity) to confirm the presence of mutational profiles defining specific variants across the six variant callers. Our results show that BCFtools, FreeBayes and VarScan found the expected variants with higher precision and recall than GATK or iVar, although the latter identified more expected defining mutations than other callers. LoFreq gave the least reliable results due to the high number of false-positive mutations detected, resulting in lower precision. Similar results were obtained for both the synthetic and wastewater samples.
Subject(s)
Keywords

Full text: Available Collection: International databases Database: MEDLINE Main subject: SARS-CoV-2 / COVID-19 Type of study: Diagnostic study / Experimental Studies / Observational study / Prognostic study / Randomized controlled trials Topics: Variants Limits: Humans Language: English Year: 2023 Document Type: Article Affiliation country: Mgen.0.000933

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: SARS-CoV-2 / COVID-19 Type of study: Diagnostic study / Experimental Studies / Observational study / Prognostic study / Randomized controlled trials Topics: Variants Limits: Humans Language: English Year: 2023 Document Type: Article Affiliation country: Mgen.0.000933