PathoLive-Real-Time Pathogen Identification from Metagenomic Illumina Datasets.
Tausch, Simon H; Loka, Tobias P; Schulze, Jakob M; Andrusch, Andreas; Klenner, Jeanette; Dabrowski, Piotr Wojciech; Lindner, Martin S; Nitsche, Andreas; Renard, Bernhard Y.
  • Tausch SH; National Study Centre for Sequencing in Risk Assessment, Department Biological Safety, German Federal Institute for Risk Assessment, 10589 Berlin, Germany.
  • Loka TP; Bioinformatics Division (MF 1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353 Berlin, Germany.
  • Schulze JM; Centre for Biological Threats and Special Pathogens, Highly Pathogenic Viruses (ZBS 1), 13353 Berlin, Germany.
  • Andrusch A; Bioinformatics Division (MF 1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353 Berlin, Germany.
  • Klenner J; Digital Engineering Faculty, Hasso Plattner Institute, University of Potsdam, 14482 Potsdam, Germany.
  • Dabrowski PW; Bioinformatics Division (MF 1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353 Berlin, Germany.
  • Lindner MS; Bioinformatics Division (MF 1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353 Berlin, Germany.
  • Nitsche A; Centre for Biological Threats and Special Pathogens, Highly Pathogenic Viruses (ZBS 1), 13353 Berlin, Germany.
  • Renard BY; Centre for Biological Threats and Special Pathogens, Highly Pathogenic Viruses (ZBS 1), 13353 Berlin, Germany.
Life (Basel); 12(9), 2022 Aug 30.
Article in English | MEDLINE | ID: covidwho-2006125
ABSTRACT
Over the past years, NGS has become a crucial workhorse for open-view pathogen diagnostics. Yet, long turnaround times result from using massively parallel high-throughput technologies as the analysis can only be performed after sequencing has finished. The interpretation of results can further be challenged by contaminations, clinically irrelevant sequences, and the sheer amount and complexity of the data. We implemented PathoLive, a real-time diagnostics pipeline for the detection of pathogens from clinical samples hours before sequencing has finished. Based on real-time alignment with HiLive2, mappings are scored with respect to common contaminations, low-entropy areas, and sequences of widespread, non-pathogenic organisms. The results are visualized using an interactive taxonomic tree that provides an easily interpretable overview of the relevance of hits. For a human plasma sample that was spiked in vitro with six pathogenic viruses, all agents were clearly detected after only 40 of 200 sequencing cycles. For a real-world sample from Sudan, the results correctly indicated the presence of Crimean-Congo hemorrhagic fever virus. In a second real-world dataset from the 2019 SARS-CoV-2 outbreak in Wuhan, we found the presence of a SARS coronavirus as the most relevant hit without the novel virus reference genome being included in the database. For all samples, clinically irrelevant hits were correctly de-emphasized. Our approach is valuable to obtain fast and accurate NGS-based pathogen identifications and correctly prioritize and visualize them based on their clinical significance. PathoLive is open source and available on GitLab and BioConda.
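
The abstract mentions that mappings are scored with respect to low-entropy areas. As a rough illustration of that idea only, the sketch below computes the Shannon entropy of a sequence's k-mer distribution and flags sequences under a cutoff. This is not PathoLive's actual scoring: the function names `shannon_entropy` and `is_low_entropy`, the parameter `k`, and the threshold of 1.0 bit are all assumptions made for this example.

```python
import math
from collections import Counter


def shannon_entropy(seq: str, k: int = 1) -> float:
    """Shannon entropy (in bits) of the k-mer distribution of `seq`.

    Hypothetical helper for illustration; not taken from PathoLive.
    """
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    if not kmers:
        return 0.0
    counts = Counter(kmers)
    total = len(kmers)
    # H = -sum(p * log2(p)) over the observed k-mers
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def is_low_entropy(seq: str, threshold: float = 1.0, k: int = 1) -> bool:
    """Flag a sequence whose entropy falls below an assumed cutoff."""
    return shannon_entropy(seq, k) < threshold


if __name__ == "__main__":
    for read in ("AAAAAAAAAAAAAAAA", "ACGTACGTACGTACGT"):
        print(read, round(shannon_entropy(read), 3), is_low_entropy(read))
```

A homopolymer run carries no information per base and would be flagged here, while a read using all four bases evenly reaches the 2-bit maximum for single nucleotides. A real pipeline would likely evaluate entropy over windows of the reference rather than whole reads.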
Full text: Available Collection: International databases Database: MEDLINE Type of study: Prognostic study Language: English Year: 2022 Document Type: Article Affiliation country: Life12091345
