Your browser doesn't support javascript.
loading
PAPerFly: Partial Assembly-based Peak Finder for ab initio binding site reconstruction.
Faltejsková, Katerina; Vondrásek, Jirí.
Afiliación
  • Faltejsková K; Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo námestí 542/2, 160 00, Prague, Czech Republic. katerina.faltejskova@uochb.cas.cz.
  • Vondrásek J; Computer Science Institute, Faculty of Mathematics and Physics, Charles University, Malostranské námestí 25, 118 00, Prague, Czech Republic. katerina.faltejskova@uochb.cas.cz.
BMC Bioinformatics ; 24(1): 487, 2023 Dec 19.
Article en En | MEDLINE | ID: mdl-38114921
ABSTRACT

BACKGROUND:

The specific recognition of a DNA locus by a given transcription factor is a widely studied issue. It is generally agreed that the recognition can be influenced not only by the binding motif but by the larger context of the binding site. In this work, we present a novel heuristic algorithm that can reconstruct the unique binding sites captured in a sequencing experiment without using the reference genome.

RESULTS:

We present PAPerFly, the Partial Assembly-based Peak Finder, a tool for the binding site and binding context reconstruction from the sequencing data without any prior knowledge. This tool operates without the need to know the reference genome of the respective organism. We employ algorithmic approaches that are used during genome assembly. The proposed algorithm constructs a de Bruijn graph from the sequencing data. Based on this graph, sequences and their enrichment are reconstructed using a novel heuristic algorithm. The reconstructed sequences are aligned and the peaks in the sequence enrichment are identified. Our approach was tested by processing several ChIP-seq experiments available in the ENCODE database and comparing the results of Paperfly and standard methods.

CONCLUSIONS:

We show that PAPerFly, an algorithm tailored for experiment analysis without the reference genome, yields better results than an aggregation of ChIP-seq agnostic tools. Our tool is freely available at https//github.com/Caeph/paperfly/ or on Zenodo ( https//doi.org/10.5281/zenodo.7116424 ).
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Factores de Transcripción / Algoritmos Idioma: En Revista: BMC Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: República Checa

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Factores de Transcripción / Algoritmos Idioma: En Revista: BMC Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: República Checa