Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Front Microbiol ; 14: 1267695, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37869681

RESUMO

Identification of plasmids from sequencing data is an important and challenging problem related to antimicrobial resistance spread and other One-Health issues. We provide a new architecture for identifying plasmid contigs in fragmented genome assemblies built from short-read data. We employ graph neural networks (GNNs) and the assembly graph to propagate the information from nearby nodes, which leads to more accurate classification, especially for short contigs that are difficult to classify based on sequence features or database searches alone. We trained plASgraph2 on a data set of samples from the ESKAPEE group of pathogens. plASgraph2 either outperforms or performs on par with a wide range of state-of-the-art methods on testing sets of independent ESKAPEE samples and samples from related pathogens. On one hand, our study provides a new accurate and easy to use tool for contig classification in bacterial isolates; on the other hand, it serves as a proof-of-concept for the use of GNNs in genomics. Our software is available at https://github.com/cchauve/plasgraph2 and the training and testing data sets are available at https://github.com/fmfi-compbio/plasgraph2-datasets.

2.
Nat Commun ; 12(1): 6549, 2021 11 12.
Artigo em Inglês | MEDLINE | ID: mdl-34772949

RESUMO

Understanding gene expression will require understanding where regulatory factors bind genomic DNA. The frequently used sequence-based motifs of protein-DNA binding are not predictive, since a genome contains many more binding sites than are actually bound and transcription factors of the same family share similar DNA-binding motifs. Traditionally, these motifs only depict sequence but neglect DNA shape. Since shape may contribute non-linearly and combinational to binding, machine learning approaches ought to be able to better predict transcription factor binding. Here we show that a random forest machine learning approach, which incorporates the 3D-shape of DNA, enhances binding prediction for all 216 tested Arabidopsis thaliana transcription factors and improves the resolution of differential binding by transcription factor family members which share the same binding motif. We observed that DNA shape features were individually weighted for each transcription factor, even if they shared the same binding sequence.


Assuntos
Arabidopsis/metabolismo , DNA/metabolismo , Fatores de Transcrição/metabolismo , Arabidopsis/genética , Sítios de Ligação , Biologia Computacional , DNA/genética , Ligação Proteica , Fatores de Transcrição/genética
3.
Genes (Basel) ; 11(12)2020 11 28.
Artigo em Inglês | MEDLINE | ID: mdl-33260757

RESUMO

Cardiovascular diseases are the number one cause of morbidity and mortality worldwide, but the underlying molecular mechanisms remain not well understood. Cardiomyopathies are primary diseases of the heart muscle and contribute to high rates of heart failure and sudden cardiac deaths. Here, we distinguished four different genetic cardiomyopathies based on gene expression signatures. In this study, RNA-Sequencing was used to identify gene expression signatures in myocardial tissue of cardiomyopathy patients in comparison to non-failing human hearts. Therefore, expression differences between patients with specific affected genes, namely LMNA (lamin A/C), RBM20 (RNA binding motif protein 20), TTN (titin) and PKP2 (plakophilin 2) were investigated. We identified genotype-specific differences in regulated pathways, Gene Ontology (GO) terms as well as gene groups like secreted or regulatory proteins and potential candidate drug targets revealing specific molecular pathomechanisms for the four subtypes of genetic cardiomyopathies. Some regulated pathways are common between patients with mutations in RBM20 and TTN as the splice factor RBM20 targets amongst other genes TTN, leading to a similar response on pathway level, even though many differentially expressed genes (DEGs) still differ between both sample types. The myocardium of patients with mutations in LMNA is widely associated with upregulated genes/pathways involved in immune response, whereas mutations in PKP2 lead to a downregulation of genes of the extracellular matrix. Our results contribute to further understanding of the underlying molecular pathomechanisms aiming for novel and better treatment of genetic cardiomyopathies.


Assuntos
Cardiomiopatias , Predisposição Genética para Doença , Proteínas Musculares , Mutação , Miocárdio/metabolismo , Transcriptoma , Adulto , Idoso , Cardiomiopatias/genética , Cardiomiopatias/metabolismo , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Proteínas Musculares/biossíntese , Proteínas Musculares/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...