Search | BVS Regional Portal (test)

Patterns of transcription factor binding and epigenome at promoters allow interpretable predictability of multiple functions of non-coding and coding genes.

Chandra, Omkar; Sharma, Madhu; Pandey, Neetesh; Jha, Indra Prakash; Mishra, Shreya; Kong, Say Li; Kumar, Vibhor.

Comput Struct Biotechnol J ; 21: 3590-3603, 2023.

Article in English | MEDLINE | ID: mdl-37520281

ABSTRACT

Understanding the biological roles of all genes only through experimental methods is challenging. A computational approach with reliable interpretability is needed to infer the function of genes, particularly for non-coding RNAs. We have analyzed genomic features that are present across both coding and non-coding genes like transcription factor (TF) and cofactor ChIP-seq (n = 823), histone modifications ChIP-seq (n = 621), cap analysis gene expression (CAGE) tags (n = 255), and DNase hypersensitivity profiles (n = 255) to predict ontology-based functions of genes. Our approach for gene function prediction was reliable (>90% balanced accuracy) for 486 gene-sets. PubMed abstract mining and CRISPR screens supported the inferred association of genes with biological functions, for which our method had high accuracy. Further analysis revealed that TF-binding patterns at promoters have high predictive strength for multiple functions. TF-binding patterns at the promoter add an unexplored dimension of explainable regulatory aspects of genes and their functions. Therefore, we performed a comprehensive analysis for the functional-specificity of TF-binding patterns at promoters and used them for clustering functions to reveal many latent groups of gene-sets involved in common major cellular processes. We also showed how our approach could be used to infer the functions of non-coding genes using the CRISPR screens of coding genes, which were validated using a long non-coding RNA CRISPR screen. Thus our results demonstrated the generality of our approach by using gene-sets from CRISPR screens. Overall, our approach opens an avenue for predicting the involvement of non-coding genes in various functions.
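
The abstract reports reliability as balanced accuracy, which may be unfamiliar next to plain accuracy. The following is a minimal sketch of the standard binary definition (the mean of sensitivity and specificity), not the authors' evaluation code; the function name `balanced_accuracy` and the toy labels are assumptions made for illustration only.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels (1 = in gene-set, 0 = not).

    Hypothetical helper for illustration; the paper's exact evaluation
    protocol is not described in this record.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    positives = tp + fn
    negatives = tn + fp
    if positives == 0 or negatives == 0:
        raise ValueError("both classes must be present in y_true")
    sensitivity = tp / positives
    specificity = tn / negatives
    return 0.5 * (sensitivity + specificity)


# Toy, made-up labels: 2 genes in the set, 8 outside it.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 0]
score = balanced_accuracy(y_true, y_pred)
```

On these toy labels plain accuracy is 0.9, while balanced accuracy is 0.75 because one of the two positives is missed. This is why the metric suits gene-sets that are small relative to the genome.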

Matching queried single-cell open-chromatin profiles to large pools of single-cell transcriptomes and epigenomes for reference supported analysis.

Mishra, Shreya; Pandey, Neetesh; Chawla, Smriti; Sharma, Madhu; Chandra, Omkar; Jha, Indra Prakash; SenGupta, Debarka; Natarajan, Kedar Nath; Kumar, Vibhor.

Genome Res ; 33(2): 218-231, 2023 02.

Article in English | MEDLINE | ID: mdl-36653120

ABSTRACT

The true benefits of large single-cell transcriptome and epigenome data sets can be realized only with the development of new approaches and search tools for annotating individual cells. Matching a single-cell epigenome profile to a large pool of reference cells remains a major challenge. Here, we present scEpiSearch, which enables searching, comparison, and independent classification of single-cell open-chromatin profiles against a large reference of single-cell expression and open-chromatin data sets. Across performance benchmarks, scEpiSearch outperformed multiple methods in accuracy of search and low-dimensional coembedding of single-cell profiles, irrespective of platforms and species. Here we also demonstrate the unconventional utilities of scEpiSearch by applying it on single-cell epigenome profiles of K562 cells and samples from patients with acute leukaemia to reveal different aspects of their heterogeneity, multipotent behavior, and dedifferentiated states. Applying scEpiSearch on our single-cell open-chromatin profiles from embryonic stem cells (ESCs), we identified ESC subpopulations with more activity and poising for endoplasmic reticulum stress and unfolded protein response. Thus, scEpiSearch solves the nontrivial problem of amalgamating information from a large pool of single cells to identify and study the regulatory states of cells using their single-cell epigenomes.

Subjects

Chromatin, Transcriptome, Humans, Chromatin/metabolism, Epigenome, Embryonic Stem Cells/metabolism, Single-Cell Analysis

Associating pathways with diseases using single-cell expression profiles and making inferences about potential drugs.

Sharma, Madhu; Jha, Indra Prakash; Chawla, Smriti; Pandey, Neetesh; Chandra, Omkar; Mishra, Shreya; Kumar, Vibhor.

Brief Bioinform ; 23(4), 2022 07 18.

Article in English | MEDLINE | ID: mdl-35772850

ABSTRACT

Finding direct dependencies between genetic pathways and diseases has been the target of multiple studies as it has many applications. However, due to cellular heterogeneity and limitations of the number of samples for bulk expression profiles, such studies have faced hurdles in the past. Here, we propose a method to perform single-cell expression-based inference of association between pathway, disease and cell-type (sci-PDC), which can help to understand their cause and effect and guide precision therapy. Our approach highlighted reliable relationships between a few diseases and pathways. Using the example of diabetes, we have demonstrated how sci-PDC helps in tracking variation of association between pathways and diseases with changes in age and species. The variation in pathways-disease associations in mice and humans revealed critical facts about the suitability of the mouse model for a few pathways in the context of diabetes. The coherence between results from our method and previous reports, including information about the drug target pathways, highlights its reliability for multidimensional utility.

Subjects

Disease, Genetic Profile, Animals, Disease/genetics, Humans, Mice
