Search | BVS Regional Portal

1.

Inference of Phylogenetic Networks from Sequence Data using Composite Likelihood.

Kong, Sungsik; Swofford, David L; Kubatko, Laura S.

Syst Biol ; 2024 Oct 10.

Article in English | MEDLINE | ID: mdl-39387633

ABSTRACT

While phylogenies have been essential in understanding how species evolve, they do not adequately describe some evolutionary processes. For instance, hybridization, a common phenomenon where interbreeding between two species leads to formation of a new species, must be depicted by a phylogenetic network, a structure that modifies a phylogenetic tree by allowing two branches to merge into one, resulting in reticulation. However, existing methods for estimating networks become computationally expensive as the dataset size and/or topological complexity increase. The lack of methods for scalable inference hampers phylogenetic networks from being widely used in practice, despite accumulating evidence that hybridization occurs frequently in nature. Here, we propose a novel method, PhyNEST (Phylogenetic Network Estimation using SiTe patterns), that estimates binary, level-1 phylogenetic networks with a fixed, user-specified number of reticulations directly from sequence data. By using the composite likelihood as the basis for inference, PhyNEST is able to use the full genomic data in a computationally tractable manner, eliminating the need to summarize the data as a set of gene trees prior to network estimation. To search network space, PhyNEST implements both hill climbing and simulated annealing algorithms. PhyNEST assumes that the data are composed of coalescent independent sites that evolve according to the Jukes-Cantor substitution model and that the network has a constant effective population size. Simulation studies demonstrate that PhyNEST is often more accurate than two existing composite likelihood summary methods (SNaQ and PhyloNet) and that it is robust to at least one form of model misspecification (assuming a less complex nucleotide substitution model than the true generating model). 
We applied PhyNEST to reconstruct the evolutionary relationships among Heliconius butterflies and Papionini primates, characterized by hybrid speciation and widespread introgression, respectively. PhyNEST is implemented in an open-source Julia package and is publicly available at https://github.com/sungsik-kong/PhyNEST.jl.

2.

GRNMOPT: Inference of gene regulatory networks based on a multi-objective optimization approach.

Dong, Heng; Ma, Baoshan; Meng, Yangyang; Wu, Yiming; Liu, Yongjing; Zeng, Tao; Huang, Jinyan.

Comput Biol Chem ; 113: 108223, 2024 Sep 23.

Article in English | MEDLINE | ID: mdl-39340962

ABSTRACT

BACKGROUND AND OBJECTIVE: The reconstruction of gene regulatory networks (GRNs) stands as a vital approach in deciphering complex biological processes. The application of nonlinear ordinary differential equations (ODEs) models has demonstrated considerable efficacy in predicting GRNs. Notably, the decay rate and time delay are pivotal in authentic gene regulation, yet their systematic determination in ODEs models remains underexplored. The development of a comprehensive optimization framework for the effective estimation of these key parameters is essential for accurate GRN inference. METHOD: This study introduces GRNMOPT, an innovative methodology for inferring GRNs from time-series and steady-state data. GRNMOPT employs a combined use of decay rate and time delay in constructing ODEs models to authentically represent gene regulatory processes. It incorporates a multi-objective optimization approach, optimizing decay rate and time delay concurrently to derive Pareto optimal sets for these factors, thereby maximizing accuracy metrics such as AUROC (Area Under the Receiver Operating Characteristic curve) and AUPR (Area Under the Precision-Recall curve). Additionally, the use of XGBoost for calculating feature importance aids in identifying potential regulatory gene links. RESULTS: Comprehensive experimental evaluations on two simulated datasets from DREAM4 and three real gene expression datasets (Yeast, In vivo Reverse-engineering and Modeling Assessment [IRMA], and Escherichia coli [E. coli]) reveal that GRNMOPT performs commendably across varying network scales. Furthermore, cross-validation experiments substantiate the robustness of GRNMOPT. CONCLUSION: We propose a novel approach called GRNMOPT to infer GRNs based on a multi-objective optimization framework, which effectively improves inference accuracy and provides a powerful tool for GRNs inference.
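
As a rough illustration of the "Pareto optimal sets" mentioned in the GRNMOPT abstract above, the sketch below keeps only the parameter settings whose (AUROC, AUPR) pair is not dominated by another setting. The function name `pareto_front` and the scores are hypothetical and are not taken from GRNMOPT, which may select its Pareto set differently.

```python
from typing import List, Tuple

Score = Tuple[float, float]  # (AUROC, AUPR), both to be maximized


def pareto_front(points: List[Score]) -> List[Score]:
    """Return the points that no other point dominates.

    q dominates p when q is at least as good on both objectives
    and strictly better on at least one.
    """
    front = []
    for p in points:
        dominated = any(
            q[0] >= p[0] and q[1] >= p[1] and (q[0] > p[0] or q[1] > p[1])
            for q in points
        )
        if not dominated:
            front.append(p)
    return front


# Hypothetical scores for four made-up (decay rate, time delay) settings.
scores = [(0.70, 0.30), (0.75, 0.25), (0.72, 0.32), (0.68, 0.20)]
front = pareto_front(scores)
print(front)
```

The quadratic scan is adequate for a small grid of candidate settings; a larger search would normally use a dedicated multi-objective optimizer.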

3.

Inference of single-cell network using mutual information for scRNA-seq data analysis.

Chang, Lan-Yun; Hao, Ting-Yi; Wang, Wei-Jie; Lin, Chun-Yu.

BMC Bioinformatics ; 25(Suppl 2): 292, 2024 Sep 05.

Article in English | MEDLINE | ID: mdl-39237886

ABSTRACT

BACKGROUND: With the advance in single-cell RNA sequencing (scRNA-seq) technology, deriving inherent biological system information from expression profiles at a single-cell resolution has become possible. It has been known that network modeling by estimating the associations between genes could better reveal dynamic changes in biological systems. However, accurately constructing a single-cell network (SCN) to capture the network architecture of each cell and further explore cell-to-cell heterogeneity remains challenging. RESULTS: We introduce SINUM, a method for constructing the SIngle-cell Network Using Mutual information, which estimates mutual information between any two genes from scRNA-seq data to determine whether they are dependent or independent in a specific cell. Experiments on various scRNA-seq datasets with different cell numbers based on eight performance indexes (e.g., adjusted Rand index and F-measure index) validated the accuracy and robustness of SINUM in cell type identification, superior to the state-of-the-art SCN inference method. Additionally, the SINUM SCNs exhibit high overlap with the human interactome and possess the scale-free property. CONCLUSIONS: SINUM presents a view of biological systems at the network level to detect cell-type marker genes/gene pairs and investigate time-dependent changes in gene associations during embryo development. Code for SINUM is freely available at https://github.com/SysMednet/SINUM.

Subjects

Single-Cell Analysis , Single-Cell Analysis/methods , Humans , Sequence Analysis, RNA/methods , Gene Regulatory Networks , RNA-Seq/methods , Algorithms , Gene Expression Profiling/methods , Single-Cell Gene Expression Analysis
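
The mutual-information quantity that the SINUM abstract above relies on can be illustrated with a generic plug-in estimator over discretized expression values. This is only a sketch under stated assumptions: the function `mutual_information` and the toy vectors are hypothetical, and the estimator works across a set of cells rather than reproducing SINUM's cell-specific procedure.

```python
import math
from collections import Counter
from typing import Sequence


def mutual_information(x: Sequence[int], y: Sequence[int]) -> float:
    """Plug-in mutual information (in bits) between two discrete sequences."""
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("x and y must be non-empty and of equal length")
    n = len(x)
    count_x = Counter(x)
    count_y = Counter(y)
    count_xy = Counter(zip(x, y))
    mi = 0.0
    for (a, b), c in count_xy.items():
        # p(a,b) * log2( p(a,b) / (p(a) * p(b)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (count_x[a] * count_y[b]))
    return mi


# Toy binarized expression (0 = low, 1 = high) of three made-up genes in four cells.
gene_a = [0, 0, 1, 1]
gene_b = [0, 0, 1, 1]  # identical to gene_a: fully dependent
gene_c = [0, 1, 0, 1]  # independent of gene_a in this sample

print(mutual_information(gene_a, gene_b))
print(mutual_information(gene_a, gene_c))
```

A value near zero suggests independence in the sample, while larger values suggest dependence; with few cells the plug-in estimate is biased upward, so real analyses need a significance threshold.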

4.

Transcriptomic Module Discovery of Diarrhea-Predominant Irritable Bowel Syndrome: A Causal Network Inference Approach.

Guido, Davide; Maqoud, Fatima; Aloisio, Michelangelo; Mallardi, Domenica; Ura, Blendi; Gualandi, Nicolò; Cocca, Massimiliano; Russo, Francesco.

Int J Mol Sci ; 25(17), 2024 Aug 28.

Article in English | MEDLINE | ID: mdl-39273274

ABSTRACT

Irritable bowel syndrome with diarrhea (IBS-D) is the most prevalent subtype of IBS, characterized by chronic gastrointestinal symptoms in the absence of identifiable pathological findings. This study aims to investigate the molecular mechanisms underlying IBS-D using transcriptomic data. By employing causal network inference methods, we identify key transcriptomic modules associated with IBS-D. Utilizing data from public databases and applying advanced computational techniques, we uncover potential biomarkers and therapeutic targets. Our analysis reveals significant molecular alterations that affect cellular functions, offering new insights into the complex pathophysiology of IBS-D. These findings enhance our understanding of the disease and may foster the development of more effective treatments.

Subjects

Diarrhea , Gene Regulatory Networks , Irritable Bowel Syndrome , Transcriptome , Irritable Bowel Syndrome/genetics , Irritable Bowel Syndrome/metabolism , Humans , Diarrhea/genetics , Gene Expression Profiling/methods , Computational Biology/methods , Biomarkers

5.

Transcriptome data are insufficient to control false discoveries in regulatory network inference.

Kernfeld, Eric; Keener, Rebecca; Cahan, Patrick; Battle, Alexis.

Cell Syst ; 15(8): 709-724.e13, 2024 Aug 21.

Article in English | MEDLINE | ID: mdl-39173585

ABSTRACT

Inference of causal transcriptional regulatory networks (TRNs) from transcriptomic data suffers notoriously from false positives. Approaches to control the false discovery rate (FDR), for example, via permutation, bootstrapping, or multivariate Gaussian distributions, suffer from several complications: difficulty in distinguishing direct from indirect regulation, nonlinear effects, and causal structure inference requiring "causal sufficiency," meaning experiments that are free of any unmeasured, confounding variables. Here, we use a recently developed statistical framework, model-X knockoffs, to control the FDR while accounting for indirect effects, nonlinear dose-response, and user-provided covariates. We adjust the procedure to estimate the FDR correctly even when measured against incomplete gold standards. However, benchmarking against chromatin immunoprecipitation (ChIP) and other gold standards reveals higher observed than reported FDR. This indicates that unmeasured confounding is a major driver of FDR in TRN inference. A record of this paper's transparent peer review process is included in the supplemental information.

Subjects

Gene Regulatory Networks , Transcriptome , Gene Regulatory Networks/genetics , Transcriptome/genetics , Humans , Chromatin Immunoprecipitation/methods , Gene Expression Profiling/methods

6.

TopoDoE: a design of experiment strategy for selection and refinement in ensembles of executable gene regulatory networks.

Bouvier, Matteo; Zreika, Souad; Vallin, Elodie; Fourneaux, Camille; Gonin-Giraud, Sandrine; Bonnaffoux, Arnaud; Gandrillon, Olivier.

BMC Bioinformatics ; 25(1): 245, 2024 Jul 19.

Article in English | MEDLINE | ID: mdl-39030497

ABSTRACT

BACKGROUND: Inference of Gene Regulatory Networks (GRNs) is a difficult and long-standing question in Systems Biology. Numerous approaches have been proposed with the latest methods exploring the richness of single-cell data. One of the current difficulties lies in the fact that many methods of GRN inference do not result in one proposed GRN but in a collection of plausible networks that need to be further refined. In this work, we present a Design of Experiment strategy to use as a second stage after the inference process. It is specifically fitted for identifying the next most informative experiment to perform for deciding between multiple network topologies, in the case where proposed GRNs are executable models. This strategy first performs a topological analysis to reduce the number of perturbations that need to be tested, then predicts the outcome of the retained perturbations by simulation of the GRNs and finally compares predictions with novel experimental data. RESULTS: We apply this method to the results of our divide-and-conquer algorithm called WASABI, adapt its gene expression model to produce perturbations and compare our predictions with experimental results. We show that our networks were able to produce in silico predictions on the outcome of a gene knock-out, which were qualitatively validated for 48 out of 49 genes. Finally, we eliminate as many as two thirds of the candidate networks for which we could identify an incorrect topology, thus greatly improving the accuracy of our predictions. CONCLUSION: These results both confirm the inference accuracy of WASABI and show how executable gene expression models can be leveraged to further refine the topology of inferred GRNs. We hope this strategy will help systems biologists further explore their data and encourage the development of more executable GRN models.

Subjects

Algorithms , Gene Regulatory Networks , Gene Regulatory Networks/genetics , Systems Biology/methods , Computational Biology/methods , Computer Simulation , Models, Genetic

7.

Ensemble learning algorithms to elucidate the core microbiome's impact on carbon content and degradation properties at the soil aggregate level.

Zhou, Fengwu; Jiang, Yunbin; Han, Cheng; Deng, Huan; Dai, Zongren; Wang, Zimeng; Zhong, Wenhui.

Sci Total Environ ; 946: 174528, 2024 Oct 10.

Article in English | MEDLINE | ID: mdl-38971243

ABSTRACT

Soil aggregates are crucial for soil organic carbon (OC) accumulation. This study, utilizing a 32-year fertilization experiment, investigates whether the core microbiome can elucidate variations in carbon content and decomposition across different aggregate sizes more effectively than broader bacterial and fungal community analyses. Employing ensemble learning algorithms that integrate machine learning with network inference, we found that the core microbiome accounts for an average increase of 26 % and 20 % in the explained variance of PCoA and Adonis analyses, respectively, in response to fertilization. Compared to the control, inorganic and organic fertilizers decreased the decomposition index (DDI) by 31 % and 38 %, respectively. The fungal core microbiome predominantly influenced OC content and DDI in larger macroaggregates (>2000 µm), explaining over 35 % of the variance, while the bacterial core microbiome had a lesser impact, explaining <30 %. Conversely, in smaller aggregates (<2000 µm), the bacterial core microbiome significantly influenced DDI (R² > 0.2), and the fungal core microbiome more strongly affected OC content (R² > 0.3). Mantel tests showed that pH is the most significant environmental factor affecting core microbiome composition across all aggregate sizes (Mantel's r > 0.8, P < 0.01). Linear correlation analysis further confirmed that the core microbiome's community structure could accurately predict OC content and DDI in aggregates (R² > 0.8, P < 0.05). Overall, our findings suggested that the core microbiome provides deeper insights into the variability of aggregate organic carbon content and decomposition, with the bacterial core microbiome playing a particularly pivotal role within the soil aggregates.

Subjects

Carbon , Machine Learning , Microbiota , Soil Microbiology , Soil , Carbon/metabolism , Carbon/analysis , Soil/chemistry , Algorithms , Fungi/metabolism , Bacteria/metabolism , Fertilizers

8.

Network analysis of gut microbial communities reveal key genera for a multiple sclerosis cohort with Mycobacterium avium subspecies paratuberculosis infection.

Ashraf, Hajra; Dikarlo, Plamena; Masia, Aurora; Zarbo, Ignazio R; Solla, Paolo; Ijaz, Umer Zeeshan; Sechi, Leonardo A.

Gut Pathog ; 16(1): 37, 2024 Jul 10.

Article in English | MEDLINE | ID: mdl-38987816

ABSTRACT

BACKGROUND: In gut ecosystems, there is a complex interplay of biotic and abiotic interactions that decide the overall fitness of an individual. Divulging the microbe-microbe and microbe-host interactions may lead to better strategies in disease management, as microbes rarely act in isolation. Network inference for microbial communities is often a challenging task limited by both analytical assumptions as well as experimental approaches. Even after the network topologies are obtained, identification of important nodes within the context of underlying disease aetiology remains a convoluted task. We therefore present a network perspective on complex interactions in gut microbial profiles of individuals who have multiple sclerosis with and without Mycobacterium avium subspecies paratuberculosis (MAP) infection. Our exposé is guided by recent advancements in network-wide statistical measures that identify the keystone nodes. We have utilised several centrality measures, including a recently published metric, Integrated View of Influence (IVI), that is robust against biases. RESULTS: The ecological networks were generated on microbial abundance data (n = 69 samples) utilising 16S rRNA amplification. Using SPIEC-EASI, a sparse inverse covariance estimation approach, we have obtained networks separately for MAP positive (+), MAP negative (-) and healthy controls (as a baseline). Using the IVI metric, we identified the top 20 keystone nodes and regressed them against covariates of interest using a generalised linear latent variable model. Our analyses suggest Eisenbergiella to be of pivotal importance in MS irrespective of MAP infection. For the MAP+ cohort, Pyramidobacter and Peptoclostridium were predominantly the most influential genera, also hinting at an infection model similar to those observed in Inflammatory Bowel Diseases (IBDs).
In the MAP- cohort, on the other hand, the Coprostanoligenes group, which reduces cholesterol and supports the intestinal barrier, was the most influential genus. CONCLUSIONS: The identification of keystone nodes, their co-occurrences, and associations with the exposome (meta data) advances our understanding of biological interactions through which MAP infection shapes the microbiome in MS individuals, suggesting the link to the inflammatory process of IBDs. The associations presented in this study may lead to development of improved diagnostics and effective vaccines for the management of the disease.

9.

Analysis of Microbiome Data.

Peterson, Christine B; Saha, Satabdi; Do, Kim-Anh.

Annu Rev Stat Appl ; 11(1): 483-504, 2024 Apr.

Article in English | MEDLINE | ID: mdl-38962089

ABSTRACT

The microbiome represents a hidden world of tiny organisms populating not only our surroundings but also our own bodies. By enabling comprehensive profiling of these invisible creatures, modern genomic sequencing tools have given us an unprecedented ability to characterize these populations and uncover their outsize impact on our environment and health. Statistical analysis of microbiome data is critical to infer patterns from the observed abundances. The application and development of analytical methods in this area require careful consideration of the unique aspects of microbiome profiles. We begin this review with a brief overview of microbiome data collection and processing and describe the resulting data structure. We then provide an overview of statistical methods for key tasks in microbiome data analysis, including data visualization, comparison of microbial abundance across groups, regression modeling, and network inference. We conclude with a discussion and highlight interesting future directions.

10.

Enhancing Gene Co-Expression Network Inference for the Malaria Parasite Plasmodium falciparum.

Li, Qi; Button-Simons, Katrina A; Sievert, Mackenzie A C; Chahoud, Elias; Foster, Gabriel F; Meis, Kaitlynn; Ferdig, Michael T; Milenkovic, Tijana.

Genes (Basel) ; 15(6), 2024 May 25.

Article in English | MEDLINE | ID: mdl-38927622

ABSTRACT

BACKGROUND: Malaria results in more than 550,000 deaths each year due to drug resistance in the most lethal Plasmodium (P.) species P. falciparum. A full P. falciparum genome was published in 2002, yet 44.6% of its genes have unknown functions. Improving the functional annotation of genes is important for identifying drug targets and understanding the evolution of drug resistance. RESULTS: Genes function by interacting with one another. So, analyzing gene co-expression networks can enhance functional annotations and prioritize genes for wet lab validation. Earlier efforts to build gene co-expression networks in P. falciparum have been limited to a single network inference method or gaining biological understanding for only a single gene and its interacting partners. Here, we explore multiple inference methods and aim to systematically predict functional annotations for all P. falciparum genes. We evaluate each inferred network based on how well it predicts existing gene-Gene Ontology (GO) term annotations using network clustering and leave-one-out crossvalidation. We assess overlaps of the different networks' edges (gene co-expression relationships), as well as predicted functional knowledge. The networks' edges are overall complementary: 47-85% of all edges are unique to each network. In terms of the accuracy of predicting gene functional annotations, all networks yielded relatively high precision (as high as 87% for the network inferred using mutual information), but the highest recall reached was below 15%. All networks having low recall means that none of them capture a large amount of all existing gene-GO term annotations. In fact, their annotation predictions are highly complementary, with the largest pairwise overlap of only 27%. We provide ranked lists of inferred gene-gene interactions and predicted gene-GO term annotations for future use and wet lab validation by the malaria community. 
CONCLUSIONS: The different networks seem to capture different aspects of the P. falciparum biology in terms of both inferred interactions and predicted gene functional annotations. Thus, relying on a single network inference method should be avoided when possible. SUPPLEMENTARY DATA: Attached.

Subjects

Gene Regulatory Networks , Plasmodium falciparum , Plasmodium falciparum/genetics , Malaria, Falciparum/parasitology , Malaria, Falciparum/genetics , Humans , Gene Ontology , Molecular Sequence Annotation/methods , Protozoan Proteins/genetics
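
The precision and recall figures quoted in the P. falciparum abstract above compare predicted gene-GO term annotations with existing ones. The sketch below shows how such figures are commonly computed from two sets of pairs; the function `precision_recall`, the gene labels, and the term labels are hypothetical placeholders, and the paper's own evaluation (network clustering with leave-one-out cross-validation) is more involved.

```python
from typing import Set, Tuple

Annotation = Tuple[str, str]  # (gene label, GO term label)


def precision_recall(predicted: Set[Annotation], known: Set[Annotation]) -> Tuple[float, float]:
    """Return (precision, recall) of predicted annotations against known ones."""
    true_positives = len(predicted & known)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(known) if known else 0.0
    return precision, recall


# Placeholder labels only; these are not real gene or GO identifiers.
known = {
    ("gene1", "termA"), ("gene2", "termA"), ("gene3", "termB"), ("gene4", "termB"),
    ("gene5", "termC"), ("gene6", "termC"), ("gene7", "termD"), ("gene8", "termD"),
}
predicted = {("gene1", "termA"), ("gene3", "termB")}

precision, recall = precision_recall(predicted, known)
print(precision, recall)
```

In this toy case every prediction is correct but only a quarter of the known annotations are recovered, which mirrors the high-precision, low-recall pattern the abstract describes.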

11.

Reverse engineering neuron type-specific and type-orthogonal splicing-regulatory networks using single-cell transcriptomes.

Moakley, Daniel F; Campbell, Melissa; Anglada-Girotto, Miquel; Feng, Huijuan; Califano, Andrea; Au, Edmund; Zhang, Chaolin.

bioRxiv ; 2024 Jun 15.

Article in English | MEDLINE | ID: mdl-38915499

ABSTRACT

Cell type-specific alternative splicing (AS) enables differential gene isoform expression between diverse neuron types with distinct identities and functions. Current studies linking individual RNA-binding proteins (RBPs) to AS in a few neuron types underscore the need for holistic modeling. Here, we use network reverse engineering to derive a map of the neuron type-specific AS regulatory landscape from 133 mouse neocortical cell types defined by single-cell transcriptomes. This approach reliably inferred the regulons of 350 RBPs and their cell type-specific activities. Our analysis revealed driving factors delineating neuronal identities, among which we validated Elavl2 as a key RBP for MGE-specific splicing in GABAergic interneurons using an in vitro ESC differentiation system. We also identified a module of exons and candidate regulators specific for long- and short-projection neurons across multiple neuronal classes. This study provides a resource for elucidating splicing regulatory programs that drive neuronal molecular diversity, including those that do not align with gene expression-based classifications.

12.

LogicGep: Boolean networks inference using symbolic regression from time-series transcriptomic profiling data.

Zhang, Dezhen; Gao, Shuhua; Liu, Zhi-Ping; Gao, Rui.

Brief Bioinform ; 25(4), 2024 May 23.

Article in English | MEDLINE | ID: mdl-38886006

ABSTRACT

Reconstructing the topology of gene regulatory networks from gene expression data has been extensively studied. With the abundance of functional transcriptomic data available, it is now feasible to systematically decipher regulatory interaction dynamics in a logic form such as a Boolean network (BN) framework, which qualitatively indicates how multiple regulators aggregate to affect a common target gene. However, inferring both the network topology and gene interaction dynamics simultaneously is still a challenging problem since gene expression data are typically noisy and data discretization is prone to information loss. We propose a new method for BN inference from time-series transcriptional profiles, called LogicGep. LogicGep formulates the identification of Boolean functions as a symbolic regression problem that learns the Boolean function expression and solves it efficiently through multi-objective optimization using an improved gene expression programming algorithm. To avoid overly emphasizing dynamic characteristics at the expense of topology structure ones, as traditional methods often do, a set of promising Boolean formulas for each target gene is evolved first, and a feed-forward neural network trained with continuous expression data is subsequently employed to pick out the final solution. We validated the efficacy of LogicGep using multiple datasets including both synthetic and real-world experimental data. The results show that LogicGep infers accurate BN models, outperforming other representative BN inference algorithms in both network topology reconstruction and the identification of Boolean functions. Moreover, the execution of LogicGep is hundreds of times faster than other methods, especially in the case of large network inference.

Subjects

Algorithms , Gene Expression Profiling , Gene Regulatory Networks , Gene Expression Profiling/methods , Humans , Transcriptome , Software , Computational Biology/methods , Neural Networks, Computer
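
For readers unfamiliar with the Boolean network (BN) framework named in the LogicGep abstract above, the sketch below simulates a small BN with synchronous updates. The three genes and their rules are invented for illustration and are not from the paper; the synchronous update scheme is also an assumption. LogicGep addresses the inverse problem of learning such rules from time-series data.

```python
from typing import Callable, Dict, List

State = Dict[str, bool]

# Hypothetical three-gene network: each rule maps the current state
# to that gene's next value.
rules: Dict[str, Callable[[State], bool]] = {
    "A": lambda s: not s["C"],             # C represses A
    "B": lambda s: s["A"] and not s["C"],  # A activates B unless C is on
    "C": lambda s: s["A"] or s["B"],       # A or B activates C
}


def step(state: State) -> State:
    """Synchronous update: every gene reads the same current state."""
    return {gene: rule(state) for gene, rule in rules.items()}


def simulate(state: State, n_steps: int) -> List[State]:
    """Return the trajectory of n_steps + 1 states, starting from `state`."""
    trajectory = [state]
    for _ in range(n_steps):
        state = step(state)
        trajectory.append(state)
    return trajectory


initial = {"A": True, "B": False, "C": False}
for t, s in enumerate(simulate(initial, 4)):
    print(t, s)
```

With these made-up rules the trajectory returns to its starting state after four steps, a simple example of the cyclic attractors that Boolean models can exhibit.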

13.

Neural Activity in Quarks Language: Lattice Field Theory for a Network of Real Neurons.

Bardella, Giampiero; Franchini, Simone; Pan, Liming; Balzan, Riccardo; Ramawat, Surabhi; Brunamonti, Emiliano; Pani, Pierpaolo; Ferraina, Stefano.

Entropy (Basel) ; 26(6), 2024 Jun 06.

Article in English | MEDLINE | ID: mdl-38920504

ABSTRACT

Brain-computer interfaces have seen extraordinary surges in developments in recent years, and a significant discrepancy now exists between the abundance of available data and the limited headway made in achieving a unified theoretical framework. This discrepancy becomes particularly pronounced when examining the collective neural activity at the micro and meso scale, where a coherent formalization that adequately describes neural interactions is still lacking. Here, we introduce a mathematical framework to analyze systems of natural neurons and interpret the related empirical observations in terms of lattice field theory, an established paradigm from theoretical particle physics and statistical mechanics. Our methods are tailored to interpret data from chronic neural interfaces, especially spike rasters from measurements of single neuron activity, and generalize the maximum entropy model for neural networks so that the time evolution of the system is also taken into account. This is obtained by bridging particle physics and neuroscience, paving the way for particle physics-inspired models of the neocortex.

14.

Combining compositional data sets introduces error in covariance network reconstruction.

Brunner, James D; Robinson, Aaron J; Chain, Patrick S G.

ISME Commun ; 4(1): ycae057, 2024 Jan.

Article in English | MEDLINE | ID: mdl-38812718

ABSTRACT

Microbial communities are diverse biological systems that include taxa from across multiple kingdoms of life. Notably, interactions between bacteria and fungi play a significant role in determining community structure. However, these statistical associations across kingdoms are more difficult to infer than intra-kingdom associations due to the nature of the data involved using standard network inference techniques. We quantify the challenges of cross-kingdom network inference from both theoretical and practical points of view using synthetic and real-world microbiome data. We detail the theoretical issue presented by combining compositional data sets drawn from the same environment, e.g. 16S and ITS sequencing of a single set of samples, and we survey common network inference techniques for their ability to handle this error. We then test these techniques for the accuracy and usefulness of their intra- and inter-kingdom associations by inferring networks from a set of simulated samples for which a ground-truth set of associations is known. We show that while the two methods mitigate the error of cross-kingdom inference, there is little difference between techniques for key practical applications including identification of strong correlations and identification of possible keystone taxa (i.e. hub nodes in the network). Furthermore, we identify a signature of the error caused by transkingdom network inference and demonstrate that it appears in networks constructed using real-world environmental microbiome data.

15.

CVGAE: A Self-Supervised Generative Method for Gene Regulatory Network Inference Using Single-Cell RNA Sequencing Data.

Liu, Wei; Teng, Zhijie; Li, Zejun; Chen, Jing.

Interdiscip Sci ; 2024 May 23.

Article in English | MEDLINE | ID: mdl-38778003

ABSTRACT

Gene regulatory network (GRN) inference based on single-cell RNA sequencing data (scRNAseq) plays a crucial role in understanding the regulatory mechanisms between genes. Various computational methods have been employed for GRN inference, but their performance in terms of network accuracy and model generalization is not satisfactory, and their poor performance is caused by high-dimensional data and network sparsity. In this paper, we propose a self-supervised method for gene regulatory network inference using single-cell RNA sequencing data (CVGAE). CVGAE uses a graph neural network for inductive representation learning, which merges gene expression data and observed topology into a low-dimensional vector space. The well-trained vectors are used to calculate the mathematical distance between genes and to predict interactions between them. In the overall framework, FastICA is implemented to relieve the computational complexity caused by high-dimensional data, and CVGAE adopts multi-stacked GraphSAGE layers as an encoder and an improved decoder to overcome network sparsity. CVGAE is evaluated on several single-cell datasets containing four related ground-truth networks, and the results show that CVGAE achieves better performance than comparative methods. To validate learning and generalization capabilities, CVGAE is applied in a few-shot environment by changing the ratio of the training set to the test set. Under few-shot conditions, CVGAE obtains comparable or superior performance.

16.

PMF-GRN: a variational inference approach to single-cell gene regulatory network inference using probabilistic matrix factorization.

Skok Gibbs, Claudia; Mahmood, Omar; Bonneau, Richard; Cho, Kyunghyun.

Genome Biol ; 25(1): 88, 2024 Apr 08.

Article in English | MEDLINE | ID: mdl-38589899

ABSTRACT

Inferring gene regulatory networks (GRNs) from single-cell data is challenging due to heuristic limitations. Existing methods also lack estimates of uncertainty. Here we present Probabilistic Matrix Factorization for Gene Regulatory Network Inference (PMF-GRN). Using single-cell expression data, PMF-GRN infers latent factors capturing transcription factor activity and regulatory relationships. Using variational inference allows hyperparameter search for principled model selection and direct comparison to other generative models. We extensively test and benchmark our method using real single-cell datasets and synthetic data. We show that PMF-GRN infers GRNs more accurately than current state-of-the-art single-cell GRN inference methods, offering well-calibrated uncertainty estimates.

Subjects

Algorithms , Gene Regulatory Networks

17.

Modeling Microbial Community Networks: Methods and Tools for Studying Microbial Interactions.

Srinivasan, Shanchana; Jnana, Apoorva; Murali, Thokur Sreepathy.

Microb Ecol ; 87(1): 56, 2024 Apr 08.

Article in English | MEDLINE | ID: mdl-38587642

ABSTRACT

Microbial interactions function as a fundamental unit in complex ecosystems. By characterizing the type of interaction (positive, negative, neutral) occurring in these dynamic systems, one can begin to unravel the role played by the microbial species. Towards this, various methods have been developed to decipher the function of the microbial communities. The current review focuses on the various qualitative and quantitative methods that currently exist to study microbial interactions. Qualitative methods such as co-culturing experiments are visualized using microscopy-based techniques and are combined with data obtained from multi-omics technologies (metagenomics, metabolomics, metatranscriptomics). Quantitative methods include the construction of networks and network inference, computational models, and development of synthetic microbial consortia. These methods provide a valuable clue on various roles played by interacting partners, as well as possible solutions to overcome pathogenic microbes that can cause life-threatening infections in susceptible hosts. Studying the microbial interactions will further our understanding of complex less-studied ecosystems and enable design of effective frameworks for treatment of infectious diseases.

Subjects

Microbial Interactions, Microbiota, Humans, Microbial Consortia, Coculture Techniques, Community Networks

18.

Inferring networks from time series: A neural approach.

Gaskin, Thomas; Pavliotis, Grigorios A; Girolami, Mark.

PNAS Nexus ; 3(4): pgae063, 2024 Apr.

Article in English | MEDLINE | ID: mdl-38560526

ABSTRACT

Network structures underlie the dynamics of many complex phenomena, from gene regulation and foodwebs to power grids and social media. Yet, as they often cannot be observed directly, their connectivities must be inferred from observations of the dynamics to which they give rise. In this work, we present a powerful computational method to infer large network adjacency matrices from time series data using a neural network, in order to provide uncertainty quantification on the prediction in a manner that reflects both the degree to which the inference problem is underdetermined as well as the noise on the data. This is a feature that other approaches have hitherto been lacking. We demonstrate our method's capabilities by inferring line failure locations in the British power grid from its response to a power cut, providing probability densities on each edge and allowing the use of hypothesis testing to make meaningful probabilistic statements about the location of the cut. Our method is significantly more accurate than both Markov-chain Monte Carlo sampling and least squares regression on noisy data and when the problem is underdetermined, while naturally extending to the case of nonlinear dynamics, which we demonstrate by learning an entire cost matrix for a nonlinear model of economic activity in Greater London. Not having been specifically engineered for network inference, this method in fact represents a general parameter estimation scheme that is applicable to any high-dimensional parameter space.

19.

Network-based analysis of heterogeneous patient-matched brain and extracranial melanoma metastasis pairs reveals three homogeneous subgroups.

Grützmann, Konrad; Kraft, Theresa; Meinhardt, Matthias; Meier, Friedegund; Westphal, Dana; Seifert, Michael.

Comput Struct Biotechnol J ; 23: 1036-1050, 2024 Dec.

Article in English | MEDLINE | ID: mdl-38464935

ABSTRACT

Melanoma, the deadliest form of skin cancer, can metastasize to different organs. Molecular differences between brain and extracranial melanoma metastases are poorly understood. Here, promoter methylation and gene expression of 11 heterogeneous patient-matched pairs of brain and extracranial metastases were analyzed using melanoma-specific gene regulatory networks learned from public transcriptome and methylome data, followed by network-based impact propagation of patient-specific alterations. This innovative data analysis strategy made it possible to predict potential impacts of patient-specific driver candidate genes on other genes and pathways. The patient-matched metastasis pairs clustered into three robust subgroups with specific downstream targets with known roles in cancer, including melanoma (SG1: RBM38, BCL11B; SG2: GATA3, FES; SG3: SLAMF6, PYCARD). Patient subgroups and ranking of target gene candidates were confirmed in a validation cohort. In summary, computational network-based impact analyses of heterogeneous metastasis pairs predicted individual regulatory differences in melanoma brain metastases, culminating in three consistent subgroups with specific downstream target genes.

20.

Studying temporal dynamics of single cells: expression, lineage and regulatory networks.

Pan, Xinhai; Zhang, Xiuwei.

Biophys Rev ; 16(1): 57-67, 2024 Feb.

Article in English | MEDLINE | ID: mdl-38495440

ABSTRACT

Learning how multicellular organs are developed from single cells to different cell types is a fundamental problem in biology. With the high-throughput scRNA-seq technology, computational methods have been developed to reveal the temporal dynamics of single cells from transcriptomic data, from phenomena on cell trajectories to the underlying mechanism that formed the trajectory. There are several distinct families of computational methods including Trajectory Inference (TI), Lineage Tracing (LT), and Gene Regulatory Network (GRN) Inference which are involved in such studies. This review summarizes these computational approaches which use scRNA-seq data to study cell differentiation and cell fate specification as well as the advantages and limitations of different methods. We further discuss how GRNs can potentially affect cell fate decisions and trajectory structures. Supplementary Information: The online version contains supplementary material available at 10.1007/s12551-023-01090-5.
