Pesquisa | Portal Regional da BVS

1.

Bayesian mixture regression analysis for regulation of Pluripotency in ES cells.

Aflakparast, Mehran; Geeven, Geert; de Gunst, Mathisca C M.

BMC Bioinformatics ; 21(1): 3, 2020 Jan 02.

Artigo em Inglês | MEDLINE | ID: mdl-31898480

RESUMO

BACKGROUND: Observed levels of gene expression strongly depend on both activity of DNA binding transcription factors (TFs) and chromatin state through different histone modifications (HMs). In order to recover the functional relationship between local chromatin state, TF binding and observed levels of gene expression, regression methods have proven to be useful tools. They have been successfully applied to predict mRNA levels from genome-wide experimental data and they provide insight into context-dependent gene regulatory mechanisms. However, heterogeneity arising from gene-set specific regulatory interactions is often overlooked. RESULTS: We show that regression models that predict gene expression by using experimentally derived ChIP-seq profiles of TFs can be significantly improved by mixture modelling. In order to find biologically relevant gene clusters, we employ a Bayesian allocation procedure which allows us to integrate additional biological information such as three-dimensional nuclear organization of chromosomes and gene function. The data integration procedure involves transforming the additional data into gene similarity values. We propose a generic similarity measure that is especially suitable for situations where the additional data are of both continuous and discrete type, and compare its performance with similar measures in the context of mixture modelling. CONCLUSIONS: We applied the proposed method on a data from mouse embryonic stem cells (ESC). We find that including additional data results in mixture components that exhibit biologically meaningful gene clusters, and provides valuable insight into the heterogeneity of the regulatory interactions.

Assuntos

Células-Tronco Embrionárias/metabolismo , Regulação da Expressão Gênica , Células-Tronco Pluripotentes/metabolismo , Animais , Teorema de Bayes , Cromatina/genética , Cromatina/metabolismo , Imunoprecipitação da Cromatina , Genoma , Camundongos , Análise de Regressão , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo

2.

Reconstruction of molecular network evolution from cross-sectional omics data.

Aflakparast, Mehran; de Gunst, Mathisca C M; van Wieringen, Wessel N.

Biom J ; 60(3): 547-563, 2018 05.

Artigo em Inglês | MEDLINE | ID: mdl-29320604

RESUMO

Cross-sectional studies may shed light on the evolution of a disease like cancer through the comparison of patient traits among disease stages. This problem is especially challenging when a gene-gene interaction network needs to be reconstructed from omics data, and, in addition, the patients of each stage need not form a homogeneous group. Here, the problem is operationalized as the estimation of stage-wise mixtures of Gaussian graphical models (GGMs) from high-dimensional data. These mixtures are fitted by a (fused) ridge penalized EM algorithm. The fused ridge penalty shrinks GGMs of contiguous stages. The (fused) ridge penalty parameters are chosen through cross-validation. The proposed estimation procedures are shown to be consistent and their performance in other respects is studied in simulation. The down-stream exploitation of the fitted GGMs is outlined. In a data illustration the methodology is employed to identify gene-gene interaction network changes in the transition from normal to cancer prostate tissue.

Assuntos

Biologia Computacional , Estudos Transversais , Redes Reguladoras de Genes , Humanos , Modelos Estatísticos , Distribuição Normal

3.

Gene Network Reconstruction using Global-Local Shrinkage Priors.

Leday, Gwenaël G R; de Gunst, Mathisca C M; Kpogbezan, Gino B; van der Vaart, Aad W; van Wieringen, Wessel N; van de Wiel, Mark A.

Ann Appl Stat ; 11(1): 41-68, 2017 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-28408966

RESUMO

Reconstructing a gene network from high-throughput molecular data is an important but challenging task, as the number of parameters to estimate easily is much larger than the sample size. A conventional remedy is to regularize or penalize the model likelihood. In network models, this is often done locally in the neighbourhood of each node or gene. However, estimation of the many regularization parameters is often difficult and can result in large statistical uncertainties. In this paper we propose to combine local regularization with global shrinkage of the regularization parameters to borrow strength between genes and improve inference. We employ a simple Bayesian model with non-sparse, conjugate priors to facilitate the use of fast variational approximations to posteriors. We discuss empirical Bayes estimation of hyper-parameters of the priors, and propose a novel approach to rank-based posterior thresholding. Using extensive model- and data-based simulations, we demonstrate that the proposed inference strategy outperforms popular (sparse) methods, yields more stable edges, and is more reproducible. The proposed method, termed ShrinkNet, is then applied to Glioblastoma to investigate the interactions between genes associated with patient survival.

4.

A three domain covariance framework for EEG/MEG data.

Ros, Beata P; Bijma, Fetsje; de Gunst, Mathisca C M; de Munck, Jan C.

Neuroimage ; 119: 305-15, 2015 Oct 01.

Artigo em Inglês | MEDLINE | ID: mdl-26072253

RESUMO

In this paper we introduce a covariance framework for the analysis of single subject EEG and MEG data that takes into account observed temporal stationarity on small time scales and trial-to-trial variations. We formulate a model for the covariance matrix, which is a Kronecker product of three components that correspond to space, time and epochs/trials, and consider maximum likelihood estimation of the unknown parameter values. An iterative algorithm that finds approximations of the maximum likelihood estimates is proposed. Our covariance model is applicable in a variety of cases where spontaneous EEG or MEG acts as source of noise and realistic noise covariance estimates are needed, such as in evoked activity studies, or where the properties of spontaneous EEG or MEG are themselves the topic of interest, like in combined EEG-fMRI experiments in which the correlation between EEG and fMRI signals is investigated. We use a simulation study to assess the performance of the estimator and investigate the influence of different assumptions about the covariance factors on the estimated covariance matrix and on its components. We apply our method to real EEG and MEG data sets.

Assuntos

Mapeamento Encefálico/métodos , Encéfalo/fisiologia , Eletroencefalografia/métodos , Imageamento por Ressonância Magnética/métodos , Magnetoencefalografia/métodos , Processamento de Sinais Assistido por Computador , Adulto , Algoritmos , Ondas Encefálicas , Simulação por Computador , Feminino , Humanos , Funções Verossimilhança , Masculino , Reprodutibilidade dos Testes , Adulto Jovem

5.

Comparison of targeted maximum likelihood and shrinkage estimators of parameters in gene networks.

Geeven, Geert; van der Laan, Mark J; de Gunst, Mathisca C M.

Stat Appl Genet Mol Biol ; 11(5): Article 2, 2012 Sep 25.

Artigo em Inglês | MEDLINE | ID: mdl-23023699

RESUMO

Gene regulatory networks, in which edges between nodes describe interactions between transcription factors (TFs) and their target genes, model regulatory interactions that determine the cell-type and condition-specific expression of genes. Regression methods can be used to identify TF-target gene interactions from gene expression and DNA sequence data. The response variable, i.e. observed gene expression, is modeled as a function of many predictor variables simultaneously. In practice, it is generally not possible to select a single model that clearly achieves the best fit to the observed experimental data and the selected models typically contain overlapping sets of predictor variables. Moreover, parameters that represent the marginal effect of the individual predictors are not always present. In this paper, we use the statistical framework of estimation of variable importance to define variable importance as a parameter of interest and study two different estimators of this parameter in the context of gene regulatory networks. On yeast data we show that the resulting parameter has a biologically appealing interpretation. We apply the proposed methodology on mammalian gene expression data to gain insight into the temporal activity of TFs that underly gene expression changes in F11 cells in response to Forskolin stimulation.

Assuntos

Redes Reguladoras de Genes , Funções Verossimilhança , Perfilação da Expressão Gênica/estatística & dados numéricos , Modelos Genéticos , Probabilidade , Análise de Regressão , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo

6.

Identification of context-specific gene regulatory networks with GEMULA--gene expression modeling using LAsso.

Geeven, Geert; van Kesteren, Ronald E; Smit, August B; de Gunst, Mathisca C M.

Bioinformatics ; 28(2): 214-21, 2012 Jan 15.

Artigo em Inglês | MEDLINE | ID: mdl-22106333

RESUMO

MOTIVATION: Gene regulatory networks, in which edges between nodes describe interactions between transcriptional regulators and their target genes, determine the coordinated spatiotemporal expression of genes. Especially in higher organisms, context-specific combinatorial regulation by transcription factors (TFs) is believed to determine cellular states and fates. TF-target gene interactions can be studied using high-throughput techniques such as ChIP-chip or ChIP-Seq. These experiments are time and cost intensive, and further limited by, for instance, availability of high affinity TF antibodies. Hence, there is a practical need for methods that can predict TF-TF and TF-target gene interactions in silico, i.e. from gene expression and DNA sequence data alone. We propose GEMULA, a novel approach based on linear models to predict TF-gene expression associations and TF-TF interactions from experimental data. GEMULA is based on linear models, fast and considers a wide range of biologically plausible models that describe gene expression data as a function of predicted TF binding to gene promoters. RESULTS: We show that models inferred with GEMULA are able to explain roughly 70% of the observed variation in gene expression in the yeast heat shock response. The functional relevance of the inferred TF-TF interactions in these models are validated by different sources of independent experimental evidence. We also have applied GEMULA to an in vitro model of neuronal outgrowth. Our findings confirm existing knowledge on gene regulatory interactions underlying neuronal outgrowth, but importantly also generate new insights into the temporal dynamics of this gene regulatory network that can now be addressed experimentally. AVAILABILITY: The GEMULA R-package is available from http://www.few.vu.nl/~degunst/gemula_1.0.tar.gz.

Assuntos

Redes Reguladoras de Genes , Modelos Genéticos , Software , Animais , Regulação da Expressão Gênica , Humanos , Modelos Lineares , Análise de Sequência com Séries de Oligonucleotídeos , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Fatores de Transcrição/metabolismo

7.

LLM3D: a log-linear modeling-based method to predict functional gene regulatory interactions from genome-wide expression data.

Geeven, Geert; Macgillavry, Harold D; Eggers, Ruben; Sassen, Marion M; Verhaagen, Joost; Smit, August B; de Gunst, Mathisca C M; van Kesteren, Ronald E.

Nucleic Acids Res ; 39(13): 5313-27, 2011 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-21422075

RESUMO

All cellular processes are regulated by condition-specific and time-dependent interactions between transcription factors and their target genes. While in simple organisms, e.g. bacteria and yeast, a large amount of experimental data is available to support functional transcription regulatory interactions, in mammalian systems reconstruction of gene regulatory networks still heavily depends on the accurate prediction of transcription factor binding sites. Here, we present a new method, log-linear modeling of 3D contingency tables (LLM3D), to predict functional transcription factor binding sites. LLM3D combines gene expression data, gene ontology annotation and computationally predicted transcription factor binding sites in a single statistical analysis, and offers a methodological improvement over existing enrichment-based methods. We show that LLM3D successfully identifies novel transcriptional regulators of the yeast metabolic cycle, and correctly predicts key regulators of mouse embryonic stem cell self-renewal more accurately than existing enrichment-based methods. Moreover, in a clinically relevant in vivo injury model of mammalian neurons, LLM3D identified peroxisome proliferator-activated receptor Î³ (PPARÎ³) as a neuron-intrinsic transcriptional regulator of regenerative axon growth. In conclusion, LLM3D provides a significant improvement over existing methods in predicting functional transcription regulatory interactions in the absence of experimental transcription factor binding data.

Assuntos

Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Fatores de Transcrição/metabolismo , Animais , Sítios de Ligação , Linhagem Celular , Células-Tronco Embrionárias/metabolismo , Genoma , Modelos Lineares , Camundongos , Regeneração Nervosa/genética , Neurônios/metabolismo , PPAR gama/metabolismo , Ratos , Ratos Wistar , Leveduras/genética , Leveduras/metabolismo

8.

Identification of candidate transcriptional modulators involved in successful regeneration after nerve injury.

Stam, Floor J; MacGillavry, Harold D; Armstrong, Nicola J; de Gunst, Mathisca C M; Zhang, Yi; van Kesteren, Ronald E; Smit, August B; Verhaagen, Joost.

Eur J Neurosci ; 25(12): 3629-37, 2007 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-17610582

RESUMO

Successful regeneration of injured neurons requires a complex molecular response that involves the expression, modification and transport of a large number of proteins. The identity of neuronal proteins responsible for the initiation of regenerative neurite outgrowth is largely unknown. Dorsal root ganglion (DRG) neurons display robust and successful regeneration following lesion of their peripheral neurite, whereas outgrowth of central neurites is weak and does not lead to functional recovery. We have utilized this differential response to gain insight in the early transcriptional events associated with successful regeneration. Surprisingly, our study shows that peripheral and central nerve crushes elicit very distinct transcriptional activation, revealing a large set of novel genes that are differentially regulated within the first 24 h after the lesion. Here we show that Ankrd1, a gene known to act as a transcriptional modulator, is involved in neurite outgrowth of a DRG neuron-derived cell line as well as in cultured adult DRG neurons. This gene, and others identified in this study, may be part of the transcriptional regulatory module that orchestrates the onset of successful regeneration.

Assuntos

Regulação da Expressão Gênica/fisiologia , Regeneração Nervosa/fisiologia , Neuropatia Ciática/fisiopatologia , Traumatismos da Medula Espinal/fisiopatologia , Fatores de Transcrição/metabolismo , Animais , Células Cultivadas , Feminino , Gânglios Espinais/patologia , Perfilação da Expressão Gênica/métodos , Hibridização In Situ/métodos , Masculino , Proteínas Musculares , Neurônios/metabolismo , Proteínas Nucleares , Ratos , Ratos Sprague-Dawley , Ratos Wistar , Proteínas Repressoras , Neuropatia Ciática/patologia , Traumatismos da Medula Espinal/patologia , Transfecção

9.

Genetic study of the height and weight process during infancy.

van Dommelen, Paula; de Gunst, Mathisca C M; van der Vaart, Aad W; Boomsma, Dorret I.

Twin Res ; 7(6): 607-16, 2004 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-15607012

RESUMO

Longitudinal height and weight data from 4649 Dutch twin pairs between birth and 2.5 years of age were analyzed. The data were first summarized into parameters of a polynomial of degree 4 by a mixed-effects procedure. Next, the variation and covariation in the parameters of the growth curve (size at one year of age, growth velocity, deceleration of growth, rate of change in deceleration [i.e., jerk] and rate of change in jerk [i.e., snap]) were decomposed into genetic and nongenetic sources. Additionally, the variation in the estimated size at birth and at 2 years of age interpolated from the polynomial was decomposed into genetic and nongenetic components. Variation in growth was best characterized by a genetic model which included additive genetic, common environmental and specific environmental influences, plus effects of gestational age. The effect of gestational age was largest for size at birth, explaining 39% of the variance. The differences between monozygotic and dizygotic twin correlations were largest for size at 1 and 2 years of age and growth velocity of weight, which suggests that these parameters are more influenced by heritability than size at birth, deceleration and jerk. The percentage of variance explained by additive genetic influences for height at 2 years of age was 52% for females and 58% for males. For weight at 2 years of age, heritability was approximately 58% for both sexes. Variation in snap height for males was also mainly influenced by additive genetic factors, while snap for females was influenced by both additive genetic and common environmental factors. The correlations for the additive genetic and common environmental factors for deceleration and snap are large, indicating that these parameters are almost entirely under control of the same additive genetic and common environmental factors. Female jerk and snap, and also female height at birth and height at 2 years of age, are mostly under control of the same additive genetic factor.

Assuntos

Estatura/fisiologia , Peso Corporal/fisiologia , Gêmeos/fisiologia , Adolescente , Adulto , Fatores Etários , Algoritmos , Estatura/genética , Peso Corporal/genética , Idade Gestacional , Humanos , Lactente , Recém-Nascido , Estudos Longitudinais , Modelos Genéticos , Países Baixos , Gêmeos/genética , Gêmeos Dizigóticos/genética , Gêmeos Dizigóticos/fisiologia , Gêmeos Monozigóticos/genética , Gêmeos Monozigóticos/fisiologia

10.

Morphine exposure and abstinence define specific stages of gene expression in the rat nucleus accumbens.

Spijker, Sabine; Houtzager, Siard W J; De Gunst, Mathisca C M; De Boer, Wim P H; Schoffelmeer, Anton N M; Smit, August B.

FASEB J ; 18(7): 848-50, 2004 May.

Artigo em Inglês | MEDLINE | ID: mdl-15033927

RESUMO

Intermittent exposure to addictive drugs causes long-lasting changes in responsiveness to these substances due to persistent molecular and cellular alterations within the meso-corticolimbic system. In this report, we studied the expression profiles of 159 genes in the rat nucleus accumbens during morphine exposure (14 days, 10 mg/kg s.c.) and drug-abstinence (3 weeks). We used real-time quantitative PCR to monitor gene expression after establishing its sensitivity and resolution to resolve small changes in expression for genes in various abundance classes. Morphine-exposure (5 time points) and subsequent abstinence (6 time points) induced phase-specific temporal gene expression of distinct functional groups of genes, for example, short-term homeostatic responses. Opiate withdrawal appeared to be a new stimulus in terms of gene expression and mediates a marked wave of gene repression. Prolonged abstinence resulted in persistently changed expression levels of genes involved in neuronal outgrowth and re-wiring. Our findings substantiate the hypothesis that this new gene program, initiated upon morphine-withdrawal, may subserve long-term neuronal plasticity involved in the persistent behavioral consequences of repeated drug-exposure.

Assuntos

Regulação da Expressão Gênica/efeitos dos fármacos , Dependência de Morfina/genética , Morfina/farmacologia , Proteínas do Tecido Nervoso/biossíntese , Plasticidade Neuronal/efeitos dos fármacos , Núcleo Accumbens/efeitos dos fármacos , Síndrome de Abstinência a Substâncias/genética , Adaptação Fisiológica/genética , Animais , Comportamento Animal/efeitos dos fármacos , Sistemas Computacionais , Perfilação da Expressão Gênica , Genes Precoces/efeitos dos fármacos , Proteínas Imediatamente Precoces/biossíntese , Masculino , Dependência de Morfina/metabolismo , Proteínas do Tecido Nervoso/genética , Neurotransmissores/biossíntese , Neurotransmissores/genética , Núcleo Accumbens/metabolismo , Peptídeos Opioides/biossíntese , Reação em Cadeia da Polimerase , Ratos , Ratos Wistar , Reprodutibilidade dos Testes , Síndrome de Abstinência a Substâncias/metabolismo , Transmissão Sináptica/efeitos dos fármacos , Fatores de Transcrição/biossíntese , Fatores de Transcrição/genética

11.

Exploring heterogeneity in tumour data using Markov chain Monte Carlo.

de Gunst, Mathisca C M; Dewanji, Anup; Luebeck, E Georg.

Stat Med ; 22(10): 1691-707, 2003 May 30.

Artigo em Inglês | MEDLINE | ID: mdl-12720305

RESUMO

We describe a Bayesian approach to incorporate between-individual heterogeneity associated with parameters of complicated biological models. We emphasize the use of the Markov chain Monte Carlo (MCMC) method in this context and demonstrate the implementation and use of MCMC by analysis of simulated overdispersed Poisson counts and by analysis of an experimental data set on preneoplastic liver lesions (their number and sizes) in the presence of heterogeneity. These examples show that MCMC-based estimates, derived from the posterior distribution with uniform priors, may agree well with maximum likelihood estimates (if available). However, with heterogeneous parameters, maximum likelihood estimates can be difficult to obtain, involving many integrations. In this case, the MCMC method offers substantial computational advantages.

Assuntos

Neoplasias Hepáticas/patologia , Cadeias de Markov , Modelos Biológicos , Método de Monte Carlo , Animais , Teorema de Bayes , Nitrosaminas , Distribuição de Poisson , Lesões Pré-Cancerosas , Ratos , Processos Estocásticos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA