Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
1.
Philos Trans R Soc Lond B Biol Sci ; 374(1771): 20180025, 2019 04 29.
Artigo em Inglês | MEDLINE | ID: mdl-30852998

RESUMO

From neuroscience, brain imaging and the psychology of memory, we are beginning to assemble an integrated theory of the brain subsystems and pathways that allow the compression, storage and reconstruction of memories for past events and their use in contextualizing the present and reasoning about the future-mental time travel (MTT). Using computational models, embedded in humanoid robots, we are seeking to test the sufficiency of this theoretical account and to evaluate the usefulness of brain-inspired memory systems for social robots. In this contribution, we describe the use of machine learning techniques-Gaussian process latent variable models-to build a multimodal memory system for the iCub humanoid robot and summarize results of the deployment of this system for human-robot interaction. We also outline the further steps required to create a more complete robotic implementation of human-like autobiographical memory and MTT. We propose that generative memory models, such as those that form the core of our robot memory system, can provide a solution to the symbol grounding problem in embodied artificial intelligence. This article is part of the theme issue 'From social brains to social robots: applying neurocognitive insights to human-robot interaction'.


Assuntos
Cognição , Aprendizado de Máquina , Memória Episódica , Robótica , Humanos , Modelos Teóricos , Comportamento Social , Fatores de Tempo , Viagem
2.
Cogn Sci ; 42 Suppl 3: 809-832, 2018 06.
Artigo em Inglês | MEDLINE | ID: mdl-29315735

RESUMO

Both scientists and children make important structural discoveries, yet their computational underpinnings are not well understood. Structure discovery has previously been formalized as probabilistic inference about the right structural form-where form could be a tree, ring, chain, grid, etc. (Kemp & Tenenbaum, 2008). Although this approach can learn intuitive organizations, including a tree for animals and a ring for the color circle, it assumes a strong inductive bias that considers only these particular forms, and each form is explicitly provided as initial knowledge. Here we introduce a new computational model of how organizing structure can be discovered, utilizing a broad hypothesis space with a preference for sparse connectivity. Given that the inductive bias is more general, the model's initial knowledge shows little qualitative resemblance to some of the discoveries it supports. As a consequence, the model can also learn complex structures for domains that lack intuitive description, as well as predict human property induction judgments without explicit structural forms. By allowing form to emerge from sparsity, our approach clarifies how both the richness and flexibility of human conceptual organization can coexist.


Assuntos
Formação de Conceito , Modelos Teóricos , Algoritmos , Animais , Conjuntos de Dados como Assunto , Humanos , Julgamento
3.
Bioinformatics ; 33(23): 3776-3783, 2017 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-28961802

RESUMO

MOTIVATION: Regulation of gene expression in prokaryotes involves complex co-regulatory mechanisms involving large numbers of transcriptional regulatory proteins and their target genes. Uncovering these genome-scale interactions constitutes a major bottleneck in systems biology. Sparse latent factor models, assuming activity of transcription factors (TFs) as unobserved, provide a biologically interpretable modelling framework, integrating gene expression and genome-wide binding data, but at the same time pose a hard computational inference problem. Existing probabilistic inference methods for such models rely on subjective filtering and suffer from scalability issues, thus are not well-suited for realistic genome-scale applications. RESULTS: We present a fast Bayesian sparse factor model, which takes input gene expression and binding sites data, either from ChIP-seq experiments or motif predictions, and outputs active TF-gene links as well as latent TF activities. Our method employs an efficient variational Bayes scheme for model inference enabling its application to large datasets which was not feasible with existing MCMC-based inference methods for such models. We validate our method on synthetic data against a similar model in the literature, employing MCMC for inference, and obtain comparable results with a small fraction of the computational time. We also apply our method to large-scale data from Mycobacterium tuberculosis involving ChIP-seq data on 113 TFs and matched gene expression data for 3863 putative target genes. We evaluate our predictions using an independent transcriptomics experiment involving over-expression of TFs. AVAILABILITY AND IMPLEMENTATION: An easy-to-use Jupyter notebook demo of our method with data is available at https://github.com/zhenwendai/SITAR. CONTACT: mudassar.iqbal@manchester.ac.uk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Sítios de Ligação , Imunoprecipitação da Cromatina/métodos , Perfilação da Expressão Gênica/métodos , Regulação Bacteriana da Expressão Gênica , Modelos Biológicos , Mycobacterium tuberculosis/genética , Fatores de Transcrição/metabolismo , Teorema de Bayes , Biologia Computacional/métodos , Transcrição Gênica
4.
Sci Immunol ; 2(9)2017 Mar 03.
Artigo em Inglês | MEDLINE | ID: mdl-28345074

RESUMO

Differentiation of naïve CD4+ T cells into functionally distinct T helper subsets is crucial for the orchestration of immune responses. Due to extensive heterogeneity and multiple overlapping transcriptional programs in differentiating T cell populations, this process has remained a challenge for systematic dissection in vivo. By using single-cell transcriptomics and computational analysis using a temporal mixtures of Gaussian processes model, termed GPfates, we reconstructed the developmental trajectories of Th1 and Tfh cells during blood-stage Plasmodium infection in mice. By tracking clonality using endogenous TCR sequences, we first demonstrated that Th1/Tfh bifurcation had occurred at both population and single-clone levels. Next, we identified genes whose expression was associated with Th1 or Tfh fates, and demonstrated a T-cell intrinsic role for Galectin-1 in supporting a Th1 differentiation. We also revealed the close molecular relationship between Th1 and IL-10-producing Tr1 cells in this infection. Th1 and Tfh fates emerged from a highly proliferative precursor that upregulated aerobic glycolysis and accelerated cell cycling as cytokine expression began. Dynamic gene expression of chemokine receptors around bifurcation predicted roles for cell-cell in driving Th1/Tfh fates. In particular, we found that precursor Th cells were coached towards a Th1 but not a Tfh fate by inflammatory monocytes. Thus, by integrating genomic and computational approaches, our study has provided two unique resources, a database www.PlasmoTH.org, which facilitates discovery of novel factors controlling Th1/Tfh fate commitment, and more generally, GPfates, a modelling framework for characterizing cell differentiation towards multiple fates.

5.
Proc Natl Acad Sci U S A ; 112(42): 13115-20, 2015 Oct 20.
Artigo em Inglês | MEDLINE | ID: mdl-26438844

RESUMO

Genes with similar transcriptional activation kinetics can display very different temporal mRNA profiles because of differences in transcription time, degradation rate, and RNA-processing kinetics. Recent studies have shown that a splicing-associated RNA production delay can be significant. To investigate this issue more generally, it is useful to develop methods applicable to genome-wide datasets. We introduce a joint model of transcriptional activation and mRNA accumulation that can be used for inference of transcription rate, RNA production delay, and degradation rate given data from high-throughput sequencing time course experiments. We combine a mechanistic differential equation model with a nonparametric statistical modeling approach allowing us to capture a broad range of activation kinetics, and we use Bayesian parameter estimation to quantify the uncertainty in estimates of the kinetic parameters. We apply the model to data from estrogen receptor α activation in the MCF-7 breast cancer cell line. We use RNA polymerase II ChIP-Seq time course data to characterize transcriptional activation and mRNA-Seq time course data to quantify mature transcripts. We find that 11% of genes with a good signal in the data display a delay of more than 20 min between completing transcription and mature mRNA production. The genes displaying these long delays are significantly more likely to be short. We also find a statistical association between high delay and late intron retention in pre-mRNA data, indicating significant splicing-associated production delays in many genes.


Assuntos
Genoma Humano , Modelos Genéticos , RNA/biossíntese , Transcrição Gênica , Receptor alfa de Estrogênio/metabolismo , Humanos , Cinética , Células MCF-7 , RNA/genética , Transdução de Sinais
6.
IEEE Trans Pattern Anal Mach Intell ; 37(2): 383-93, 2015 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-26353249

RESUMO

In this publication, we combine two Bayesian nonparametric models: the Gaussian Process (GP) and the Dirichlet Process (DP). Our innovation in the GP model is to introduce a variation on the GP prior which enables us to model structured time-series data, i.e., data containing groups where we wish to model inter- and intra-group variability. Our innovation in the DP model is an implementation of a new fast collapsed variational inference procedure which enables us to optimize our variational approximation significantly faster than standard VB approaches. In a biological time series application we show how our model better captures salient features of the data, leading to better consistency with existing biological classifications, while the associated inference algorithm provides a significant speed-up over EM-based variational inference.


Assuntos
Análise por Conglomerados , Biologia Computacional/métodos , Distribuição Normal , Simulação por Computador , Perfilação da Expressão Gênica , Estatísticas não Paramétricas
7.
Nat Commun ; 5: 4890, 2014 Sep 19.
Artigo em Inglês | MEDLINE | ID: mdl-25234577

RESUMO

Linear mixed models (LMMs) are a powerful and established tool for studying genotype-phenotype relationships. A limitation of the LMM is that the model assumes Gaussian distributed residuals, a requirement that rarely holds in practice. Violations of this assumption can lead to false conclusions and loss in power. To mitigate this problem, it is common practice to pre-process the phenotypic values to make them as Gaussian as possible, for instance by applying logarithmic or other nonlinear transformations. Unfortunately, different phenotypes require different transformations, and choosing an appropriate transformation is challenging and subjective. Here we present an extension of the LMM that estimates an optimal transformation from the observed data. In simulations and applications to real data from human, mouse and yeast, we show that using transformations inferred by our model increases power in genome-wide association studies and increases the accuracy of heritability estimation and phenotype prediction.


Assuntos
Modelos Lineares , Modelos Genéticos , Animais , Simulação por Computador , Bases de Dados Factuais , Fungos/genética , Fungos/metabolismo , Estudos de Associação Genética , Estudo de Associação Genômica Ampla , Humanos , Camundongos , Distribuição Normal , Fenótipo , Polimorfismo de Nucleotídeo Único , Leveduras
8.
PLoS Comput Biol ; 10(5): e1003598, 2014 May.
Artigo em Inglês | MEDLINE | ID: mdl-24830797

RESUMO

Gene transcription mediated by RNA polymerase II (pol-II) is a key step in gene expression. The dynamics of pol-II moving along the transcribed region influence the rate and timing of gene expression. In this work, we present a probabilistic model of transcription dynamics which is fitted to pol-II occupancy time course data measured using ChIP-Seq. The model can be used to estimate transcription speed and to infer the temporal pol-II activity profile at the gene promoter. Model parameters are estimated using either maximum likelihood estimation or via Bayesian inference using Markov chain Monte Carlo sampling. The Bayesian approach provides confidence intervals for parameter estimates and allows the use of priors that capture domain knowledge, e.g. the expected range of transcription speeds, based on previous experiments. The model describes the movement of pol-II down the gene body and can be used to identify the time of induction for transcriptionally engaged genes. By clustering the inferred promoter activity time profiles, we are able to determine which genes respond quickly to stimuli and group genes that share activity profiles and may therefore be co-regulated. We apply our methodology to biological data obtained using ChIP-seq to measure pol-II occupancy genome-wide when MCF-7 human breast cancer cells are treated with estradiol (E2). The transcription speeds we obtain agree with those obtained previously for smaller numbers of genes with the advantage that our approach can be applied genome-wide. We validate the biological significance of the pol-II promoter activity clusters by investigating cluster-specific transcription factor binding patterns and determining canonical pathway enrichment. We find that rapidly induced genes are enriched for both estrogen receptor alpha (ERα) and FOXA1 binding in their proximal promoter regions.


Assuntos
Imunoprecipitação da Cromatina/métodos , RNA Polimerases Dirigidas por DNA/genética , Modelos Genéticos , Modelos Estatísticos , Regiões Promotoras Genéticas/genética , Transcrição Gênica/genética , Ativação Transcricional/genética , Animais , Simulação por Computador , Humanos , Ligação Proteica
9.
IEEE Trans Pattern Anal Mach Intell ; 35(11): 2693-705, 2013 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-24051729

RESUMO

Purely data-driven approaches for machine learning present difficulties when data are scarce relative to the complexity of the model or when the model is forced to extrapolate. On the other hand, purely mechanistic approaches need to identify and specify all the interactions in the problem at hand (which may not be feasible) and still leave the issue of how to parameterize the system. In this paper, we present a hybrid approach using Gaussian processes and differential equations to combine data-driven modeling with a physical model of the system. We show how different, physically inspired, kernel functions can be developed through sensible, simple, mechanistic assumptions about the underlying system. The versatility of our approach is illustrated with three case studies from motion capture, computational biology, and geostatistics.


Assuntos
Algoritmos , Inteligência Artificial , Modelos Lineares , Distribuição Normal , Reconhecimento Automatizado de Padrão/métodos , Simulação por Computador , Tamanho da Amostra
10.
BMC Bioinformatics ; 14: 252, 2013 Aug 20.
Artigo em Inglês | MEDLINE | ID: mdl-23962281

RESUMO

BACKGROUND: Time course data from microarrays and high-throughput sequencing experiments require simple, computationally efficient and powerful statistical models to extract meaningful biological signal, and for tasks such as data fusion and clustering. Existing methodologies fail to capture either the temporal or replicated nature of the experiments, and often impose constraints on the data collection process, such as regularly spaced samples, or similar sampling schema across replications. RESULTS: We propose hierarchical Gaussian processes as a general model of gene expression time-series, with application to a variety of problems. In particular, we illustrate the method's capacity for missing data imputation, data fusion and clustering.The method can impute data which is missing both systematically and at random: in a hold-out test on real data, performance is significantly better than commonly used imputation methods. The method's ability to model inter- and intra-cluster variance leads to more biologically meaningful clusters. The approach removes the necessity for evenly spaced samples, an advantage illustrated on a developmental Drosophila dataset with irregular replications. CONCLUSION: The hierarchical Gaussian process model provides an excellent statistical basis for several gene-expression time-series tasks. It has only a few additional parameters over a regular GP, has negligible additional complexity, is easily implemented and can be integrated into several existing algorithms. Our experiments were implemented in python, and are available from the authors' website: http://staffwww.dcs.shef.ac.uk/people/J.Hensman/.


Assuntos
Teorema de Bayes , Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , Modelos Genéticos , Animais , Análise por Conglomerados , Drosophila/genética , Drosophila/metabolismo , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Distribuição Normal , Análise de Sequência com Séries de Oligonucleotídeos/métodos
11.
Bioinformatics ; 29(11): 1382-9, 2013 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-23559640

RESUMO

MOTIVATION: Genomic studies have revealed a substantial heritable component of the transcriptional state of the cell. To fully understand the genetic regulation of gene expression variability, it is important to study the effect of genotype in the context of external factors such as alternative environmental conditions. In model systems, explicit environmental perturbations have been considered for this purpose, allowing to directly test for environment-specific genetic effects. However, such experiments are limited to species that can be profiled in controlled environments, hampering their use in important systems such as human. Moreover, even in seemingly tightly regulated experimental conditions, subtle environmental perturbations cannot be ruled out, and hence unknown environmental influences are frequent. Here, we propose a model-based approach to simultaneously infer unmeasured environmental factors from gene expression profiles and use them in genetic analyses, identifying environment-specific associations between polymorphic loci and individual gene expression traits. RESULTS: In extensive simulation studies, we show that our method is able to accurately reconstruct environmental factors and their interactions with genotype in a variety of settings. We further illustrate the use of our model in a real-world dataset in which one environmental factor has been explicitly experimentally controlled. Our method is able to accurately reconstruct the true underlying environmental factor even if it is not given as an input, allowing to detect genuine genotype-environment interactions. In addition to the known environmental factor, we find unmeasured factors involved in novel genotype-environment interactions. Our results suggest that interactions with both known and unknown environmental factors significantly contribute to gene expression variability. AVAILABILITY: and implementation: Software available at http://pmbio.github.io/envGPLVM/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Interação Gene-Ambiente , Regulação Fúngica da Expressão Gênica , Genótipo , Humanos , Modelos Lineares , Modelos Genéticos , Locos de Características Quantitativas
12.
Methods Mol Biol ; 939: 59-67, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23192541

RESUMO

Reverse engineering the gene regulatory network is challenging because the amount of available data is very limited compared to the complexity of the underlying network. We present a technique addressing this problem through focussing on a more limited problem: inferring direct targets of a transcription factor from short expression time series. The method is based on combining Gaussian process priors and ordinary differential equation models allowing inference on limited potentially unevenly sampled data. The method is implemented as an R/Bioconductor package, and it is demonstrated by ranking candidate targets of the p53 tumour suppressor.


Assuntos
Biologia Computacional/métodos , Mineração de Dados/métodos , Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes , Fatores de Transcrição/genética , Regulação da Expressão Gênica , Genoma Humano , Humanos , Modelos Estatísticos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Análise de Sequência de RNA/métodos , Software , Fatores de Transcrição/metabolismo , Proteína Supressora de Tumor p53/genética , Proteína Supressora de Tumor p53/metabolismo
13.
BMC Syst Biol ; 6: 53, 2012 May 30.
Artigo em Inglês | MEDLINE | ID: mdl-22647244

RESUMO

BACKGROUND: Complete transcriptional regulatory network inference is a huge challenge because of the complexity of the network and sparsity of available data. One approach to make it more manageable is to focus on the inference of context-specific networks involving a few interacting transcription factors (TFs) and all of their target genes. RESULTS: We present a computational framework for Bayesian statistical inference of target genes of multiple interacting TFs from high-throughput gene expression time-series data. We use ordinary differential equation models that describe transcription of target genes taking into account combinatorial regulation. The method consists of a training and a prediction phase. During the training phase we infer the unobserved TF protein concentrations on a subnetwork of approximately known regulatory structure. During the prediction phase we apply Bayesian model selection on a genome-wide scale and score all alternative regulatory structures for each target gene. We use our methodology to identify targets of five TFs regulating Drosophila melanogaster mesoderm development. We find that confident predicted links between TFs and targets are significantly enriched for supporting ChIP-chip binding events and annotated TF-gene interations. Our method statistically significantly outperforms existing alternatives. CONCLUSIONS: Our results show that it is possible to infer regulatory links between multiple interacting TFs and their target genes even from a single relatively short time series and in presence of unmodelled confounders and unreliable prior knowledge on training network connectivity. Introducing data from several different experimental perturbations significantly increases the accuracy.


Assuntos
Biologia Computacional/métodos , Fatores de Transcrição/metabolismo , Transcriptoma , Animais , Teorema de Bayes , Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , Redes Reguladoras de Genes , Fatores de Tempo
14.
PLoS Comput Biol ; 8(5): e1002496, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22570605

RESUMO

Meiosis is the cell division that halves the genetic component of diploid cells to form gametes or spores. To achieve this, meiotic cells undergo a radical spatial reorganisation of chromosomes. This reorganisation is a prerequisite for the pairing of parental homologous chromosomes and the reductional division, which halves the number of chromosomes in daughter cells. Of particular note is the change from a centromere clustered layout (Rabl configuration) to a telomere clustered conformation (bouquet stage). The contribution of the bouquet structure to homologous chromosome pairing is uncertain. We have developed a new in silico model to represent the chromosomes of Saccharomyces cerevisiae in space, based on a worm-like chain model constrained by attachment to the nuclear envelope and clustering forces. We have asked how these constraints could influence chromosome layout, with particular regard to the juxtaposition of homologous chromosomes and potential nonallelic, ectopic, interactions. The data support the view that the bouquet may be sufficient to bring short chromosomes together, but the contribution to long chromosomes is less. We also find that persistence length is critical to how much influence the bouquet structure could have, both on pairing of homologues and avoiding contacts with heterologues. This work represents an important development in computer modeling of chromosomes, and suggests new explanations for why elucidating the functional significance of the bouquet by genetics has been so difficult.


Assuntos
Pareamento Cromossômico/genética , Cromossomos Fúngicos/genética , Meiose/genética , Modelos Genéticos , Saccharomyces cerevisiae/genética , Telômero/genética , Animais , Humanos , Modelos Químicos , Modelos Moleculares , Saccharomyces cerevisiae/química , Saccharomyces cerevisiae/ultraestrutura , Homologia de Sequência do Ácido Nucleico , Relação Estrutura-Atividade , Telômero/química , Telômero/ultraestrutura
15.
PLoS Comput Biol ; 8(1): e1002330, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22241974

RESUMO

Expression quantitative trait loci (eQTL) studies are an integral tool to investigate the genetic component of gene expression variation. A major challenge in the analysis of such studies are hidden confounding factors, such as unobserved covariates or unknown subtle environmental perturbations. These factors can induce a pronounced artifactual correlation structure in the expression profiles, which may create spurious false associations or mask real genetic association signals. Here, we report PANAMA (Probabilistic ANAlysis of genoMic dAta), a novel probabilistic model to account for confounding factors within an eQTL analysis. In contrast to previous methods, PANAMA learns hidden factors jointly with the effect of prominent genetic regulators. As a result, this new model can more accurately distinguish true genetic association signals from confounding variation. We applied our model and compared it to existing methods on different datasets and biological systems. PANAMA consistently performs better than alternative methods, and finds in particular substantially more trans regulators. Importantly, our approach not only identifies a greater number of associations, but also yields hits that are biologically more plausible and can be better reproduced between independent studies. A software implementation of PANAMA is freely available online at http://ml.sheffield.ac.uk/qtl/.


Assuntos
Algoritmos , Mapeamento Cromossômico/métodos , Regulação da Expressão Gênica/genética , Variação Genética/genética , Modelos Genéticos , Modelos Estatísticos , Locos de Características Quantitativas/genética , Animais , Simulação por Computador , Fatores de Confusão Epidemiológicos , Interpretação Estatística de Dados , Humanos , Sensibilidade e Especificidade
16.
BMC Bioinformatics ; 12: 180, 2011 May 20.
Artigo em Inglês | MEDLINE | ID: mdl-21599902

RESUMO

BACKGROUND: The analysis of gene expression from time series underpins many biological studies. Two basic forms of analysis recur for data of this type: removing inactive (quiet) genes from the study and determining which genes are differentially expressed. Often these analysis stages are applied disregarding the fact that the data is drawn from a time series. In this paper we propose a simple model for accounting for the underlying temporal nature of the data based on a Gaussian process. RESULTS: We review Gaussian process (GP) regression for estimating the continuous trajectories underlying in gene expression time-series. We present a simple approach which can be used to filter quiet genes, or for the case of time series in the form of expression ratios, quantify differential expression. We assess via ROC curves the rankings produced by our regression framework and compare them to a recently proposed hierarchical Bayesian model for the analysis of gene expression time-series (BATS). We compare on both simulated and experimental data showing that the proposed approach considerably outperforms the current state of the art. CONCLUSIONS: Gaussian processes offer an attractive trade-off between efficiency and usability for the analysis of microarray time series. The Gaussian process framework offers a natural way of handling biological replicates and missing values and provides confidence intervals along the estimated curves of gene expression. Therefore, we believe Gaussian processes should be a standard tool in the analysis of gene expression time series.


Assuntos
Perfilação da Expressão Gênica , Análise de Regressão , Software , Teorema de Bayes , Distribuição Normal , Análise de Sequência com Séries de Oligonucleotídeos
17.
Bioinformatics ; 27(7): 1026-7, 2011 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-21300702

RESUMO

UNLABELLED: tigre is an R/Bioconductor package for inference of transcription factor activity and ranking candidate target genes from gene expression time series. The underlying methodology is based on Gaussian process inference on a differential equation model that allows the use of short, unevenly sampled, time series. The method has been designed with efficient parallel implementation in mind, and the package supports parallel operation even without additional software. AVAILABILITY: The tigre package is included in Bioconductor since release 2.6 for R 2.11. The package and a user's guide are available at http://www.bioconductor.org.


Assuntos
Software , Fatores de Transcrição/metabolismo , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Análise de Sequência com Séries de Oligonucleotídeos
18.
Bioinformatics ; 26(20): 2635-6, 2010 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-20739311

RESUMO

SUMMARY: TFInfer is a novel open access, standalone tool for genome-wide inference of transcription factor activities from gene expression data. Based on an earlier MATLAB version, the software has now been extended in a number of ways. It has been significantly optimised in terms of performance, and it was given novel functionality, by allowing the user to model both time series and data from multiple independent conditions. With a full documentation and intuitive graphical user interface, together with an in-built data base of yeast and Escherichia coli transcription factors, the software does not require any mathematical or computational expertise to be used effectively. AVAILABILITY: http://homepages.inf.ed.ac.uk/gsanguin/TFInfer.html CONTACT: gsanguin@staffmail.ed.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Modelos Estatísticos , Software , Fatores de Transcrição/química , Biologia Computacional , Bases de Dados Factuais , Escherichia coli/metabolismo , Expressão Gênica , Fatores de Transcrição/metabolismo , Leveduras/metabolismo
19.
Proc Natl Acad Sci U S A ; 107(17): 7793-8, 2010 Apr 27.
Artigo em Inglês | MEDLINE | ID: mdl-20385836

RESUMO

We present a computational method for identifying potential targets of a transcription factor (TF) using wild-type gene expression time series data. For each putative target gene we fit a simple differential equation model of transcriptional regulation, and the model likelihood serves as a score to rank targets. The expression profile of the TF is modeled as a sample from a Gaussian process prior distribution that is integrated out using a nonparametric Bayesian procedure. This results in a parsimonious model with relatively few parameters that can be applied to short time series datasets without noticeable overfitting. We assess our method using genome-wide chromatin immunoprecipitation (ChIP-chip) and loss-of-function mutant expression data for two TFs, Twist, and Mef2, controlling mesoderm development in Drosophila. Lists of top-ranked genes identified by our method are significantly enriched for genes close to bound regions identified in the ChIP-chip data and for genes that are differentially expressed in loss-of-function mutants. Targets of Twist display diverse expression profiles, and in this case a model-based approach performs significantly better than scoring based on correlation with TF expression. Our approach is found to be comparable or superior to ranking based on mutant differential expression scores. Also, we show how integrating complementary wild-type spatial expression data can further improve target ranking performance.


Assuntos
Proteínas de Drosophila/metabolismo , Regulação da Expressão Gênica/genética , Redes Reguladoras de Genes/genética , Modelos Genéticos , Fatores de Regulação Miogênica/metabolismo , Biologia de Sistemas/métodos , Proteína 1 Relacionada a Twist/metabolismo , Teorema de Bayes , Imunoprecipitação da Cromatina , Regulação da Expressão Gênica/fisiologia , Funções Verossimilhança , Mutação/genética
20.
J Physiol ; 588(Pt 1): 187-99, 2010 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-19917569

RESUMO

Mammalian cochlear inner hair cells (IHCs) are specialized to process developmental signals during immature stages and sound stimuli in adult animals. These signals are conveyed onto auditory afferent nerve fibres. Neurotransmitter release at IHC ribbon synapses is controlled by L-type Ca(V)1.3 Ca(2+) channels, the biophysics of which are still unknown in native mammalian cells. We have investigated the localization and elementary properties of Ca(2+) channels in immature mouse IHCs under near-physiological recording conditions. Ca(V)1.3 Ca(2+) channels at the cell pre-synaptic site co-localize with about half of the total number of ribbons present in immature IHCs. These channels activated at about 70 mV, showed a relatively short first latency and weak inactivation, which would allow IHCs to generate and accurately encode spontaneous Ca(2+) action potential activity characteristic of these immature cells. The Ca(V)1.3 Ca(2+) channels showed a very low open probability (about 0.15 at 20 mV: near the peak of an action potential). Comparison of elementary and macroscopic Ca(2+) currents indicated that very few Ca(2+) channels are associated with each docked vesicle at IHC ribbon synapses. Finally, we found that the open probability of Ca(2+) channels, but not their opening time, was voltage dependent. This finding provides a possible correlation between presynaptic Ca(2+) channel properties and the characteristic frequency/amplitude of EPSCs in auditory afferent fibres.


Assuntos
Potenciais de Ação/fisiologia , Canais de Cálcio Tipo L/fisiologia , Sinalização do Cálcio/fisiologia , Cálcio/metabolismo , Células Ciliadas Auditivas Internas/fisiologia , Ativação do Canal Iônico/fisiologia , Potenciais da Membrana/fisiologia , Animais , Células Cultivadas , Camundongos , Camundongos Endogâmicos C57BL
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...