Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
PLoS One ; 18(7): e0287734, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37418392

RESUMO

In this work, we develop a new set of Bayesian models to perform registration of real-valued functions. A Gaussian process prior is assigned to the parameter space of time warping functions, and a Markov chain Monte Carlo (MCMC) algorithm is utilized to explore the posterior distribution. While the proposed model can be defined on the infinite-dimensional function space in theory, dimension reduction is needed in practice because one cannot store an infinite-dimensional function on the computer. Existing Bayesian models often rely on some pre-specified, fixed truncation rule to achieve dimension reduction, either by fixing the grid size or the number of basis functions used to represent a functional object. In comparison, the new models in this paper randomize the truncation rule. Benefits of the new models include the ability to make inference on the smoothness of the functional parameters, a data-informative feature of the truncation rule, and the flexibility to control the amount of shape-alteration in the registration process. For instance, using both simulated and real data, we show that when the observed functions exhibit more local features, the posterior distribution on the warping functions automatically concentrates on a larger number of basis functions. Supporting materials including code and data to perform registration and reproduce some of the results presented herein are available online.


Assuntos
Algoritmos , Teorema de Bayes , Cadeias de Markov , Método de Monte Carlo
2.
PLoS One ; 18(6): e0286624, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37267337

RESUMO

Advances in observational and computational assets have led to revolutions in the range and quality of results in many science and engineering settings. However, those advances have led to needs for new research in treating model errors and assessing their impacts. We consider two settings. The first involves physically-based statistical models that are sufficiently manageable to allow incorporation of a stochastic "model error process". In the second case we consider large-scale models in which incorporation of a model error process and updating its distribution is impractical. Our suggestion is to treat dimension-reduced model output as if it is observational data, with a data model that incorporates a bias component to represent the impacts of model error. We believe that our suggestions are valuable quantitative, yet relatively simple, ways to extract useful information from models while including adjustment for model error. These ideas are illustrated and assessed using an application inspired by a classical oceanographic problem.


Assuntos
Engenharia , Modelos Estatísticos , Teorema de Bayes , Viés , Processos Estocásticos
3.
Stat Appl Genet Mol Biol ; 17(3)2018 06 06.
Artigo em Inglês | MEDLINE | ID: mdl-29874197

RESUMO

The increasing availability of population-level allele frequency data across one or more related populations necessitates the development of methods that can efficiently estimate population genetics parameters, such as the strength of selection acting on the population(s), from such data. Existing methods for this problem in the setting of the Wright-Fisher diffusion model are primarily likelihood-based, and rely on numerical approximation for likelihood computation and on bootstrapping for assessment of variability in the resulting estimates, requiring extensive computation. Recent work has provided a method for obtaining exact samples from general Wright-Fisher diffusion processes, enabling the development of methods for Bayesian estimation in this setting. We develop and implement a Bayesian method for estimating the strength of selection based on the Wright-Fisher diffusion for data sampled at a single time point. The method utilizes the latest algorithms for exact sampling to devise a Markov chain Monte Carlo procedure to draw samples from the joint posterior distribution of the selection coefficient and the allele frequencies. We demonstrate that when assumptions about the initial allele frequencies are accurate the method performs well for both simulated data and for an empirical data set on hypoxia in flies, where we find evidence for strong positive selection in a region of chromosome 2L previously identified. We discuss possible extensions of our method to the more general settings commonly encountered in practice, highlighting the advantages of Bayesian approaches to inference in this setting.


Assuntos
Teorema de Bayes , Frequência do Gene , Genética Populacional , Modelos Genéticos , Algoritmos , Animais , Drosophila melanogaster/genética , Hipóxia/genética , Funções Verossimilhança , Cadeias de Markov , Método de Monte Carlo , Polimorfismo de Nucleotídeo Único
4.
PLoS One ; 12(3): e0173453, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28301529

RESUMO

We examine the performance of a strategy for Markov chain Monte Carlo (MCMC) developed by simulating a discrete approximation to a stochastic differential equation (SDE). We refer to the approach as diffusion MCMC. A variety of motivations for the approach are reviewed in the context of Bayesian analysis. In particular, implementation of diffusion MCMC is very simple to set-up, even in the presence of nonlinear models and non-conjugate priors. Also, it requires comparatively little problem-specific tuning. We implement the algorithm and assess its performance for both a test case and a glaciological application. Our results demonstrate that in some settings, diffusion MCMC is a faster alternative to a general Metropolis-Hastings algorithm.


Assuntos
Cadeias de Markov , Método de Monte Carlo , Teorema de Bayes , Processos Estocásticos
5.
PLoS One ; 11(7): e0159038, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27441705

RESUMO

The McMurdo Dry Valleys constitute the largest ice free area of Antarctica. The area is a polar desert with an annual precipitation of ∼ 3 cm water equivalent, but contains several lakes fed by glacial melt water streams that flow from four to twelve weeks of the year. Over the past ∼20 years, data have been collected on the lakes located in Taylor Valley, Antarctica as part of the McMurdo Dry Valley Long-Term Ecological Research program (MCM-LTER). This work aims to understand the impact of climate variations on the biological processes in all the ecosystem types within Taylor Valley, including the lakes. These lakes are stratified, closed-basin systems and are perennially covered with ice. Each lake contains a variety of planktonic and benthic algae that require nutrients for photosynthesis and growth. The work presented here focuses on Lake Fryxell, one of the three main lakes of Taylor Valley; it is fed by thirteen melt-water streams. We use a functional regression approach to link the physical, chemical, and biological processes within the stream-lake system to evaluate the input of water and nutrients on the biological processes in the lakes. The technique has been shown previously to provide important insights into these Antarctic lacustrine systems where data acquisition is not temporally coherent. We use data on primary production (PPR) and chlorophyll-A (CHL)from Lake Fryxell as well as discharge observations from two streams flowing into the lake. Our findings show an association between both PPR, CHL and stream input.


Assuntos
Ecossistema , Hidrologia , Lagos , Regiões Antárticas , Clorofila/análise , Clorofila A , Geografia , Nitrogênio/análise , Fosfatos/análise , Análise de Regressão , Rios , Solubilidade , Fatores de Tempo
6.
Mol Phylogenet Evol ; 94(Pt A): 290-7, 2016 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-26358614

RESUMO

The quality of phylogenetic inference made from protein-coding genes depends, in part, on the realism with which the codon substitution process is modeled. Here we propose a new mechanistic model that combines the standard M0 substitution model of Yang (1997) with a simplified model from Gilchrist (2007) that includes selection on synonymous substitutions as a function of codon-specific nonsense error rates. We tested the newly proposed model by applying it to 104 protein-coding genes in brewer's yeast, and compared the fit of the new model to the standard M0 model and to the mutation-selection model of Yang and Nielsen (2008) using the AIC. Our new model provided significantly better fit in approximately 85% of the cases considered for the basic M0 model and in approximately 25% of the cases for the M0 model with estimated codon frequencies, but only in a few cases when the mutation-selection model was considered. However, our model includes a parameter that can be interpreted as a measure of the rate of protein production, and the estimates of this parameter were highly correlated with an independent measure of protein production for the yeast genes considered here. Finally, we found that in some cases the new model led to the preference of a different phylogeny for a subset of the genes considered, indicating that substitution model choice may have an impact on the estimated phylogeny.


Assuntos
Códon/genética , Código Genético , Modelos Genéticos , Seleção Genética , Genes Fúngicos , Nucleotídeos/genética , Filogenia , Mutação Puntual , Saccharomyces cerevisiae/classificação , Proteínas de Saccharomyces cerevisiae
7.
Math Biosci ; 268: 9-21, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26256054

RESUMO

One of the fundamental goals in phylogenetics is to make inferences about the evolutionary pattern among a group of individuals, such as genes or species, using present-day genetic material. This pattern is represented by a phylogenetic tree, and as computational methods have caught up to the statistical theory, Bayesian methods of making inferences about phylogenetic trees have become increasingly popular. Bayesian inference of phylogenetic trees requires sampling from intractable probability distributions. Common methods of sampling from these distributions include Markov chain Monte Carlo (MCMC) and Sequential Monte Carlo (SMC) methods, and one way that both of these methods can proceed is by first simulating a tree topology and then taking a sample from the posterior distribution of the branch lengths given the tree topology and the data set. In many MCMC methods, it is difficult to verify that the underlying Markov chain is geometrically ergodic, and thus, it is necessary to rely on output-based convergence diagnostics in order to assess convergence on an ad hoc basis. These diagnostics suffer from several important limitations, so in an effort to circumvent these limitations, this work establishes geometric convergence for a particular Markov chain that is used to sample branch lengths under a fairly general class of nucleotide substitution models and provides a numerical method for estimating the time this Markov chain takes to converge.


Assuntos
Teorema de Bayes , Modelos Teóricos , Filogenia
8.
Stat Appl Genet Mol Biol ; 12(1): 39-48, 2013 Mar 26.
Artigo em Inglês | MEDLINE | ID: mdl-23459470

RESUMO

Markov chains are widely used for modeling in many areas of molecular biology and genetics. As the complexity of such models advances, it becomes increasingly important to assess the rate at which a Markov chain converges to its stationary distribution in order to carry out accurate inference. A common measure of convergence to the stationary distribution is the total variation distance, but this measure can be difficult to compute when the state space of the chain is large. We propose a Monte Carlo method to estimate the total variation distance that can be applied in this situation, and we demonstrate how the method can be efficiently implemented by taking advantage of GPU computing techniques. We apply the method to two Markov chains on the space of phylogenetic trees, and discuss the implications of our findings for the development of algorithms for phylogenetic inference.


Assuntos
Simulação por Computador , Cadeias de Markov , Modelos Genéticos , Método de Monte Carlo , Algoritmos , Evolução Molecular , Variação Genética , Modelos Estatísticos , Filogenia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...