Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 7 de 7
Filter
Add more filters










Database
Language
Publication year range
1.
Science ; 317(5845): 1756-60, 2007 Sep 21.
Article in English | MEDLINE | ID: mdl-17885136

ABSTRACT

Parasitic nematodes that cause elephantiasis and river blindness threaten hundreds of millions of people in the developing world. We have sequenced the approximately 90 megabase (Mb) genome of the human filarial parasite Brugia malayi and predict approximately 11,500 protein coding genes in 71 Mb of robustly assembled sequence. Comparative analysis with the free-living, model nematode Caenorhabditis elegans revealed that, despite these genes having maintained little conservation of local synteny during approximately 350 million years of evolution, they largely remain in linkage on chromosomal units. More than 100 conserved operons were identified. Analysis of the predicted proteome provides evidence for adaptations of B. malayi to niches in its human and vector hosts and insights into the molecular basis of a mutualistic relationship with its Wolbachia endosymbiont. These findings offer a foundation for rational drug design.


Subject(s)
Brugia malayi/genetics , Genome, Helminth , Animals , Brugia malayi/physiology , Caenorhabditis/genetics , Drosophila melanogaster/genetics , Drug Resistance/genetics , Filariasis/parasitology , Humans , Molecular Sequence Data
2.
BMC Bioinformatics ; 7: 126, 2006 Mar 10.
Article in English | MEDLINE | ID: mdl-16529652

ABSTRACT

BACKGROUND: Determining whether a gene is differentially expressed in two different samples remains an important statistical problem. Prior work in this area has featured the use of t-tests with pooled estimates of the sample variance based on similarly expressed genes. These methods do not display consistent behavior across the entire range of pooling and can be biased when the prior hyperparameters are specified heuristically. RESULTS: A two-sample Bayesian t-test is proposed for use in determining whether a gene is differentially expressed in two different samples. The test method is an extension of earlier work that made use of point estimates for the variance. The method proposed here explicitly calculates in analytic form the marginal distribution for the difference in the mean expression of two samples, obviating the need for point estimates of the variance without recourse to posterior simulation. The prior distribution involves a single hyperparameter that can be calculated in a statistically rigorous manner, making clear the connection between the prior degrees of freedom and prior variance. CONCLUSION: The test is easy to understand and implement and application to both real and simulated data shows that the method has equal or greater power compared to the previous method and demonstrates consistent Type I error rates. The test is generally applicable outside the microarray field to any situation where prior information about the variance is available and is not limited to cases where estimates of the variance are based on many similar observations.


Subject(s)
Algorithms , Data Interpretation, Statistical , Gene Expression Profiling/methods , Models, Genetic , Oligonucleotide Array Sequence Analysis/methods , Bayes Theorem , Computer Simulation , Logistic Models , Models, Statistical
3.
Bioinformatics ; 21 Suppl 1: i126-35, 2005 Jun.
Article in English | MEDLINE | ID: mdl-15961449

ABSTRACT

MOTIVATION: The evolution of protein sequences is constrained by complex interactions between amino acid residues. Because harmful substitutions may be compensated for by other substitutions at neighboring sites, residues can coevolve. We describe a Bayesian phylogenetic approach to the detection of coevolving residues in protein families. This method, Bayesian mutational mapping (BMM), assigns mutations to the branches of the evolutionary tree stochastically, and then test statistics are calculated to determine whether a coevolutionary signal exists in the mapping. Posterior predictive P-values provide an estimate of significance, and specificity is maintained by integrating over uncertainty in the estimation of the tree topology, branch lengths and substitution rates. A coevolutionary Markov model for codon substitution is also described, and this model is used as the basis of several test statistics. RESULTS: Results on simulated coevolutionary data indicate that the BMM method can successfully detect nearly all coevolving sites when the model has been correctly specified, and that non-parametric statistics such as mutual information are generally less powerful than parametric statistics. On a dataset of eukaryotic proteins from the phosphoglycerate kinase (PGK) family, interdomain site contacts yield a significantly greater coevolutionary signal than interdomain non-contacts, an indication that the method provides information about interacting sites. Failure to account for the heterogeneity in rates across sites in PGK resulted in a less discriminating test, yielding a marked increase in the number of reported positives at both contact and non-contact sites. SUPPLEMENTARY INFORMATION: http://www.dimmic.net/supplement/


Subject(s)
Amino Acids/chemistry , Chromosome Mapping/methods , Computational Biology/methods , DNA Mutational Analysis , Bayes Theorem , Binding Sites , Evolution, Molecular , Likelihood Functions , Markov Chains , Models, Statistical , Multigene Family , Mutation , Phosphoglycerate Kinase/genetics
5.
Biochemistry ; 42(49): 14522-31, 2003 Dec 16.
Article in English | MEDLINE | ID: mdl-14661965

ABSTRACT

G-Protein-coupled receptors (GPCRs) are an important superfamily of transmembrane proteins involved in cellular communication. Recently, it has been shown that dimerization is a widely occurring phenomenon in the GPCR superfamily, with likely important physiological roles. Here we use a novel hidden-site class model of evolution as a sequence analysis tool to predict possible dimerization interfaces in GPCRs. This model aims to simulate the evolution of proteins at the amino acid level, allowing the analysis of their sequences in an explicitly evolutionary context. Applying this model to aminergic GPCR sequences, we first validate the general reasoning behind the model. We then use the model to perform a family specific analysis of GPCRs. Accounting for the family structure of these proteins, this approach detects different evolutionarily conserved and accessible patches on transmembrane (TM) helices 4-6 in different families. On the basis of these findings, we propose an experimentally testable dimerization mechanism, involving interactions among different combinations of these helices in different families of aminergic GPCRs.


Subject(s)
Evolution, Molecular , Models, Molecular , Receptors, Biogenic Amine/chemistry , Receptors, Biogenic Amine/classification , Receptors, G-Protein-Coupled/chemistry , Receptors, G-Protein-Coupled/classification , Amino Acid Substitution/genetics , Animals , Computational Biology/methods , Computational Biology/statistics & numerical data , Dimerization , Humans , Models, Chemical , Models, Statistical , Multigene Family , Probability , Receptors, Biogenic Amine/genetics , Receptors, G-Protein-Coupled/genetics , Sequence Analysis, Protein/methods , Sequence Analysis, Protein/statistics & numerical data
6.
J Mol Evol ; 55(1): 65-73, 2002 Jul.
Article in English | MEDLINE | ID: mdl-12165843

ABSTRACT

Retroviral and other reverse transcriptase (RT)-containing sequences may be subject to unique evolutionary pressures, and models of molecular sequence evolution developed using other kinds of sequences may not be optimal. Here we develop and present a new substitution matrix for maximum likelihood (ML) phylogenetic analysis which has been optimized on a dataset of 33 amino acid sequences from the retroviral Pol proteins. When compared to other matrices, this model (rtREV) yields higher log-likelihood values on a range of datasets including lentiviruses, spumaviruses, betaretroviruses, gammaretroviruses, and other elements containing reverse transcriptase. We provide evidence that rtREV is a more realistic evolutionary model for analyses of the pol gene, although it is inapplicable to analyses involving the gag gene.


Subject(s)
Phylogeny , RNA-Directed DNA Polymerase/genetics , Retroviridae/enzymology , Retroviridae/genetics , Amino Acid Substitution , Evolution, Molecular , Genetic Techniques , Likelihood Functions , Retroviridae/classification
7.
Pac Symp Biocomput ; : 625-36, 2002.
Article in English | MEDLINE | ID: mdl-11928514

ABSTRACT

A novel method to analyze evolutionary change is presented and its application to the analysis of sequence data is discussed. The investigated method uses phylogenetic trees of related proteins with an evolutionary model in order to gain insight about protein structure and function. The evolutionary model, based on amino acid substitutions, contains adjustable parameters related to amino acid and sequence properties. A maximum likelihood approach is used with a phylogenetic tree to optimize these parameters. The model is applied to a set of Muscarinic receptors, members of the G-protein coupled receptor family. Here we show that the optimized parameters of the model are able to highlight the general structural features of these receptors.


Subject(s)
Biological Evolution , GTP-Binding Proteins/genetics , Receptors, Cell Surface/chemistry , Receptors, Cell Surface/genetics , Animals , Humans , Likelihood Functions , Models, Genetic , Models, Molecular , Protein Conformation
SELECTION OF CITATIONS
SEARCH DETAIL
...