Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 35
Filter
Add more filters










Publication year range
1.
Physiol Genomics ; 49(12): 747-762, 2017 12 01.
Article in English | MEDLINE | ID: mdl-29093195

ABSTRACT

High-fat diet (HFD) during lactation alters milk composition and is associated with development of metabolic diseases in the offspring. We hypothesized that HFD affects milk microRNA (miRNA) and mRNA content, which potentially impact offspring development. Our objective was to determine the effect of maternal HFD on secreted milk transcriptome. To meet this objective, 4 wk old female ICR mice were divided into two treatments: control diet containing 10% kcal fat and HFD containing 60% kcal fat. After 4 wk on CD or HFD, mice were bred while continuously fed the same diets. On postnatal day 2 (P2), litters were normalized to 10 pups, and half the pups in each litter were cross-fostered between treatments. Milk was collected from dams on P10 and P12. Total RNA was isolated from milk fat fraction of P10 samples and used for mRNA-Seq and small RNA-Seq. P12 milk was used to determine macronutrient composition. After 4 wk of prepregnancy feeding HFD mice weighed significantly more than did the control mice. Lactose and fat concentration were significantly ( P < 0.05) higher in milk of HFD dams. Pup weight was significantly greater ( P < 0.05) in groups suckled by HFD vs. control dams. There were 25 miRNA and over 1,500 mRNA differentially expressed (DE) in milk of HFD vs. control dams. DE mRNA and target genes of DE miRNA enriched categories that were primarily related to multicellular organismal development. Maternal HFD impacts mRNA and miRNA content of milk, if bioactive nucleic acids are absorbed by neonate differences may affect development.


Subject(s)
Diet, High-Fat/adverse effects , Milk/metabolism , Animals , Fats/analysis , Female , Lactation/genetics , Lactation/physiology , Lactose/analysis , Mice , RNA, Messenger/genetics , Transcriptome/genetics , Transcriptome/physiology
2.
Semin Cell Dev Biol ; 35: 109-23, 2014 Nov.
Article in English | MEDLINE | ID: mdl-25016297

ABSTRACT

Mathematical modeling of developmental signaling networks has played an increasingly important role in the identification of regulatory mechanisms by providing a sandbox for hypothesis testing and experiment design. Whether these models consist of an equation with a few parameters or dozens of equations with hundreds of parameters, a prerequisite to model-based discovery is to bring simulated behavior into agreement with observed data via parameter estimation. These parameters provide insight into the system (e.g., enzymatic rate constants describe enzyme properties). Depending on the nature of the model fit desired - from qualitative (relative spatial positions of phosphorylation) to quantitative (exact agreement of spatial position and concentration of gene products) - different measures of data-model mismatch are used to estimate different parameter values, which contain different levels of usable information and/or uncertainty. To facilitate the adoption of modeling as a tool for discovery alongside other tools such as genetics, immunostaining, and biochemistry, careful consideration needs to be given to how well a model fits the available data, what the optimized parameter values mean in a biological context, and how the uncertainty in model parameters and predictions plays into experiment design. The core discussion herein pertains to the quantification of model-to-data agreement, which constitutes the first measure of a model's performance and future utility to the problem at hand. Integration of this experimental data and the appropriate choice of objective measures of data-model agreement will continue to drive modeling forward as a tool that contributes to experimental discovery. The Drosophila melanogaster gap gene system, in which model parameters are optimized against in situ immunofluorescence intensities, demonstrates the importance of error quantification, which is applicable to a wide array of developmental modeling studies.


Subject(s)
Drosophila melanogaster/genetics , Homeodomain Proteins/genetics , Models, Genetic , Morphogenesis/genetics , Signal Transduction/genetics , Trans-Activators/genetics , Algorithms , Animals , Computer Simulation , Drosophila Proteins , Drosophila melanogaster/embryology , Gene Expression Regulation, Developmental , Homeodomain Proteins/metabolism , Trans-Activators/metabolism
3.
Science ; 313(5793): 1596-604, 2006 Sep 15.
Article in English | MEDLINE | ID: mdl-16973872

ABSTRACT

We report the draft genome of the black cottonwood tree, Populus trichocarpa. Integration of shotgun sequence assembly with genetic mapping enabled chromosome-scale reconstruction of the genome. More than 45,000 putative protein-coding genes were identified. Analysis of the assembled genome revealed a whole-genome duplication event; about 8000 pairs of duplicated genes from that event survived in the Populus genome. A second, older duplication event is indistinguishably coincident with the divergence of the Populus and Arabidopsis lineages. Nucleotide substitution, tandem gene duplication, and gross chromosomal rearrangement appear to proceed substantially more slowly in Populus than in Arabidopsis. Populus has more protein-coding genes than Arabidopsis, ranging on average from 1.4 to 1.6 putative Populus homologs for each Arabidopsis gene. However, the relative frequency of protein domains in the two genomes is similar. Overrepresented exceptions in Populus include genes associated with lignocellulosic wall biosynthesis, meristem development, disease resistance, and metabolite transport.


Subject(s)
Gene Duplication , Genome, Plant , Populus/genetics , Sequence Analysis, DNA , Arabidopsis/genetics , Chromosome Mapping , Computational Biology , Evolution, Molecular , Expressed Sequence Tags , Gene Expression , Genes, Plant , Oligonucleotide Array Sequence Analysis , Phylogeny , Plant Proteins/chemistry , Plant Proteins/genetics , Polymorphism, Single Nucleotide , Populus/growth & development , Populus/metabolism , Protein Structure, Tertiary , RNA, Plant/analysis , RNA, Untranslated/analysis
4.
Pac Symp Biocomput ; : 78-88, 2004.
Article in English | MEDLINE | ID: mdl-14992494

ABSTRACT

A unique microarray approach has been developed to profile alternative splicing in the cell. To support the development of this approach, we have developed the Manually Annotated Alternatively Spliced Events (MAASE) database system, which is a unique alternative splicing information resource designed specifically with experimentalists in mind. MAASE is an online resource for the convenient access, identification, and annotation of alternative splicing events (ASEs). MAASE consists of two components: an annotation system and a curated database. The annotation system is a web-based workspace that combines manual and computational approaches to identifying and annotating ASEs, a combination that is vital if a comprehensive collection is to be obtained. The annotation system is publicly available and provides a scalable solution to acquiring as well as contributing to annotated ASEs. MAASE annotated ASEs are deposited into the database component, which can either be queried one entry at a time or multiple entries at a time with convenient access to alternatively spliced junctional and surrounding sequences to facilitate the design of microarray experiments.


Subject(s)
Alternative Splicing , Computational Biology , Databases, Nucleic Acid , Database Management Systems , Oligonucleotide Array Sequence Analysis/statistics & numerical data , Software Design
5.
Trends Plant Sci ; 6(10): 448-9, 2001 Oct.
Article in English | MEDLINE | ID: mdl-11590042

ABSTRACT

Genome-scale sequencing projects have provided the essential information required for the construction of entire genome chips or microarrays for RNA expression studies. The Arabidopsis and rice genomes have been sequenced and whole-genome oligonucleotide arrays are being manufactured. These should soon become available to researchers. Expression studies using genomic-scale expression arrays are providing us with a vast quantity of information at a rapid pace. The rate-limiting step in this type of experiments is not the data generation step but rather the data analysis component of experiments. We report improvements that should facilitate the analysis of Affymetrix Genechip expression data.


Subject(s)
Arabidopsis/genetics , Database Management Systems , Databases, Factual , Genome, Plant , Oryza/genetics , Plant Proteins/genetics
6.
Plant Physiol ; 126(4): 1646-67, 2001 Aug.
Article in English | MEDLINE | ID: mdl-11500563

ABSTRACT

Uptake and translocation of cationic nutrients play essential roles in physiological processes including plant growth, nutrition, signal transduction, and development. Approximately 5% of the Arabidopsis genome appears to encode membrane transport proteins. These proteins are classified in 46 unique families containing approximately 880 members. In addition, several hundred putative transporters have not yet been assigned to families. In this paper, we have analyzed the phylogenetic relationships of over 150 cation transport proteins. This analysis has focused on cation transporter gene families for which initial characterizations have been achieved for individual members, including potassium transporters and channels, sodium transporters, calcium antiporters, cyclic nucleotide-gated channels, cation diffusion facilitator proteins, natural resistance-associated macrophage proteins (NRAMP), and Zn-regulated transporter Fe-regulated transporter-like proteins. Phylogenetic trees of each family define the evolutionary relationships of the members to each other. These families contain numerous members, indicating diverse functions in vivo. Closely related isoforms and separate subfamilies exist within many of these gene families, indicating possible redundancies and specialized functions. To facilitate their further study, the PlantsT database (http://plantst.sdsc.edu) has been created that includes alignments of the analyzed cation transporters and their chromosomal locations.


Subject(s)
Arabidopsis/genetics , Carrier Proteins/genetics , Cation Transport Proteins , Ion Channels/genetics , Antiporters/classification , Antiporters/genetics , Arabidopsis/classification , Biological Transport, Active , Carrier Proteins/classification , Carrier Proteins/metabolism , Cations , Chromosome Mapping , Cyclic Nucleotide-Gated Cation Channels , Ion Channels/classification , Ion Transport/genetics , Membrane Proteins/metabolism , Phylogeny , Potassium/metabolism
7.
Genome Res ; 11(6): 1114-25, 2001 Jun.
Article in English | MEDLINE | ID: mdl-11381037

ABSTRACT

We performed a systematic analysis of 929 human disease gene entries associated with at least one mutant allele in the Online Mendelian Inheritance in Man (OMIM) database against the recently completed genome sequence of Drosophila melanogaster. The results of this search have been formatted as an updateable and searchable on-line database called Homophila. Our analysis identified 714 distinct human disease genes (77% of disease genes searched) matching 548 unique Drosophila sequences, which we have summarized by disease category. This breakdown into disease classes creates a picture of disease genes that are amenable to study using Drosophila as the model organism. Of the 548 Drosophila genes related to human disease genes, 153 are associated with known mutant alleles and 56 more are tagged by P-element insertions in or near the gene. Examples of how to use the database to identify Drosophila genes related to human disease genes are presented. We anticipate that cross-genomic analysis of human disease genes using the power of Drosophila second-site modifier screens will promote interaction between human and Drosophila research groups, accelerating the understanding of the pathogenesis of human genetic disease. The Homophila database is available at http://homophila.sdsc.edu.


Subject(s)
Drosophila melanogaster/genetics , Genes, Insect/genetics , Genetic Diseases, Inborn/genetics , Amino Acid Sequence/genetics , Animals , Computational Biology/methods , Conserved Sequence/genetics , Databases, Factual , Humans , Molecular Sequence Data , Sequence Homology, Nucleic Acid , Signal Transduction/genetics
8.
Bioinformatics ; 17(4): 319-26, 2001 Apr.
Article in English | MEDLINE | ID: mdl-11301300

ABSTRACT

MOTIVATION: High-density microarray technology permits the quantitative and simultaneous monitoring of thousands of genes. The interpretation challenge is to extract relevant information from this large amount of data. A growing variety of statistical analysis approaches are available to identify clusters of genes that share common expression characteristics, but provide no information regarding the biological similarities of genes within clusters. The published literature provides a potential source of information to assist in interpretation of clustering results. RESULTS: We describe a data mining method that uses indexing terms ('keywords') from the published literature linked to specific genes to present a view of the conceptual similarity of genes within a cluster or group of interest. The method takes advantage of the hierarchical nature of Medical Subject Headings used to index citations in the MEDLINE database, and the registry numbers applied to enzymes.


Subject(s)
Databases, Factual , Gene Expression Profiling , Abstracting and Indexing , Information Storage and Retrieval , MEDLINE , Oligonucleotide Array Sequence Analysis
9.
Nucleic Acids Res ; 29(1): 111-3, 2001 Jan 01.
Article in English | MEDLINE | ID: mdl-11125063

ABSTRACT

The PlantsP database is a curated database that combines information derived from sequences with experimental functional genomics information. PlantsP focuses on plant protein kinases and protein phosphatases. The database will specifically provide a resource for information on a collection of T-DNA insertion mutants (knockouts) in each protein kinase and phosphatase in Arabidopsis thaliana. PlantsP also provides a curated view of each protein that includes a comprehensive annotation of functionally related sequence motifs, sequence family definitions, alignments and phylogenetic trees, and descriptive information drawn directly from the literature. PlantsP is available at http://PlantsP.sdsc.edu.


Subject(s)
Databases, Factual , Plants/genetics , Arabidopsis/enzymology , Arabidopsis/genetics , Genome, Plant , Internet , Mutation , Phosphoprotein Phosphatases/genetics , Phosphoprotein Phosphatases/metabolism , Phosphorylation , Plants/enzymology , Protein Kinases/genetics , Protein Kinases/metabolism
10.
Bioinformatics ; 16(9): 749, 2000 Sep.
Article in English | MEDLINE | ID: mdl-11108696
12.
Trends Plant Sci ; 5(4): 154-9, 2000 Apr.
Article in English | MEDLINE | ID: mdl-10740296

ABSTRACT

Numerous stimuli can alter the Ca2+concentration in the cytoplasm, a factor common to many physiological responses in plant and animal cells. Calcium-binding proteins decode information contained in the temporal and spatial patterns of these Ca2+ signals and bring about changes in metabolism and gene expression. In addition to calmodulin, a calcium-binding protein found in all eukaryotes, plants contain a large family of calcium-binding regulatory protein kinases. Evidence is accumulating that these protein kinases participate in numerous aspects of plant growth and development.


Subject(s)
Calcium Signaling , Calcium-Calmodulin-Dependent Protein Kinases/metabolism , Substrate Specificity
13.
Bioinformatics ; 15(5): 347, 1999 May.
Article in English | MEDLINE | ID: mdl-10484667
14.
Arch Virol ; 144(6): 1139-58, 1999.
Article in English | MEDLINE | ID: mdl-10446649

ABSTRACT

The delta antigen of hepatitis delta virus exhibits sequence specific binding to its own RNA and is essential for viral replication. Using statistical methods we have detected significant similarity between the RNA-binding domain of the hepatitis delta antigen and the HMG box of SRY. Our analysis suggests that the RNA-binding domain of HDV antigen evolved from the DNA-binding domain of the HMG box. SRY, or a related protein, is a probable cellular cognate of HDV.


Subject(s)
DNA-Binding Proteins/genetics , Hepatitis Antigens/genetics , Hepatitis Delta Virus/genetics , Nuclear Proteins , Transcription Factors , Hepatitis Antigens/metabolism , Hepatitis Delta Virus/metabolism , High Mobility Group Proteins/genetics , Molecular Sequence Data , RNA, Viral/metabolism , Sequence Alignment , Sequence Analysis , Sequence Homology, Amino Acid , Sex-Determining Region Y Protein
15.
Pac Symp Biocomput ; : 118-29, 1998.
Article in English | MEDLINE | ID: mdl-9697176

ABSTRACT

Illustrations of macromolecular structure in the scientific literature contain a high level of semantic content through which the authors convey, among other features, the biological function of that macromolecule. We refer to these illustrations as molecular scenes. Such scenes, if available electronically, are not readily accessible for further interactive interrogation. The basic PDB format does not retain features of the scene; formats like PostScript retain the scene but are not interactive; and the many formats used by individual graphics programs, while capable of reproducing the scene, are neither interchangeable nor can they be stored in a database and queried for features of the scene. MICE defines a Molecular Scene Description Language (MSDL) which allows scenes to be stored in a relational database (a molecular scene gallery) and queried. Scenes retrieved from the gallery are rendered in Virtual Reality Modeling Language (VRML) and currently displayed in WebView, a VRML browser modified to support the Virtual Reality Behavior System (VRBS) protocol. VRBS provides communication between multiple client browsers, each capable of manipulating the scene. This level of collaboration works well over standard Internet connections and holds promise for collaborative research at a distance and distance learning. Further, via VRBS, the VRML world can be used as a visual cue to trigger an application such as a remote MEME search. MICE is very much work in progress. Current work seeks to replace WebView with Netscape, Cosmoplayer, a standard VRML plug-in, and a Java-based console. The console consists of a generic kernel suitable for multiple collaborative applications and additional application-specific controls. Further details of the MICE project are available at http:/(/)mice.sdsc.edu.


Subject(s)
Computational Biology , Programming Languages , Protein Conformation , Proteins/chemistry , Software , Computer Simulation , Models, Molecular , Protein Structure, Secondary , Protein Structure, Tertiary , Static Electricity , Surface Properties
16.
J Comput Biol ; 5(2): 211-21, 1998.
Article in English | MEDLINE | ID: mdl-9672829

ABSTRACT

Position-specific scoring matrices are useful for representing and searching for protein sequence motifs. A sequence family can often be described by a group of one or more motifs, and an effective search must combine the scores for matching a sequence to each of the motifs in the group. We describe three methods for combining match scores and estimating the statistical significance of the combined scores and evaluate the search quality (classification accuracy) and the accuracy of the estimate of statistical significance of each. The three methods are: 1) sum of scores, 2) sum of reduced variates, 3) product of score p-values. We show that method 3) is superior to the other two methods in both regards, and that combining motif scores indeed gives better search accuracy. The MAST sequence homology search algorithm utilizing the product of p-values scoring method is available for interactive use and downloading at URL http:/(/)www.sdsc.edu/MEME.


Subject(s)
Algorithms , Databases, Factual , Proteins , Sequence Alignment/methods , Models, Theoretical , Sequence Homology, Amino Acid , Software
17.
Bioinformatics ; 14(1): 48-54, 1998.
Article in English | MEDLINE | ID: mdl-9520501

ABSTRACT

MOTIVATION: To illustrate an intuitive and statistically valid method for combining independent sources of evidence that yields a p-value for the complete evidence, and to apply it to the problem of detecting simultaneous matches to multiple patterns in sequence homology searches. RESULTS: In sequence analysis, two or more (approximately) independent measures of the membership of a sequence (or sequence region) in some class are often available. We would like to estimate the likelihood of the sequence being a member of the class in view of all the available evidence. An example is estimating the significance of the observed match of a macromolecular sequence (DNA or protein) to a set of patterns (motifs) that characterize a biological sequence family. An intuitive way to do this is to express each piece of evidence as a p-value, and then use the product of these p-values as the measure of membership in the family. We derive a formula and algorithm (QFAST) for calculating the statistical distribution of the product of n independent p-values. We demonstrate that sorting sequences by this p-value effectively combines the information present in multiple motifs, leading to highly accurate and sensitive sequence homology searches.


Subject(s)
Mathematical Computing , Sequence Homology, Amino Acid , Sequence Homology, Nucleic Acid , Algorithms , DNA , Proteins
19.
J Comput Biol ; 4(1): 45-59, 1997.
Article in English | MEDLINE | ID: mdl-9109037

ABSTRACT

Several computer algorithms now exist for discovering multiple motifs (expressed as weight matrices) that characterize a family of protein sequences known to be homologous. This paper describes a method for performing similarity searches of protein sequence databases using such a group of motifs. By simultaneously using all the motifs that characterize a protein family, the sensitivity and specificity of the database search are increased. We define the p-value for a target sequence to be the probability of a random sequence of the same length scoring as well or better in comparison to all the motifs that characterize the family. (The p-value of a database search can be determined from this value and the size of the database.) We show that estimating the distribution of single motif scores by a Gaussian extreme value distribution is insufficiently accurate to provide a useful estimate of the p-value, but that this deficiency can be corrected by reestimating the parameters of the underlying Gaussian distribution from observed scores for comparison of a given motif and sequence database. These parameters are used to calculate a "reduced variate" which has a Gumbel limiting distribution. Multiple motif scores are combined to give a single p-value by using the sum of the reduced variates for the motif scores as the test statistic. We give a computationally efficient approximation to the distribution of the sum of independent Gumbel random variables and verify experimentally that it closely approximates the distribution of the test statistic. Experiments on pseudorandom sequences show that the approximated p-values are conservative, so the significance of high scores in database searches will not be overstated. Experiments with real protein sequences and motifs identified by the MEME algorithm show that determining an overall p-value based on the combination of multiple motifs gives significantly better database search results than using p-values of single motifs.


Subject(s)
Proteins/chemistry , Models, Chemical
20.
Proteins ; 27(2): 184-94, 1997 Feb.
Article in English | MEDLINE | ID: mdl-9061782

ABSTRACT

The structure of a complex between a hexapeptide-based inhibitor, MVT-101, and the chemically synthesized (Aba 67,95,167,195; Aba: L-alpha-amino-n-butyric acid) protease from the human immunodeficiency virus (HIV-1), reported previously at 2.3 A has now been refined to a crystallographic R factor of 15.4% at 2.0 A resolution. Root mean square deviations from ideality are 0.18 A for bond lengths and 2.4 degrees for the angles. The inhibitor can be fitted to the difference electron density map in two alternative orientations. Drastic differences are observed for positions and interactions at P3/S3 and P3'/S3' subsites of the two orientations due to different crystallographic environments.


Subject(s)
HIV Protease Inhibitors/chemistry , HIV Protease/chemistry , Oligopeptides/chemistry , Crystallography, X-Ray , HIV Protease/chemical synthesis , Humans , Models, Molecular , Protein Conformation , Water/chemistry
SELECTION OF CITATIONS
SEARCH DETAIL
...