Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 3 de 3
Filter
Add more filters










Database
Language
Publication year range
1.
Science ; 292(5523): 1915-8, 2001 Jun 08.
Article in English | MEDLINE | ID: mdl-11397946

ABSTRACT

Experimental murine genetic models of complex human disease show great potential for understanding human disease pathogenesis. To reduce the time required for analysis of such models from many months down to milliseconds, a computational method for predicting chromosomal regions regulating phenotypic traits and a murine database of single nucleotide polymorphisms were developed. After entry of phenotypic information obtained from inbred mouse strains, the phenotypic and genotypic information is analyzed in silico to predict the chromosomal regions regulating the phenotypic trait.


Subject(s)
Algorithms , Chromosome Mapping/methods , Disease Models, Animal , Polymorphism, Single Nucleotide , Quantitative Trait, Heritable , Animals , Bone Density , Crosses, Genetic , Databases, Factual , Female , Genetic Linkage , Genotype , Humans , Linkage Disequilibrium , Major Histocompatibility Complex/genetics , Male , Mice , Mice, Inbred C57BL , Mice, Inbred Strains , Phenotype , Polymerase Chain Reaction , Software
2.
Bioinformatics ; 16(3): 203-11, 2000 Mar.
Article in English | MEDLINE | ID: mdl-10869013

ABSTRACT

MOTIVATION: Supplementary cDNA or EST evidence is often decisive for discriminating between alternative gene predictions derived from computational sequence inspection by any of a number of requisite programs. Without additional experimental effort, this approach must rely on the occurrence of cognate ESTs for the gene under consideration in available, generally incomplete, EST collections for the given species. In some cases, particular exon assignments can be supported by sequence matching even if the cDNA or EST is produced from non-cognate genomic DNA, including different loci of a gene family or homologous loci from different species. However, marginally significant sequence matching alone can also be misleading. We sought to develop an algorithm that would simultaneously score for predicted intrinsic splice site strength and sequence matching between the genomic DNA template and a related cDNA or EST. In this case, weakly predicted splice sites may be chosen for the optimal scoring spliced alignment on the basis of surrounding sequence matching. Strongly predicted splice sites will enter the optimal spliced alignment even without strong sequence matching. RESULTS: We designed a novel algorithm that produces the optimal spliced alignment of a genomic DNA with a cDNA or EST based on scoring for both sequence matching and intrinsic splice site strength. By example, we demonstrate that this combined approach appears to improve gene prediction accuracy compared with current methods that rely only on either search by content and signal or on sequence similarity. AVAILABILITY: The algorithm is available as a C subroutine and is implemented in the SplicePredictor and GeneSeqer programs. The source code is available via anonymous ftp from ftp. zmdb.iastate.edu. Both programs are also implemented as a Web service at http://gremlin1.zool.iastate.edu/cgi-bin/s p.cgiand http://gremlin1.zool.iastate.edu/cgi-bin/g s.cgi, respectively. CONTACT: vbrendel@iastate.edu


Subject(s)
Algorithms , DNA, Complementary , RNA Splicing , Sequence Alignment/methods , Arabidopsis/genetics , Base Sequence , Molecular Sequence Data , Sequence Homology, Nucleic Acid , Templates, Genetic
3.
J Mol Biol ; 297(5): 1075-85, 2000 Apr 14.
Article in English | MEDLINE | ID: mdl-10764574

ABSTRACT

Gene identification in genomic DNA from eukaryotes is complicated by the vast combinatorial possibilities of potential exon assemblies. If the gene encodes a protein that is closely related to known proteins, gene identification is aided by matching similarity of potential translation products to those target proteins. The genomic DNA and protein sequences can be aligned directly by scoring the implied residues of in-frame nucleotide triplets against the protein residues in conventional ways, while allowing for long gaps in the alignment corresponding to introns in the genomic DNA. We describe a novel method for such spliced alignment. The method derives an optimal alignment based on scoring for both sequence similarity of the predicted gene product to the protein sequence and intrinsic splice site strength of the predicted introns. Application of the method to a representative set of 50 known genes from Arabidopsis thaliana showed significant improvement in prediction accuracy compared to previous spliced alignment methods. The method is also more accurate than ab initio gene prediction methods, provided sufficiently close target proteins are available. In view of the fast growth of public sequence repositories, we argue that close targets will be available for the majority of novel genes, making spliced alignment an excellent practical tool for high-throughput automated genome annotation.


Subject(s)
Arabidopsis/genetics , Genes, Plant/genetics , Genome, Plant , Plant Proteins/chemistry , Plant Proteins/genetics , RNA Splicing/genetics , Sequence Alignment/methods , Algorithms , Amino Acid Sequence , Automation/methods , Base Sequence , Bias , Codon/genetics , Computational Biology/methods , Computational Biology/statistics & numerical data , Exons/genetics , Introns/genetics , Molecular Sequence Data , Nucleotides/genetics , Regulatory Sequences, Nucleic Acid/genetics , Reproducibility of Results , Sensitivity and Specificity , Sequence Alignment/statistics & numerical data , Sequence Homology, Amino Acid , Software
SELECTION OF CITATIONS
SEARCH DETAIL
...