Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
J Comput Biol ; 14(5): 578-93, 2007 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-17683262

RESUMEN

This paper presents a new method for studying protein folding kinetics. It uses the recently introduced Stochastic Roadmap Simulation (SRS) method to estimate the transition state ensemble (TSE) and predict the rates and the Phi-values for protein folding. The new method was tested on 16 proteins, whose rates and Phi-values have been determined experimentally. Comparison with experimental data shows that our method estimates the TSE much more accurately than an existing method based on dynamic programming. This improvement leads to better folding-rate predictions. We also compute the mean first passage time of the unfolded states and show that the computed values correlate with experimentally determined folding rates. The results on Phi-value predictions are mixed, possibly due to the simple energy model used in the tests. This is the first time that results obtained from SRS have been compared against a substantial amount of experimental data. The results further validate the SRS method and indicate its potential as a general tool for studying protein folding kinetics.


Asunto(s)
Simulación por Computador , Modelos Químicos , Pliegue de Proteína , Cristalografía por Rayos X , Cinética , Valor Predictivo de las Pruebas , Conformación Proteica , Procesos Estocásticos
2.
Nucleic Acids Res ; 35(2): 678-86, 2007.
Artículo en Inglés | MEDLINE | ID: mdl-17182633

RESUMEN

The level of conservation between two homologous sequences often varies among sequence regions; functionally important domains are more conserved than the remaining regions. Thus, multiple parameter sets should be used in alignment of homologous sequences with a stringent parameter set for highly conserved regions and a moderate parameter set for weakly conserved regions. We describe an alignment algorithm to allow dynamic use of multiple parameter sets with different levels of stringency in computation of an optimal alignment of two sequences. The algorithm dynamically considers various candidate alignments, partitions each candidate alignment into sections, and determines the most appropriate set of parameter values for each section of the alignment. The algorithm and its local alignment version are implemented in a computer program named GAP4. The local alignment algorithm in GAP4, that in its predecessor GAP3, and an ordinary local alignment program SIM were evaluated on 257,716 pairs of homologous sequences from 100 protein families. On 168,475 of the 257,716 pairs (a rate of 65.4%), alignments from GAP4 were more statistically significant than alignments from GAP3 and SIM.


Asunto(s)
Algoritmos , Alineación de Secuencia/métodos , Homología de Secuencia de Aminoácido , Biología Computacional/métodos , Programas Informáticos
3.
Proc Natl Acad Sci U S A ; 103(46): 17355-60, 2006 Nov 14.
Artículo en Inglés | MEDLINE | ID: mdl-17065321

RESUMEN

Understanding the genetic basis of HIV-1 drug resistance is essential to developing new antiretroviral drugs and optimizing the use of existing drugs. This understanding, however, is hampered by the large numbers of mutation patterns associated with cross-resistance within each antiretroviral drug class. We used five statistical learning methods (decision trees, neural networks, support vector regression, least-squares regression, and least angle regression) to relate HIV-1 protease and reverse transcriptase mutations to in vitro susceptibility to 16 antiretroviral drugs. Learning methods were trained and tested on a public data set of genotype-phenotype correlations by 5-fold cross-validation. For each learning method, four mutation sets were used as input features: a complete set of all mutations in > or =2 sequences in the data set, the 30 most common data set mutations, an expert panel mutation set, and a set of nonpolymorphic treatment-selected mutations from a public database linking protease and reverse transcriptase sequences to antiretroviral drug exposure. The nonpolymorphic treatment-selected mutations led to the best predictions: 80.1% accuracy at classifying sequences as susceptible, low/intermediate resistant, or highly resistant. Least angle regression predicted susceptibility significantly better than other methods when using the complete set of mutations. The three regression methods provided consistent estimates of the quantitative effect of mutations on drug susceptibility, identifying nearly all previously reported genotype-phenotype associations and providing strong statistical support for many new associations. Mutation regression coefficients showed that, within a drug class, cross-resistance patterns differ for different mutation subsets and that cross-resistance has been underestimated.


Asunto(s)
Farmacorresistencia Viral/genética , VIH-1/efectos de los fármacos , VIH-1/genética , Farmacorresistencia Viral/efectos de los fármacos , Genotipo , VIH-1/aislamiento & purificación , Mutación/genética , Fenotipo , Inhibidores de la Transcriptasa Inversa/farmacología
4.
Nucleic Acids Res ; 34(20): 5730-9, 2006.
Artículo en Inglés | MEDLINE | ID: mdl-17041233

RESUMEN

Given a set of known binding sites for a specific transcription factor, it is possible to build a model of the transcription factor binding site, usually called a motif model, and use this model to search for other sites that bind the same transcription factor. Typically, this search is performed using a position-specific scoring matrix (PSSM), also known as a position weight matrix. In this paper we analyze a set of eukaryotic transcription factor binding sites and show that there is extensive clustering of similar k-mers in eukaryotic motifs, owing to both functional and evolutionary constraints. The apparent limitations of probabilistic models in representing complex nucleotide dependencies lead us to a graph-based representation of motifs. When deciding whether a candidate k-mer is part of a motif or not, we base our decision not on how well the k-mer conforms to a model of the motif as a whole, but how similar it is to specific, known k-mers in the motif. We elucidate the reasons why we expect graph-based methods to perform well on motif data. Our MotifScan algorithm shows greatly improved performance over the prevalent PSSM-based method for the detection of eukaryotic motifs.


Asunto(s)
Algoritmos , Modelos Genéticos , Elementos Reguladores de la Transcripción , Análisis de Secuencia de ADN/métodos , Factores de Transcripción/metabolismo , Sitios de Unión , Expresión Génica , Humanos , Modelos Estadísticos , Nucleótidos/análisis , Saccharomyces cerevisiae/genética
5.
Bioinformatics ; 22(14): e150-7, 2006 Jul 15.
Artículo en Inglés | MEDLINE | ID: mdl-16873465

RESUMEN

MOTIVATION: DNA motif finding is one of the core problems in computational biology, for which several probabilistic and discrete approaches have been developed. Most existing methods formulate motif finding as an intractable optimization problem and rely either on expectation maximization (EM) or on local heuristic searches. Another challenge is the choice of motif model: simpler models such as the position-specific scoring matrix (PSSM) impose biologically unrealistic assumptions such as independence of the motif positions, while more involved models are harder to parametrize and learn. RESULTS: We present MotifCut, a graph-theoretic approach to motif finding leading to a convex optimization problem with a polynomial time solution. We build a graph where the vertices represent all k-mers in the input sequences, and edges represent pairwise k-mer similarity. In this graph, we search for a motif as the maximum density subgraph, which is a set of k-mers that exhibit a large number of pairwise similarities. Our formulation does not make strong assumptions regarding the structure of the motif and in practice both motifs that fit well the PSSM model, and those that exhibit strong dependencies between position pairs are found as dense subgraphs. We benchmark MotifCut on both synthetic and real yeast motifs, and find that it compares favorably to existing popular methods. The ability of MotifCut to detect motifs appears to scale well with increasing input size. Moreover, the motifs we discover are different from those discovered by the other methods. AVAILABILITY: MotifCut server and other materials can be found at motifcut.stanford.edu.


Asunto(s)
Algoritmos , ADN/genética , Modelos Genéticos , Elementos Reguladores de la Transcripción/genética , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Factores de Transcripción/genética , Secuencia de Bases , Sitios de Unión , Mapeo Cromosómico/métodos , Simulación por Computador , Modelos Estadísticos , Datos de Secuencia Molecular , Unión Proteica , Alineación de Secuencia/métodos
6.
Proc Natl Acad Sci U S A ; 103(5): 1412-7, 2006 Jan 31.
Artículo en Inglés | MEDLINE | ID: mdl-16432200

RESUMEN

A striking feature of the human genome is the dearth of CpG dinucleotides (CpGs) interrupted occasionally by CpG islands (CGIs), regions with relatively high content of the dinucleotide. CGIs are generally associated with promoters; genes, whose promoters are especially rich in CpG sequences, tend to be expressed in most tissues. However, all working definitions of what constitutes a CGI rely on ad hoc thresholds. Here we adopt a direct and comprehensive survey to identify the locations of all CpGs in the human genome and find that promoters segregate naturally into two classes by CpG content. Seventy-two percent of promoters belong to the class with high CpG content (HCG), and 28% are in the class whose CpG content is characteristic of the overall genome (low CpG content). The enrichment of CpGs in the HCG class is symmetric and peaks around the core promoter. The broad-based expression of the HCG promoters is not a consequence of a correlation with CpG content because within the HCG class the breadth of expression is independent of the CpG content. The overall depletion of CpGs throughout the genome is thought to be a consequence of the methylation of some germ-line CpGs and their susceptibility to mutation. A comparison of the frequencies of inferred deamination mutations at CpG and GpC dinucleotides in the two classes of promoters using SNPs in human-chimpanzee sequence alignments shows that CpGs mutate at a lower frequency in the HCG promoters, suggesting that CpGs in the HCG class are hypomethylated in the germ line.


Asunto(s)
Islas de CpG , Genoma Humano , Mutación , Animales , Metilación de ADN , Epigénesis Genética , Exones , Mutación de Línea Germinal , Humanos , Intrones , Modelos Genéticos , Nucleótidos/genética , Análisis de Secuencia por Matrices de Oligonucleótidos , Pan troglodytes , Regiones Promotoras Genéticas , Análisis de Secuencia de ADN
7.
J Mol Biol ; 357(2): 665-75, 2006 Mar 24.
Artículo en Inglés | MEDLINE | ID: mdl-16427083

RESUMEN

Poliovirus VPg is a 22 amino acid residue peptide that serves as the protein primer for replication of the viral RNA genome. VPg is known to bind directly to the viral RNA-dependent RNA polymerase, 3D, for covalent uridylylation, yielding mono and di-uridylylated products, VPg-pU and VPg-pUpU, which are subsequently elongated. To model the docking of the VPg substrate to a putative VPg-binding site on the 3D polymerase molecule, we performed a variety of structure-based computations followed by experimental verification. First, potential VPg folded structures were identified, yielding a suite of predicted beta-hairpin structures. These putative VPg structures were then docked to the region of the polymerase implicated by genetic experiments to bind VPg, using grid-based and fragment-based methods. Residues in VPg predicted to affect binding were identified through molecular dynamics simulations, and their effects on the 3D-VPg interaction were tested computationally and biochemically. Experiments with mutant VPg and mutant polymerase molecules confirmed the predicted binding site for VPg on the back side of the polymerase molecule during the uridylylation reaction, opposite to that predicted to bind elongating RNA primers.


Asunto(s)
Nucleótidos/metabolismo , Conformación Proteica , ARN Polimerasa Dependiente del ARN/química , Ribonucleoproteínas/química , Uridina/metabolismo , Proteínas no Estructurales Virales/química , Secuencia de Aminoácidos , Sitios de Unión , Simulación por Computador , Genoma Viral , Modelos Moleculares , Datos de Secuencia Molecular , Mutación , Poliovirus/genética , Poliovirus/metabolismo , ARN Viral , ARN Polimerasa Dependiente del ARN/genética , ARN Polimerasa Dependiente del ARN/metabolismo , Ribonucleoproteínas/genética , Ribonucleoproteínas/metabolismo , Alineación de Secuencia , Proteínas no Estructurales Virales/genética , Proteínas no Estructurales Virales/metabolismo , Replicación Viral
8.
J Chem Inf Model ; 45(1): 128-35, 2005.
Artículo en Inglés | MEDLINE | ID: mdl-15667138

RESUMEN

The superfamily of ligand-gated ion channels (LGICs) has been implicated in anesthetic and alcohol responses. Mutations within glycine and GABA receptors have demonstrated that possible sites of anesthetic action exist within the transmembrane subunits of these receptors. The exact molecular arrangement of this transmembrane region remains at intermediate resolution with current experimental techniques. Homology modeling methods were therefore combined with experimental data to produce a more exact model of this region. A consensus from multiple bioinformatics techniques predicted the topology within the transmembrane domain of a glycine alpha one receptor (GlyRa1) to be alpha helical. This fold information was combined with sequence information using the SeqFold algorithm to search for modeling templates. Independently, the FoldMiner algorithm was used to search for templates that had structural folds similar to published coordinates of the homologous nAChR (1OED). Both SeqFold and Foldminer identified the same modeling template. The GlyRa1 sequence was aligned with this template using multiple scoring criteria. Refinement of the alignment closed gaps to produce agreement with labeling studies carried out on the homologous receptors of the superfamily. Structural assignment and refinement was achieved using Modeler. The final structure demonstrated a cavity within the core of a four-helix bundle. Residues known to be involved in modulating anesthetic potency converge on and line this cavity. This suggests that the binding sites for volatile anesthetics in the LGICs are the cavities formed within the core of transmembrane four-helix bundles.


Asunto(s)
Anestésicos/metabolismo , Receptores de Glicina/química , Secuencia de Aminoácidos , Sitios de Unión , Humanos , Modelos Químicos , Modelos Moleculares , Datos de Secuencia Molecular , Unión Proteica , Conformación Proteica , Receptores de Glicina/metabolismo , Alineación de Secuencia , Homología de Secuencia de Aminoácido
9.
Nucleic Acids Res ; 33(Database issue): D178-82, 2005 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-15608172

RESUMEN

Classifying proteins into families and superfamilies allows identification of functionally important conserved domains. The motifs and scoring matrices derived from such conserved regions provide computational tools that recognize similar patterns in novel sequences, and thus enable the prediction of protein function for genomes. The eBLOCKs database enumerates a cascade of protein blocks with varied conservation levels for each functional domain. A biologically important region is most stringently conserved among a smaller family of highly similar proteins. The same region is often found in a larger group of more remotely related proteins with a reduced stringency. Through enumeration, highly specific signatures can be generated from blocks with more columns and fewer family members, while highly sensitive signatures can be derived from blocks with fewer columns and more members as in a superfamily. By applying PSI-BLAST and a modified K-means clustering algorithm, eBLOCKs automatically groups protein sequences according to different levels of similarity. Multiple sequence alignments are made and trimmed into a series of ungapped blocks. Motifs and position-specific scoring matrices were derived from eBLOCKs and made available for sequence search and annotation. The eBLOCKs database provides a tool for high-throughput genome annotation with maximal specificity and sensitivity. The eBLOCKs database is freely available on the World Wide Web at http://motif.stanford.edu/eblocks/ to all users for online usage. Academic and not-for-profit institutions wishing copies of the program may contact Douglas L. Brutlag (brutlag@stanford.edu). Commercial firms wishing copies of the program for internal installation may contact Jacqueline Tay at the Stanford Office of Technology Licensing (jacqueline.tay@stanford.edu; http://otl.stanford.edu/).


Asunto(s)
Bases de Datos de Proteínas , Análisis de Secuencia de Proteína , Algoritmos , Secuencia de Aminoácidos , Secuencia Conservada , Estructura Terciaria de Proteína , Proteínas/clasificación , Alineación de Secuencia , Programas Informáticos
10.
Nucleic Acids Res ; 32(Web Server issue): W204-7, 2004 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-15215381

RESUMEN

The identification of regulatory motifs is important for the study of gene expression. Here we present a suite of programs that we have developed to search for regulatory sequence motifs: (i) BioProspector, a Gibbs-sampling-based program for predicting regulatory motifs from co-regulated genes in prokaryotes or lower eukaryotes; (ii) CompareProspector, an extension to BioProspector which incorporates comparative genomics features to be used for higher eukaryotes; (iii) MDscan, a program for finding protein-DNA interaction sites from ChIP-on-chip targets. All three programs examine a group of sequences that may share common regulatory motifs and output a list of putative motifs as position-specific probability matrices, the individual sites used to construct the motifs and the location of each site on the input sequences. The web servers and executables can be accessed at http://seqmotifs.stanford.edu.


Asunto(s)
ADN/química , Secuencias Reguladoras de Ácidos Nucleicos , Programas Informáticos , Factores de Transcripción/metabolismo , Algoritmos , Sitios de Unión , ADN/metabolismo , Regulación de la Expresión Génica , Internet , Transcripción Genética , Interfaz Usuario-Computador
11.
J Comput Biol ; 10(3-4): 257-81, 2003.
Artículo en Inglés | MEDLINE | ID: mdl-12935328

RESUMEN

Classic molecular motion simulation techniques, such as Monte Carlo (MC) simulation, generate motion pathways one at a time and spend most of their time in the local minima of the energy landscape defined over a molecular conformation space. Their high computational cost prevents them from being used to compute ensemble properties (properties requiring the analysis of many pathways). This paper introduces stochastic roadmap simulation (SRS) as a new computational approach for exploring the kinetics of molecular motion by simultaneously examining multiple pathways. These pathways are compactly encoded in a graph, which is constructed by sampling a molecular conformation space at random. This computation, which does not trace any particular pathway explicitly, circumvents the local-minima problem. Each edge in the graph represents a potential transition of the molecule and is associated with a probability indicating the likelihood of this transition. By viewing the graph as a Markov chain, ensemble properties can be efficiently computed over the entire molecular energy landscape. Furthermore, SRS converges to the same distribution as MC simulation. SRS is applied to two biological problems: computing the probability of folding, an important order parameter that measures the "kinetic distance" of a protein's conformation from its native state; and estimating the expected time to escape from a ligand-protein binding site. Comparison with MC simulations on protein folding shows that SRS produces arguably more accurate results, while reducing computation time by several orders of magnitude. Computational studies on ligand-protein binding also demonstrate SRS as a promising approach to study ligand-protein interactions.


Asunto(s)
Algoritmos , Fenómenos Bioquímicos , Biología Computacional , Procesos Estocásticos , Ligandos , Cadenas de Markov , Conformación Molecular , Unión Proteica , Pliegue de Proteína
12.
Nucleic Acids Res ; 31(13): 3324-7, 2003 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-12824318

RESUMEN

WebFEATURE (http://feature.stanford.edu/webfeature/) is a web-accessible structural analysis tool that allows users to scan query structures for functional sites in both proteins and nucleic acids. WebFEATURE is the public interface to the scanning algorithm of the FEATURE package, a supervised learning algorithm for creating and identifying 3D, physicochemical motifs in molecular structures. Given an input structure or Protein Data Bank identifier (PDB ID), and a statistical model of a functional site, WebFEATURE will return rank-scored 'hits' in 3D space that identify regions in the structure where similar distributions of physicochemical properties occur relative to the site model. Users can visualize and interactively manipulate scored hits and the query structure in web browsers that support the Chime plug-in. Alternatively, results can be downloaded and visualized through other freely available molecular modeling tools, like RasMol, PyMOL and Chimera. A major application of WebFEATURE is in rapid annotation of function to structures in the context of structural genomics.


Asunto(s)
Modelos Moleculares , Ácidos Nucleicos/química , Proteínas/química , Programas Informáticos , Algoritmos , Gráficos por Computador , Genómica , Internet , Sustancias Macromoleculares , Modelos Estadísticos , Estructura Molecular , Conformación de Ácido Nucleico , Ácidos Nucleicos/fisiología , Conformación Proteica , Proteínas/fisiología , Interfaz Usuario-Computador
13.
Nucleic Acids Res ; 31(13): 3328-32, 2003 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-12824319

RESUMEN

Computational methods such as sequence alignment and motif construction are useful in grouping related proteins into families, as well as helping to annotate new proteins of unknown function. These methods identify conserved amino acids in protein sequences, but cannot determine the specific functional or structural roles of conserved amino acids without additional study. In this work, we present 3MATRIX (http://3matrix.stanford.edu) and 3MOTIF (http://3motif.stanford.edu), a web-based sequence motif visualization system that displays sequence motif information in its appropriate three-dimensional (3D) context. This system is flexible in that users can enter sequences, keywords, structures or sequence motifs to generate visualizations. In 3MOTIF, users can search using discrete sequence motifs such as PROSITE patterns, eMOTIFs, or any other regular expression-like motif. Similarly, 3MATRIX accepts an eMATRIX position-specific scoring matrix, or will convert a multiple sequence alignment block into an eMATRIX for visualization. Each query motif is used to search the protein structure database for matches, in which the motif is then visually highlighted in three dimensions. Important properties of motifs such as sequence conservation and solvent accessible surface area are also displayed in the visualizations, using carefully chosen color shading schemes.


Asunto(s)
Secuencias de Aminoácidos , Modelos Moleculares , Programas Informáticos , Aminoácidos/química , Aminoácidos/fisiología , Animales , Gráficos por Computador , Secuencia Conservada , Bases de Datos de Proteínas , Internet , Conformación Proteica , Proteínas/química , Alineación de Secuencia
14.
Bioinformatics ; 19(4): 541-2, 2003 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-12611811

RESUMEN

SUMMARY: 3MOTIF is a web application that visually maps conserved sequence motifs onto three-dimensional protein structures in the Protein Data Bank (PDB; Berman et al., Nucleic Acids Res., 28, 235-242, 2000). Important properties of motifs such as conservation strength and solvent accessible surface area at each position are visually represented on the structure using a variety of color shading schemes. Users can manipulate the displayed motifs using the freely available Chime plugin. AVAILABILITY: http://motif.stanford.edu/3motif/


Asunto(s)
Bases de Datos de Proteínas , Imagenología Tridimensional/métodos , Modelos Moleculares , Proteínas/química , Alineación de Secuencia/métodos , Interfaz Usuario-Computador , Secuencias de Aminoácidos , Secuencia de Aminoácidos , Secuencia Conservada , Sistemas de Administración de Bases de Datos , Internet , Datos de Secuencia Molecular , Conformación Proteica , Análisis de Secuencia de Proteína/métodos , Programas Informáticos
15.
Bioinformatics ; 18 Suppl 2: S18-26, 2002.
Artículo en Inglés | MEDLINE | ID: mdl-12385979

RESUMEN

Understanding the dynamics of ligand-protein interactions is indispensable in the design of novel therapeutic agents. In this paper, we establish the use of Stochastic Roadmap Simulation (SRS) for the study of ligand-protein interactions through two studies. In our first study, we measure the effects of mutations on the catalytic site of a protein, a process called computational mutagenesis. In our second study, we focus on distinguishing the catalytic site from other putative binding sites. SRS compactly represents many Monte Carlo (MC) simulation paths in a compact graph structure, or roadmap. Furthermore, SRS allows us to analyze all the paths in this roadmap simultaneously. In our application of SRS to the domain of ligand-protein interactions, we consider a new parameter called escape time, the expected number of MC simulation steps required for the ligand to escape from the 'funnel of attraction' of the binding site, as a metric for analyzing such interactions. Although computing escape times would probably be infeasible with MC simulation, these computations can be performed very efficiently with SRS. Our results for six mutant complexes for the first study and seven ligand-protein complexes for the second study, are very promising: In particular, the first results agree well with the biological interpretation of the mutations, while the second results show that escape time is a good metric to distinguish the catalytic site for five out of seven complexes.


Asunto(s)
Aminoácidos/química , Modelos Químicos , Modelos Moleculares , Mapeo de Interacción de Proteínas/métodos , Proteínas/química , Análisis de Secuencia de Proteína/métodos , Secuencia de Aminoácidos , Aminoácidos/análisis , Sitios de Unión , Simulación por Computador , Ligandos , Modelos Estadísticos , Datos de Secuencia Molecular , Complejos Multiproteicos/análisis , Complejos Multiproteicos/química , Mutagénesis Sitio-Dirigida , Unión Proteica , Conformación Proteica , Proteínas/análisis , Procesos Estocásticos , Relación Estructura-Actividad
16.
Nat Biotechnol ; 20(8): 835-9, 2002 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-12101404

RESUMEN

Chromatin immunoprecipitation followed by cDNA microarray hybridization (ChIP-array) has become a popular procedure for studying genome-wide protein-DNA interactions and transcription regulation. However, it can only map the probable protein-DNA interaction loci within 1-2 kilobases resolution. To pinpoint interaction sites down to the base-pair level, we introduce a computational method, Motif Discovery scan (MDscan), that examines the ChIP-array-selected sequences and searches for DNA sequence motifs representing the protein-DNA interaction sites. MDscan combines the advantages of two widely adopted motif search strategies, word enumeration and position-specific weight matrix updating, and incorporates the ChIP-array ranking information to accelerate searches and enhance their success rates. MDscan correctly identified all the experimentally verified motifs from published ChIP-array experiments in yeast (STE12, GAL4, RAP1, SCB, MCB, MCM1, SFF, and SWI5), and predicted two motif patterns for the differential binding of Rap1 protein in telomere regions. In our studies, the method was faster and more accurate than several established motif-finding algorithms. MDscan can be used to find DNA motifs not only in ChIP-array experiments but also in other experiments in which a subgroup of the sequences can be inferred to contain relatively abundant motif sites. The MDscan web server can be accessed at http://BioProspector.stanford.edu/MDscan/.


Asunto(s)
Algoritmos , Cromatina/metabolismo , Proteínas de Unión al ADN/metabolismo , ADN/genética , ADN/metabolismo , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Pruebas de Precipitina/métodos , Secuencia de Bases , Sitios de Unión , Cromatina/genética , Biología Computacional/métodos , Regulación de la Expresión Génica , Genes Fúngicos/genética , Internet , Unión Proteica , Elementos de Respuesta/genética , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , Sensibilidad y Especificidad , Complejo Shelterina , Programas Informáticos , Telómero/genética , Telómero/metabolismo , Proteínas de Unión a Telómeros/metabolismo , Factores de Tiempo , Factores de Transcripción/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...