Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
J Biol Chem ; 285(19): 14701-10, 2010 May 07.
Artigo em Inglês | MEDLINE | ID: mdl-20167602

RESUMO

The YTH (YT521-B homology) domain was identified by sequence comparison and is found in 174 different proteins expressed in eukaryotes. It is characterized by 14 invariant residues within an alpha-helix/beta-sheet structure. Here we show that the YTH domain is a novel RNA binding domain that binds to a short, degenerated, single-stranded RNA sequence motif. The presence of the binding motif in alternative exons is necessary for YT521-B to directly influence splice site selection in vivo. Array analyses demonstrate that YT521-B predominantly regulates vertebrate-specific exons. An NMR titration experiment identified the binding surface for single-stranded RNA on the YTH domain. Structural analyses indicate that the YTH domain is related to the pseudouridine synthase and archaeosine transglycosylase (PUA) domain. Our data show that the YTH domain conveys RNA binding ability to a new class of proteins that are found in all eukaryotic organisms.


Assuntos
Proteínas do Tecido Nervoso/genética , Splicing de RNA/fisiologia , Proteínas de Ligação a RNA/genética , RNA/metabolismo , Sítios de Ligação , Biomarcadores/metabolismo , Perfilação da Expressão Gênica , Humanos , Espectroscopia de Ressonância Magnética , Modelos Moleculares , Análise de Sequência com Séries de Oligonucleotídeos , Conformação Proteica , Dobramento de Proteína , Estrutura Terciária de Proteína , RNA/genética , Fatores de Processamento de RNA
2.
BMC Bioinformatics ; 9: 477, 2008 Nov 12.
Artigo em Inglês | MEDLINE | ID: mdl-19014490

RESUMO

BACKGROUND: Alternative splicing is a major contributor to the diversity of eukaryotic transcriptomes and proteomes. Currently, large scale detection of alternative splicing using expressed sequence tags (ESTs) or microarrays does not capture all alternative splicing events. Moreover, for many species genomic data is being produced at a far greater rate than corresponding transcript data, hence in silico methods of predicting alternative splicing have to be improved. RESULTS: Here, we show that the use of Bayesian networks (BNs) allows accurate prediction of evolutionary conserved exon skipping events. At a stringent false positive rate of 0.5%, our BN achieves an improved true positive rate of 61%, compared to a previously reported 50% on the same dataset using support vector machines (SVMs). Incorporating several novel discriminative features such as intronic splicing regulatory elements leads to the improvement. Features related to mRNA secondary structure increase the prediction performance, corroborating previous findings that secondary structures are important for exon recognition. Random labelling tests rule out overfitting. Cross-validation on another dataset confirms the increased performance. When using the same dataset and the same set of features, the BN matches the performance of an SVM in earlier literature. Remarkably, we could show that about half of the exons which are labelled constitutive but receive a high probability of being alternative by the BN, are in fact alternative exons according to the latest EST data. Finally, we predict exon skipping without using conservation-based features, and achieve a true positive rate of 29% at a false positive rate of 0.5%. CONCLUSION: BNs can be used to achieve accurate identification of alternative exons and provide clues about possible dependencies between relevant features. The near-identical performance of the BN and SVM when using the same features shows that good classification depends more on features than on the choice of classifier. Conservation based features continue to be the most informative, and hence distinguishing alternative exons from constitutive ones without using conservation based features remains a challenging problem.


Assuntos
Algoritmos , Processamento Alternativo/genética , Biologia Computacional/métodos , Éxons/genética , Inteligência Artificial , Teorema de Bayes , Conformação de Ácido Nucleico , RNA Mensageiro/química
3.
Nucleic Acids Res ; 35(Web Server issue): W688-93, 2007 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-17537825

RESUMO

BioBayesNet is a new web application that allows the easy modeling and classification of biological data using Bayesian networks. To learn Bayesian networks the user can either upload a set of annotated FASTA sequences or a set of pre-computed feature vectors. In case of FASTA sequences, the server is able to generate a wide range of sequence and structural features from the sequences. These features are used to learn Bayesian networks. An automatic feature selection procedure assists in selecting discriminative features, providing an (locally) optimal set of features. The output includes several quality measures of the overall network and individual features as well as a graphical representation of the network structure, which allows to explore dependencies between features. Finally, the learned Bayesian network or another uploaded network can be used to classify new data. BioBayesNet facilitates the use of Bayesian networks in biological sequences analysis and is flexible to support modeling and classification applications in various scientific fields. The BioBayesNet server is available at http://biwww3.informatik.uni-freiburg.de:8080/BioBayesNet/.


Assuntos
Algoritmos , Biologia Computacional/métodos , Modelos Moleculares , Reconhecimento Automatizado de Padrão/métodos , Proteínas/análise , Proteínas/química , Análise de Sequência de Proteína/métodos , Sequência de Aminoácidos , Teorema de Bayes , Simulação por Computador , Armazenamento e Recuperação da Informação , Internet , Dados de Sequência Molecular , Conformação Proteica , Dobramento de Proteína , Proteínas/classificação , Relação Estrutura-Atividade
4.
Nucleic Acids Res ; 34(17): e117, 2006.
Artigo em Inglês | MEDLINE | ID: mdl-16987907

RESUMO

RNA binding proteins recognize RNA targets in a sequence specific manner. Apart from the sequence, the secondary structure context of the binding site also affects the binding affinity. Binding sites are often located in single-stranded RNA regions and it was shown that the sequestration of a binding motif in a double-strand abolishes protein binding. Thus, it is desirable to include knowledge about RNA secondary structures when searching for the binding motif of a protein. We present the approach MEMERIS for searching sequence motifs in a set of RNA sequences and simultaneously integrating information about secondary structures. To abstract from specific structural elements, we precompute position-specific values measuring the single-strandedness of all substrings of an RNA sequence. These values are used as prior knowledge about the motif starts to guide the motif search. Extensive tests with artificial and biological data demonstrate that MEMERIS is able to identify motifs in single-stranded regions even if a stronger motif located in double-strand parts exists. The discovered motif occurrences in biological datasets mostly coincide with known protein-binding sites. This algorithm can be used for finding the binding motif of single-stranded RNA-binding proteins in SELEX or other biological sequence data.


Assuntos
Proteínas de Ligação a RNA/metabolismo , RNA/química , Análise de Sequência de RNA/métodos , Algoritmos , Sequência de Bases , Sítios de Ligação , Dados de Sequência Molecular , Conformação de Ácido Nucleico , RNA/metabolismo
5.
Bioinformatics ; 21(14): 3082-8, 2005 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-15905283

RESUMO

MOTIVATION: The identification of transcription factor binding sites in promoter sequences is an important problem, since it reveals information about the transcriptional regulation of genes. For analysing transcriptional regulation, computational approaches for predicting putative binding sites are applied. Commonly used stochastic models for binding sites are position-specific score matrices, which show weak predictive power. RESULTS: We have developed a probabilistic modelling approach, which allows to consider diverse characteristic binding site properties to obtain more accurate representations of binding sites. These properties are modelled as random variables in Bayesian networks, which are capable of dealing with dependencies among binding site properties. Cross-validation on several datasets shows improvements in the false positive error rate and the significance (P-value) of true binding sites.


Assuntos
Algoritmos , Modelos Químicos , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Análise de Sequência de Proteína/métodos , Fatores de Transcrição/química , Sítios de Ligação , Simulação por Computador , Modelos Genéticos , Modelos Estatísticos , Ligação Proteica , Fatores de Transcrição/análise , Fatores de Transcrição/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...