Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Bioinformatics ; 38(11): 3029-3036, 2022 05 26.
Artigo em Inglês | MEDLINE | ID: mdl-35451453

RESUMO

MOTIVATION: Segmentation and genome annotation (SAGA) algorithms are widely used to understand genome activity and gene regulation. These methods take as input a set of sequencing-based assays of epigenomic activity, such as ChIP-seq measurements of histone modification and transcription factor binding. They output an annotation of the genome that assigns a chromatin state label to each genomic position. Existing SAGA methods have several limitations caused by the discrete annotation framework: such annotations cannot easily represent varying strengths of genomic elements, and they cannot easily represent combinatorial elements that simultaneously exhibit multiple types of activity. To remedy these limitations, we propose an annotation strategy that instead outputs a vector of chromatin state features at each position rather than a single discrete label. Continuous modeling is common in other fields, such as in topic modeling of text documents. We propose a method, epigenome-ssm-nonneg, that uses a non-negative state space model to efficiently annotate the genome with chromatin state features. We also propose several measures of the quality of a chromatin state feature annotation and we compare the performance of several alternative methods according to these quality measures. RESULTS: We show that chromatin state features from epigenome-ssm-nonneg are more useful for several downstream applications than both continuous and discrete alternatives, including their ability to identify expressed genes and enhancers. Therefore, we expect that these continuous chromatin state features will be valuable reference annotations to be used in visualization and downstream analysis. AVAILABILITY AND IMPLEMENTATION: Source code for epigenome-ssm is available at https://github.com/habibdanesh/epigenome-ssm and Zenodo (DOI: 10.5281/zenodo.6507585). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Cromatina , Epigenoma , Humanos , Epigenômica/métodos , Genômica/métodos , Software
2.
Bioinformatics ; 38(4): 1126-1128, 2022 01 27.
Artigo em Inglês | MEDLINE | ID: mdl-34718413

RESUMO

MOTIVATION: With the advancement of sequencing technologies, genomic data sets are constantly being expanded by high volumes of different data types. One recently introduced data type in genomic science is genomic signals, which are usually short-read coverage measurements over the genome. To understand and evaluate the results of such studies, one needs to understand and analyze the characteristics of the input data. RESULTS: SigTools is an R-based genomic signals visualization package developed with two objectives: (i) to facilitate genomic signals exploration in order to uncover insights for later model training, refinement and development by including distribution and autocorrelation plots; (ii) to enable genomic signals interpretation by including correlation and aggregation plots. In addition, our corresponding web application, SigTools-Shiny, extends the accessibility scope of these modules to people who are more comfortable working with graphical user interfaces instead of command-line tools. AVAILABILITY AND IMPLEMENTATION: SigTools source code, installation guide and manual is freely available on http://github.com/shohre73.


Assuntos
Genoma , Genômica , Humanos , Genômica/métodos , Software , Análise de Sequência
3.
PLoS One ; 14(5): e0210281, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31059508

RESUMO

Previously, we have introduced an improved version of jViz.RNA which enabled faster and more stable RNA visualization by employing compressed tree graphs. However, the new RNA representation and visualization method required a sophisticated mechanism of pseudoknot visualization. In this work, we present our novel pseudoknot classification and implementation of pseudoknot visualization in the context of the new RNA graph model. We then compare our approach with other RNA visualization software, and demonstrate jViz.RNA 4.0's benefits compared to other software. Additionally, we introduce interactive editing functionality into jViz.RNA and demonstrate its benefits in exploring and building RNA structures. The results presented highlight the new high degree of utility jViz.RNA 4.0 now offers. Users are now able to visualize pseudoknotted RNA, manipulate the resulting automatic layouts to suit their individual needs, and change both positioning and connectivity of the RNA molecules examined. Care was taken to limit overlap between structural elements, particularly in the case of pseudoknots to ensure an intuitive and informative layout of the final RNA structure. Availability: The software is freely available at: https://jviz.cs.sfu.ca/.


Assuntos
Biologia Computacional , Edição de RNA , RNA/genética , Software , Biologia Computacional/métodos , Gráficos por Computador , Conformação de Ácido Nucleico , RNA/química
4.
BMC Bioinformatics ; 18(1): 282, 2017 May 30.
Artigo em Inglês | MEDLINE | ID: mdl-28558664

RESUMO

BACKGROUND: RNA visualization software tools have traditionally presented a static visualization of RNA molecules with limited ability for users to interact with the resulting image once it is complete. Only a few tools allowed for dynamic structures. One such tool is jViz.RNA. Currently, jViz.RNA employs a unique method for the creation of the RNA molecule layout by mapping the RNA nucleotides into vertexes in a graph, which we call the detailed graph, and then utilizes a Newtonian mechanics inspired system of forces to calculate a layout for the RNA molecule. The work presented here focuses on improvements to jViz.RNA that allow the drawing of RNA secondary structures according to common drawing conventions, as well as dramatic run-time performance improvements. This is done first by presenting an alternative method for mapping the RNA molecule into a graph, which we call the compressed graph, and then employing advanced numerical integration methods for the compressed graph representation. RESULTS: Comparing the compressed graph and detailed graph implementations, we find that the compressed graph produces results more consistent with RNA drawing conventions. However, we also find that employing the compressed graph method requires a more sophisticated initial layout to produce visualizations that would require minimal user interference. Comparing the two numerical integration methods demonstrates the higher stability of the Backward Euler method, and its resulting ability to handle much larger time steps, a high priority feature for any software which entails user interaction. CONCLUSION: The work in this manuscript presents the preferred use of compressed graphs to detailed ones, as well as the advantages of employing the Backward Euler method over the Forward Euler method. These improvements produce more stable as well as visually aesthetic representations of the RNA secondary structures. The results presented demonstrate that both the compressed graph representation, as well as the Backward Euler integrator, greatly enhance the run-time performance and usability. The newest iteration of jViz.RNA is available at https://jviz.cs.sfu.ca/download/download.html .


Assuntos
RNA/química , Algoritmos , Sequência de Bases , Conformação de Ácido Nucleico , RNA/metabolismo , Software
5.
Artigo em Inglês | MEDLINE | ID: mdl-26915129

RESUMO

RNA visualization is crucial in order to understand the relationship that exists between RNA structure and its function, as well as the development of better RNA structure prediction algorithms. However, in the context of RNA visualization, one key structure remains difficult to visualize: Pseudoknots. Pseudoknots occur in RNA folding when two secondary structural components form base-pairs between them. The three-dimensional nature of these components makes them challenging to visualize in two-dimensional media, such as print media or screens. In this review, we focus on the advancements that have been made in the field of RNA visualization in two-dimensional media in the past two decades. The review aims at presenting all relevant aspects of pseudoknot visualization. We start with an overview of several pseudoknotted structures and their relevance in RNA function. Next, we discuss the theoretical basis for RNA structural topology classification and present RNA classification systems for both pseudoknotted and non-pseudoknotted RNAs. Each description of RNA classification system is followed by a discussion of the software tools and algorithms developed to date to visualize RNA, comparing the different tools' strengths and shortcomings.


Assuntos
Biologia Computacional/métodos , Conformação de Ácido Nucleico , RNA/química , RNA/metabolismo , Dobramento de RNA
6.
Int J Bioinform Res Appl ; 11(5): 375-96, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26558299

RESUMO

Pseudoknots are RNA tertiary structures which perform essential biological functions. This paper discusses SARNA-Predict-pk, a RNA pseudoknotted secondary structure prediction algorithm based on Simulated Annealing (SA). The research presented here extends previous work of SARNA-Predict and further examines the effect of the new algorithm to include prediction of RNA secondary structure with pseudoknots. An evaluation of the performance of SARNA-Predict-pk in terms of prediction accuracy is made via comparison with several state-of-the-art prediction algorithms using 20 individual known structures from seven RNA classes. We measured the sensitivity and specificity of nine prediction algorithms. Three of these are dynamic programming algorithms: Pseudoknot (pknotsRE), NUPACK, and pknotsRG-mfe. One is using the statistical clustering approach: Sfold and the other five are heuristic algorithms: SARNA-Predict-pk, ILM, STAR, IPknot and HotKnots algorithms. The results presented in this paper demonstrate that SARNA-Predict-pk can out-perform other state-of-the-art algorithms in terms of prediction accuracy. This supports the use of the proposed method on pseudoknotted RNA secondary structure prediction of other known structures.

7.
Artigo em Inglês | MEDLINE | ID: mdl-21030739

RESUMO

Ribonucleic acid (RNA), a single-stranded linear molecule, is essential to all biological systems. Different regions of the same RNA strand will fold together via base pair interactions to make intricate secondary and tertiary structures that guide crucial homeostatic processes in living organisms. Since the structure of RNA molecules is the key to their function, algorithms for the prediction of RNA structure are of great value. In this article, we demonstrate the usefulness of SARNA-Predict, an RNA secondary structure prediction algorithm based on Simulated Annealing (SA). A performance evaluation of SARNA-Predict in terms of prediction accuracy is made via comparison with eight state-of-the-art RNA prediction algorithms: mfold, Pseudoknot (pknotsRE), NUPACK, pknotsRG-mfe, Sfold, HotKnots, ILM, and STAR. These algorithms are from three different classes: heuristic, dynamic programming, and statistical sampling techniques. An evaluation for the performance of SARNA-Predict in terms of prediction accuracy was verified with native structures. Experiments on 33 individual known structures from eleven RNA classes (tRNA, viral RNA, antigenomic HDV, telomerase RNA, tmRNA, rRNA, RNaseP, 5S rRNA, Group I intron 23S rRNA, Group I intron 16S rRNA, and 16S rRNA) were performed. The results presented in this paper demonstrate that SARNA-Predict can out-perform other state-of-the-art algorithms in terms of prediction accuracy. Furthermore, there is substantial improvement of prediction accuracy by incorporating a more sophisticated thermodynamic model (efn2).


Assuntos
Algoritmos , Biologia Computacional/métodos , RNA/química , Software , Pareamento de Bases , Conformação de Ácido Nucleico , Análise de Sequência de RNA/métodos
8.
BMC Bioinformatics ; 7: 185, 2006 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-16584563

RESUMO

BACKGROUND: DNA sequencing is used ubiquitously: from deciphering genomes to determining the primary sequence of small RNAs (smRNAs). The cloning of smRNAs is currently the most conventional method to determine the actual sequence of these important regulators of gene expression. Typical smRNA cloning projects involve the sequencing of hundreds to thousands of smRNA clones that are delimited at their 5' and 3' ends by fixed sequence regions. These primers result from the biochemical protocol used to isolate and convert the smRNA into clonable PCR products. Recently we completed a smRNA cloning project involving tobacco plants, where analysis was required for approximately 700 smRNA sequences. Finding no easily accessible research tool to enter and analyze smRNA sequences we developed Ebbie to assist us with our study. RESULTS: Ebbie is a semi-automated smRNA cloning data processing algorithm, which initially searches for any substring within a DNA sequencing text file, which is flanked by two constant strings. The substring, also termed smRNA or insert, is stored in a MySQL and BlastN database. These inserts are then compared using BlastN to locally installed databases allowing the rapid comparison of the insert to both the growing smRNA database and to other static sequence databases. Our laboratory used Ebbie to analyze scores of DNA sequencing data originating from an smRNA cloning project. Through its built-in instant analysis of all inserts using BlastN, we were able to quickly identify 33 groups of smRNAs from approximately 700 database entries. This clustering allowed the easy identification of novel and highly expressed clusters of smRNAs. Ebbie is available under GNU GPL and currently implemented on http://bioinformatics.org/ebbie/. CONCLUSION: Ebbie was designed for medium sized smRNA cloning projects with about 1,000 database entries. Ebbie can be used for any type of sequence analysis where two constant primer regions flank a sequence of interest. The reliable storage of inserts, and their annotation in a MySQL database, BlastN comparison of new inserts to dynamic and static databases make it a powerful new tool in any laboratory using DNA sequencing. Ebbie also prevents manual mistakes during the excision process and speeds up annotation and data-entry. Once the server is installed locally, its access can be restricted to protect sensitive new DNA sequencing data. Ebbie was primarily designed for smRNA cloning projects, but can be applied to a variety of RNA and DNA cloning projects.


Assuntos
Algoritmos , Clonagem Molecular , Bases de Dados de Ácidos Nucleicos , Armazenamento e Recuperação da Informação/métodos , Internet , RNA Interferente Pequeno/genética , Sequência de Bases , Processamento Eletrônico de Dados/métodos , Dados de Sequência Molecular
9.
Bioinformatics ; 22(8): 934-42, 2006 Apr 15.
Artigo em Inglês | MEDLINE | ID: mdl-16473869

RESUMO

MOTIVATION: Ribonucleic acid is vital in numerous stages of protein synthesis; it also possesses important functional and structural roles within the cell. The function of an RNA molecule within a particular organic system is principally determined by its structure. The current physical methods available for structure determination are time-consuming and expensive. Hence, computational methods for structure prediction are sought after. The energies involved by the formation of secondary structure elements are significantly greater than those of tertiary elements. Therefore, RNA structure prediction focuses on secondary structure. RESULTS: We present P-RnaPredict, a parallel evolutionary algorithm for RNA secondary structure prediction. The speedup provided by parallelization is investigated with five sequences, and a dramatic improvement in speedup is demonstrated, especially with longer sequences. An evaluation of the performance of P-RnaPredict in terms of prediction accuracy is made through comparison with 10 individual known structures from 3 RNA classes (5S rRNA, Group I intron 16S rRNA and 16S rRNA) and the mfold dynamic programming algorithm. P-RnaPredict is able to predict structures with higher true positive base pair counts and lower false positives than mfold on certain sequences. AVAILABILITY: P-RnaPredict is available for non-commercial usage. Interested parties should contact Kay C. Wiese (wiese@cs.sfu.ca).


Assuntos
Algoritmos , Modelos Químicos , Modelos Moleculares , Análise de Sequência de RNA/métodos , Software , Sequência de Bases , Simulação por Computador , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Alinhamento de Sequência/métodos
10.
IEEE Trans Nanobioscience ; 4(3): 212-8, 2005 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-16220684

RESUMO

Many tools have been developed for visualization of RNA secondary structures using a variety of techniques and output formats. However, each tool is typically limited to one or two of the visualization models discussed in this paper, supports only a single file format, and is tied to a specific platform. In order for structure prediction researchers to better understand the results of their algorithms and to enable life science researchers to interpret RNA structure easily, it is helpful to provide them with a flexible and powerful tool.jViz.Rna is a multiplatform visualization tool capable of displaying RNA secondary structures encoded in a variety of file formats. The same structure can be viewed using any of the models supported, including linked graph, circle graph, dot plot, and classical structure. Also, the output is dynamic and can easily be further manipulated by the user. In addition, any of the drawings produced can be saved in either the EPS or PNG file formats enabling easy usage in publications and presentations.


Assuntos
Gráficos por Computador , Armazenamento e Recuperação da Informação/métodos , Modelos Químicos , Modelos Moleculares , RNA/química , Análise de Sequência de RNA/métodos , Software , Simulação por Computador , Conformação de Ácido Nucleico , Linguagens de Programação , RNA/análise , Interface Usuário-Computador
11.
IEEE Trans Nanobioscience ; 4(3): 219-27, 2005 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-16220685

RESUMO

This paper presents a fully parallel version of RnaPredict, a genetic algorithm (GA) for RNA secondary structure prediction. The research presented here builds on previous work and examines the impact of three different pseudorandom number generators (PRNGs) on the GA's performance. The three generators tested are the C standard library PRNG RAND, a parallelized multiplicative congruential generator (MCG), and a parallelized Mersenne Twister (MT). A fully parallel version of RnaPredict using the Message Passing Interface (MPI) was implemented on a 128-node Beowulf cluster. The PRNG comparison tests were performed with known structures whose sequences are 118, 122, 468, 543, and 556 nucleotides in length. The effects of the PRNGs are investigated and the predicted structures are compared to known structures. Results indicate that P-RnaPredict demonstrated good prediction accuracy, particularly so for shorter sequences.


Assuntos
Algoritmos , Modelos Químicos , Análise Numérica Assistida por Computador , RNA/química , Análise de Sequência de RNA/métodos , Simulação por Computador , Armazenamento e Recuperação da Informação/métodos , Modelos Moleculares , Modelos Estatísticos , Conformação de Ácido Nucleico , RNA/análise , Software
12.
Biosystems ; 72(1-2): 29-41, 2003 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-14642657

RESUMO

This paper presents a Genetic Algorithm (GA) to predict the secondary structure of RNA molecules, where the secondary structure is encoded as a permutation. More specifically, the proposed algorithm predicts which specific canonical base pairs will form hydrogen bonds and build helices, also known as stems. Since RNA is involved in both transcription and translation and also has catalytic and structural roles in the cell, determining the structure of RNA is of fundamental importance in helping to determine RNA function. We introduce a GA where a permutation is used to encode the secondary structure of RNA molecules. We discuss results on RNA sequences of lengths 76, 210, 681, and 785 nucleotides and present several improvements to our algorithm. We show that the Keep-Best Reproduction operator has similar benefits as in the traveling salesman problem domain. In addition, a comparison of several crossover operators is provided. We also compare the results of the permutation-based GA with a binary GA, demonstrating the benefits of the newly proposed representation.


Assuntos
Algoritmos , Modelos Químicos , Modelos Moleculares , Conformação de Ácido Nucleico , RNA/química , Alinhamento de Sequência/métodos , Análise de Sequência de RNA/métodos , Sequência de Bases , Simulação por Computador , Modelos Estatísticos , Dados de Sequência Molecular , RNA/genética , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...