Search | VHL Regional Portal

1.

Assessing the Quality of Cotranscriptional Folding Simulations.

Kühnl, Felix; Stadler, Peter F; Findeiß, Sven.

Methods Mol Biol ; 2726: 347-376, 2024.

Article in English | MEDLINE | ID: mdl-38780738

ABSTRACT

Structural changes in RNAs are an important contributor to controlling gene expression not only at the posttranscriptional stage but also during transcription. A subclass of riboswitches and RNA thermometers located in the 5' region of the primary transcript regulates the downstream functional unit - usually an ORF - through premature termination of transcription. Not only such elements occur naturally, but they are also attractive devices in synthetic biology. The possibility to design such riboswitches or RNA thermometers is thus of considerable practical interest. Since these functional RNA elements act already during transcription, it is important to model and understand the dynamics of folding and, in particular, the formation of intermediate structures concurrently with transcription. Cotranscriptional folding simulations are therefore an important step to verify the functionality of design constructs before conducting expensive and labor-intensive wet lab experiments. For RNAs, full-fledged molecular dynamics simulations are far beyond practical reach because of both the size of the molecules and the timescales of interest. Even at the simplified level of secondary structures, further approximations are necessary. The BarMap approach is based on representing the secondary structure landscape for each individual transcription step by a coarse-grained representation that only retains a small set of low-energy local minima and the energy barriers between them. The folding dynamics between two transcriptional elongation steps is modeled as a Markov process on this representation. Maps between pairs of consecutive coarse-grained landscapes make it possible to follow the folding process as it changes in response to transcription elongation. In its original implementation, the BarMap software provides a general framework to investigate RNA folding dynamics on temporally changing landscapes. It is, however, difficult to use in particular for specific scenarios such as cotranscriptional folding. To overcome this limitation, we developed the user-friendly BarMap-QA pipeline described in detail in this contribution. It is illustrated here by an elaborate example that emphasizes the careful monitoring of several quality measures. Using an iterative workflow, a reliable and complete kinetics simulation of a synthetic, transcription-regulating riboswitch is obtained using minimal computational resources. All programs and scripts used in this contribution are free software and available for download as a source distribution for Linux® or as a platform-independent Docker® image including support for Apple macOS® and Microsoft Windows®.

Subject(s)

Molecular Dynamics Simulation , Nucleic Acid Conformation , RNA Folding , Transcription, Genetic , Riboswitch/genetics , RNA/chemistry , RNA/genetics , Software

2.

Tailored machine learning models for functional RNA detection in genome-wide screens.

Klapproth, Christopher; Zötzsche, Siegfried; Kühnl, Felix; Fallmann, Jörg; Stadler, Peter F; Findeiß, Sven.

NAR Genom Bioinform ; 5(3): lqad072, 2023 Sep.

Article in English | MEDLINE | ID: mdl-37608800

ABSTRACT

The in silico prediction of non-coding and protein-coding genetic loci has received considerable attention in comparative genomics aiming in particular at the identification of properties of nucleotide sequences that are informative of their biological role in the cell. We present here a software framework for the alignment-based training, evaluation and application of machine learning models with user-defined parameters. Instead of focusing on the one-size-fits-all approach of pervasive in silico annotation pipelines, we offer a framework for the structured generation and evaluation of models based on arbitrary features and input data, focusing on stable and explainable results. Furthermore, we showcase the usage of our software package in a full-genome screen of Drosophila melanogaster and evaluate our results against the well-known but much less flexible program RNAz.

3.

RNA Design Principles for Riboswitches that Regulate RNase P-Mediated tRNA Processing.

Ender, Anna; Stadler, Peter F; Mörl, Mario; Findeiß, Sven.

Methods Mol Biol ; 2518: 179-202, 2022.

Article in English | MEDLINE | ID: mdl-35666446

ABSTRACT

Riboswitches are an attractive target for the directed design of RNA-based regulators by in silico prediction. These noncoding RNA elements consist of an aptamer platform for the highly selective ligand recognition and an expression platform which controls gene activity typically at the level of transcription or translation. In previous work, we could successfully apply RNA folding prediction to implement a new riboswitch mechanism regulating processing of a tRNA by RNase P. In this contribution, we present detailed information about our pipeline consisting of in silico design combined with the biochemical analysis for the verification of the implemented mechanism. Furthermore, we discuss the applicability of the presented biochemical in vivo and in vitro methods for the characterization of other artificial riboswitches.

Subject(s)

Riboswitch , Nucleic Acid Conformation , RNA/chemistry , RNA Folding , RNA, Transfer/chemistry , RNA, Transfer/genetics , Ribonuclease P/genetics , Riboswitch/genetics

4.

Synthetic riboswitches for the analysis of tRNA processing by eukaryotic RNase P enzymes.

Ender, Anna; Grafl, Nadine; Kolberg, Tim; Findeiß, Sven; Stadler, Peter F; Mörl, Mario.

RNA ; 28(4): 551-567, 2022 04.

Article in English | MEDLINE | ID: mdl-35022261

ABSTRACT

Removal of the 5'-leader region is an essential step in the maturation of tRNA molecules in all domains of life. This reaction is catalyzed by various RNase P activities, ranging from ribonucleoproteins with ribozyme activity to protein-only forms. In Escherichia coli, the efficiency of RNase P-mediated cleavage can be controlled by computationally designed riboswitch elements in a ligand-dependent way, where the 5'-leader sequence of a tRNA precursor is either sequestered in a hairpin structure or presented as a single-stranded region accessible for maturation. In the presented work, the regulatory potential of such artificial constructs is tested on different forms of eukaryotic RNase P enzymes-two protein-only RNase P enzymes (PRORP1 and PRORP2) from Arabidopsis thaliana and the ribonucleoprotein of Homo sapiens The PRORP enzymes were analyzed in vitro as well as in vivo in a bacterial RNase P complementation system. We also tested in HEK293T cells whether the riboswitches remain functional with human nuclear RNase P. While the regulatory principle of the synthetic riboswitches applies for all tested RNase P enzymes, the results also show differences in the substrate requirements of the individual enzyme versions. Hence, such designed RNase P riboswitches represent a novel tool to investigate the impact of the structural composition of the 5'-leader on substrate recognition by different types of RNase P enzymes.

Subject(s)

Ribonuclease P , Riboswitch , Eukaryota/genetics , HEK293 Cells , Humans , RNA Precursors/metabolism , RNA, Transfer/chemistry , RNA, Transfer/genetics , Ribonuclease P/metabolism , Riboswitch/genetics

5.

Common Features in lncRNA Annotation and Classification: A Survey.

Klapproth, Christopher; Sen, Rituparno; Stadler, Peter F; Findeiß, Sven; Fallmann, Jörg.

Noncoding RNA ; 7(4)2021 Dec 13.

Article in English | MEDLINE | ID: mdl-34940758

ABSTRACT

Long non-coding RNAs (lncRNAs) are widely recognized as important regulators of gene expression. Their molecular functions range from miRNA sponging to chromatin-associated mechanisms, leading to effects in disease progression and establishing them as diagnostic and therapeutic targets. Still, only a few representatives of this diverse class of RNAs are well studied, while the vast majority is poorly described beyond the existence of their transcripts. In this review we survey common in silico approaches for lncRNA annotation. We focus on the well-established sets of features used for classification and discuss their specific advantages and weaknesses. While the available tools perform very well for the task of distinguishing coding sequence from other RNAs, we find that current methods are not well suited to distinguish lncRNAs or parts thereof from other non-protein-coding input sequences. We conclude that the distinction of lncRNAs from intronic sequences and untranslated regions of coding mRNAs remains a pressing research gap.

6.

Ligand-dependent tRNA processing by a rationally designed RNase P riboswitch.

Ender, Anna; Etzel, Maja; Hammer, Stefan; Findeiß, Sven; Stadler, Peter; Mörl, Mario.

Nucleic Acids Res ; 49(3): 1784-1800, 2021 02 22.

Article in English | MEDLINE | ID: mdl-33469651

ABSTRACT

We describe a synthetic riboswitch element that implements a regulatory principle which directly addresses an essential tRNA maturation step. Constructed using a rational in silico design approach, this riboswitch regulates RNase P-catalyzed tRNA 5'-processing by either sequestering or exposing the single-stranded 5'-leader region of the tRNA precursor in response to a ligand. A single base pair in the 5'-leader defines the regulatory potential of the riboswitch both in vitro and in vivo. Our data provide proof for prior postulates on the importance of the structure of the leader region for tRNA maturation. We demonstrate that computational predictions of ligand-dependent structural rearrangements can address individual maturation steps of stable non-coding RNAs, thus making them amenable as promising target for regulatory devices that can be used as functional building blocks in synthetic biology.

Subject(s)

RNA Processing, Post-Transcriptional , RNA, Transfer/metabolism , Ribonuclease P/metabolism , Riboswitch , Computer Simulation , Escherichia coli/genetics , Ligands , RNA, Transfer/chemistry , Ribonuclease P/chemistry , Thermodynamics

7.

Beyond Plug and Pray: Context Sensitivity and in silico Design of Artificial Neomycin Riboswitches.

Günzel, Christian; Kühnl, Felix; Arnold, Katharina; Findeiß, Sven; Weinberg, Christina E; Stadler, Peter F; Mörl, Mario.

RNA Biol ; 18(4): 457-467, 2021 04.

Article in English | MEDLINE | ID: mdl-32882151

ABSTRACT

Gene regulation in prokaryotes often depends on RNA elements such as riboswitches or RNA thermometers located in the 5' untranslated region of mRNA. Rearrangements of the RNA structure in response, e.g., to the binding of small molecules or ions control translational initiation or premature termination of transcription and thus mRNA expression. Such structural responses are amenable to computational modelling, making it possible to rationally design synthetic riboswitches for a given aptamer. Starting from an artificial aptamer, we construct the first synthetic transcriptional riboswitches that respond to the antibiotic neomycin. We show that the switching behaviour in vivo critically depends not only on the sequence of the riboswitch itself, but also on its sequence context. We therefore developed in silico methods to predict the impact of the context, making it possible to adapt the design and to rescue non-functional riboswitches. We furthermore analyse the influence of 5' hairpins with varying stability on neomycin riboswitch activity. Our data highlight the limitations of a simple plug-and-play approach in the design of complex genetic circuits and demonstrate that detailed computational models significantly simplify, improve, and automate the design of transcriptional circuits. Our design software is available under a free licence on GitHub (https://github.com/xileF1337/riboswitch_design).

Subject(s)

Cloning, Molecular/methods , Computational Biology/methods , Neomycin/chemistry , Riboswitch/genetics , Aptamers, Nucleotide/chemistry , Aptamers, Nucleotide/genetics , Computer Simulation , Gene Expression Regulation, Bacterial , Genes, Reporter/genetics , Neomycin/pharmacology , Nucleic Acid Conformation , RNA, Bacterial/analysis , RNA, Bacterial/chemistry , RNA, Bacterial/genetics , Software , Synthetic Biology

8.

Evolving methods for rational de novo design of functional RNA molecules.

Hammer, Stefan; Günzel, Christian; Mörl, Mario; Findeiß, Sven.

Methods ; 161: 54-63, 2019 05 15.

Article in English | MEDLINE | ID: mdl-31059832

ABSTRACT

Artificial RNA molecules with novel functionality have many applications in synthetic biology, pharmacy and white biotechnology. The de novo design of such devices using computational methods and prediction tools is a resource-efficient alternative to experimental screening and selection pipelines. In this review, we describe methods common to many such computational approaches, thoroughly dissect these methods and highlight open questions for the individual steps. Initially, it is essential to investigate the biological target system, the regulatory mechanism that will be exploited, as well as the desired components in order to define design objectives. Subsequent computational design is needed to combine the selected components and to obtain novel functionality. This process can usually be split into constrained sequence sampling, the formulation of an optimization problem and an in silico analysis to narrow down the number of candidates with respect to secondary goals. Finally, experimental analysis is important to check whether the defined design objectives are indeed met in the target environment and detailed characterization experiments should be performed to improve the mechanistic models and detect missing design requirements.

Subject(s)

Computational Biology/methods , RNA/analysis , RNA/genetics , Sequence Analysis, RNA/methods , Animals , Computational Biology/trends , Humans , RNA, Untranslated/analysis , RNA, Untranslated/genetics , Sequence Analysis, RNA/trends , Synthetic Biology/methods , Synthetic Biology/trends

9.

Quantitative Characterization of Translational Riboregulators Using an in Vitro Transcription-Translation System.

Senoussi, Anis; Lee Tin Wah, Jonathan; Shimizu, Yoshihiro; Robert, Jérôme; Jaramillo, Alfonso; Findeiss, Sven; Axmann, Ilka M; Estevez-Torres, André.

ACS Synth Biol ; 7(5): 1269-1278, 2018 05 18.

Article in English | MEDLINE | ID: mdl-29617125

ABSTRACT

Riboregulators are short RNA sequences that, upon binding to a ligand, change their secondary structure and influence the expression rate of a downstream gene. They constitute an attractive alternative to transcription factors for building synthetic gene regulatory networks because they can be engineered de novo. However, riboregulators are generally designed in silico and tested in vivo, which provides little quantitative information about their performances, thus hindering the improvement of design algorithms. Here we show that a cell-free transcription-translation (TX-TL) system provides valuable information about the performances of in silico designed riboregulators. We first propose a simple model that provides a quantitative definition of the dynamic range of a riboregulator. We further characterize two types of translational riboregulators composed of a cis-repressed (cr) and a trans-activating (ta) strand. At the DNA level we demonstrate that high concentrations of taDNA poisoned the activator until total shut off, in agreement with our model, and that relative dynamic ranges of riboregulators determined in vitro are in agreement with published in vivo data. At the RNA level, we show that this approach provides a fast and simple way to measure dissociation constants of functional riboregulators, in contrast to standard mobility-shift assays. Our method opens the route for using cell-free TX-TL systems for the quantitative characterization of functional riboregulators in order to improve their design in silico.

Subject(s)

Gene Expression Regulation , Genetic Techniques , RNA/chemistry , RNA/genetics , 5' Untranslated Regions , Cell-Free System , DNA/chemistry , DNA/genetics , DNA-Directed RNA Polymerases/genetics , Fluorescence , Models, Genetic , Nucleic Acid Conformation , Protein Biosynthesis , RNA, Transfer , Real-Time Polymerase Chain Reaction , Transcription, Genetic , Viral Proteins/genetics

10.

In silico design of ligand triggered RNA switches.

Findeiß, Sven; Hammer, Stefan; Wolfinger, Michael T; Kühnl, Felix; Flamm, Christoph; Hofacker, Ivo L.

Methods ; 143: 90-101, 2018 07 01.

Article in English | MEDLINE | ID: mdl-29660485

ABSTRACT

This contribution sketches a work flow to design an RNA switch that is able to adapt two structural conformations in a ligand-dependent way. A well characterized RNA aptamer, i.e., knowing its Kd and adaptive structural features, is an essential ingredient of the described design process. We exemplify the principles using the well-known theophylline aptamer throughout this work. The aptamer in its ligand-binding competent structure represents one structural conformation of the switch while an alternative fold that disrupts the binding-competent structure forms the other conformation. To keep it simple we do not incorporate any regulatory mechanism to control transcription or translation. We elucidate a commonly used design process by explicitly dissecting and explaining the necessary steps in detail. We developed a novel objective function which specifies the mechanistics of this simple, ligand-triggered riboswitch and describe an extensive in silico analysis pipeline to evaluate important kinetic properties of the designed sequences. This protocol and the developed software can be easily extended or adapted to fit novel design scenarios and thus can serve as a template for future needs.

Subject(s)

Aptamers, Nucleotide/chemical synthesis , Computational Biology/methods , Nucleic Acid Conformation , Riboswitch/genetics , Aptamers, Nucleotide/genetics , Computational Biology/instrumentation , Kinetics , Ligands , RNA Folding , Software

11.

Design of Artificial Riboswitches as Biosensors.

Findeiß, Sven; Etzel, Maja; Will, Sebastian; Mörl, Mario; Stadler, Peter F.

Sensors (Basel) ; 17(9)2017 Aug 30.

Article in English | MEDLINE | ID: mdl-28867802

ABSTRACT

RNA aptamers readily recognize small organic molecules, polypeptides, as well as other nucleic acids in a highly specific manner. Many such aptamers have evolved as parts of regulatory systems in nature. Experimental selection techniques such as SELEX have been very successful in finding artificial aptamers for a wide variety of natural and synthetic ligands. Changes in structure and/or stability of aptamers upon ligand binding can propagate through larger RNA constructs and cause specific structural changes at distal positions. In turn, these may affect transcription, translation, splicing, or binding events. The RNA secondary structure model realistically describes both thermodynamic and kinetic aspects of RNA structure formation and refolding at a single, consistent level of modelling. Thus, this framework allows studying the function of natural riboswitches in silico. Moreover, it enables rationally designing artificial switches, combining essentially arbitrary sensors with a broad choice of read-out systems. Eventually, this approach sets the stage for constructing versatile biosensors.

Subject(s)

Riboswitch , Aptamers, Nucleotide , Biosensing Techniques , Kinetics , Ligands

12.

RNAblueprint: flexible multiple target nucleic acid sequence design.

Hammer, Stefan; Tschiatschek, Birgit; Flamm, Christoph; Hofacker, Ivo L; Findeiß, Sven.

Bioinformatics ; 33(18): 2850-2858, 2017 Sep 15.

Article in English | MEDLINE | ID: mdl-28449031

ABSTRACT

MOTIVATION: Realizing the value of synthetic biology in biotechnology and medicine requires the design of molecules with specialized functions. Due to its close structure to function relationship, and the availability of good structure prediction methods and energy models, RNA is perfectly suited to be synthetically engineered with predefined properties. However, currently available RNA design tools cannot be easily adapted to accommodate new design specifications. Furthermore, complicated sampling and optimization methods are often developed to suit a specific RNA design goal, adding to their inflexibility. RESULTS: We developed a C ++ library implementing a graph coloring approach to stochastically sample sequences compatible with structural and sequence constraints from the typically very large solution space. The approach allows to specify and explore the solution space in a well defined way. Our library also guarantees uniform sampling, which makes optimization runs performant by not only avoiding re-evaluation of already found solutions, but also by raising the probability of finding better solutions for long optimization runs. We show that our software can be combined with any other software package to allow diverse RNA design applications. Scripting interfaces allow the easy adaption of existing code to accommodate new scenarios, making the whole design process very flexible. We implemented example design approaches written in Python to demonstrate these advantages. AVAILABILITY AND IMPLEMENTATION: RNAblueprint , Python implementations and benchmark datasets are available at github: https://github.com/ViennaRNA . CONTACT: s.hammer@univie.ac.at, ivo@tbi.univie.ac.at or sven@tbi.univie.ac.at. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subject(s)

Models, Molecular , RNA/chemistry , Software , Synthetic Biology/methods , Nucleic Acid Conformation

13.

Applicability of a computational design approach for synthetic riboswitches.

Domin, Gesine; Findeiß, Sven; Wachsmuth, Manja; Will, Sebastian; Stadler, Peter F; Mörl, Mario.

Nucleic Acids Res ; 45(7): 4108-4119, 2017 04 20.

Article in English | MEDLINE | ID: mdl-27994029

ABSTRACT

Riboswitches have gained attention as tools for synthetic biology, since they enable researchers to reprogram cells to sense and respond to exogenous molecules. In vitro evolutionary approaches produced numerous RNA aptamers that bind such small ligands, but their conversion into functional riboswitches remains difficult. We previously developed a computational approach for the design of synthetic theophylline riboswitches based on secondary structure prediction. These riboswitches have been constructed to regulate ligand-dependent transcription termination in Escherichia coli. Here, we test the usability of this design strategy by applying the approach to tetracycline and streptomycin aptamers. The resulting tetracycline riboswitches exhibit robust regulatory properties in vivo. Tandem fusions of these riboswitches with theophylline riboswitches represent logic gates responding to two different input signals. In contrast, the conversion of the streptomycin aptamer into functional riboswitches appears to be difficult. Investigations of the underlying aptamer secondary structure revealed differences between in silico prediction and structure probing. We conclude that only aptamers adopting the minimal free energy (MFE) structure are suitable targets for construction of synthetic riboswitches with design approaches based on equilibrium thermodynamics of RNA structures. Further improvements in the design strategy are required to implement aptamer structures not corresponding to the calculated MFE state.

Subject(s)

Gene Expression Regulation , Riboswitch , Transcription Termination, Genetic , Aptamers, Nucleotide/chemistry , Computational Biology/methods , Computer Simulation , Escherichia coli/genetics , Genes, Reporter , Nucleic Acid Conformation , Riboswitch/drug effects , Streptomycin/pharmacology , Tetracycline/pharmacology

14.

Design criteria for synthetic riboswitches acting on transcription.

Wachsmuth, Manja; Domin, Gesine; Lorenz, Ronny; Serfling, Robert; Findeiß, Sven; Stadler, Peter F; Mörl, Mario.

RNA Biol ; 12(2): 221-31, 2015.

Article in English | MEDLINE | ID: mdl-25826571

ABSTRACT

Riboswitches are RNA-based regulators of gene expression composed of a ligand-sensing aptamer domain followed by an overlapping expression platform. The regulation occurs at either the level of transcription (by formation of terminator or antiterminator structures) or translation (by presentation or sequestering of the ribosomal binding site). Due to a modular composition, these elements can be manipulated by combining different aptamers and expression platforms and therefore represent useful tools to regulate gene expression in synthetic biology. Using computationally designed theophylline-dependent riboswitches we show that 2 parameters, terminator hairpin stability and folding traps, have a major impact on the functionality of the designed constructs. These have to be considered very carefully during design phase. Furthermore, a combination of several copies of individual riboswitches leads to a much improved activation ratio between induced and uninduced gene activity and to a linear dose-dependent increase in reporter gene expression. Such serial arrangements of synthetic riboswitches closely resemble their natural counterparts and may form the basis for simple quantitative read out systems for the detection of specific target molecules in the cell.

Subject(s)

Drug Design , Riboswitch , Transcription, Genetic , Escherichia coli/genetics , Escherichia coli/metabolism , Genes, Reporter , Green Fluorescent Proteins/genetics , Green Fluorescent Proteins/metabolism , Molecular Sequence Data , Nucleic Acid Conformation , Structure-Activity Relationship , Synthetic Biology , Theophylline/chemistry , Thermodynamics , beta-Galactosidase/genetics , beta-Galactosidase/metabolism

15.

Design of transcription regulating riboswitches.

Findeiß, Sven; Wachsmuth, Manja; Mörl, Mario; Stadler, Peter F.

Methods Enzymol ; 550: 1-22, 2015.

Article in English | MEDLINE | ID: mdl-25605378

ABSTRACT

In this chapter, we review both computational and experimental aspects of de novo RNA sequence design. We give an overview of currently available design software and their limitations, and discuss the necessary setup to experimentally validate proper function in vitro and in vivo. We focus on transcription-regulating riboswitches, a task that has just recently lead to first successful designs of such RNA elements.

Subject(s)

Riboswitch/physiology , Base Sequence/genetics , Base Sequence/physiology , RNA/chemistry , RNA/genetics , RNA Folding , Riboswitch/genetics

16.

Bioinformatics of prokaryotic RNAs.

Backofen, Rolf; Amman, Fabian; Costa, Fabrizio; Findeiß, Sven; Richter, Andreas S; Stadler, Peter F.

RNA Biol ; 11(5): 470-83, 2014.

Article in English | MEDLINE | ID: mdl-24755880

ABSTRACT

The genome of most prokaryotes gives rise to surprisingly complex transcriptomes, comprising not only protein-coding mRNAs, often organized as operons, but also harbors dozens or even hundreds of highly structured small regulatory RNAs and unexpectedly large levels of anti-sense transcripts. Comprehensive surveys of prokaryotic transcriptomes and the need to characterize also their non-coding components is heavily dependent on computational methods and workflows, many of which have been developed or at least adapted specifically for the use with bacterial and archaeal data. This review provides an overview on the state-of-the-art of RNA bioinformatics focusing on applications to prokaryotes.

Subject(s)

Computational Biology , Prokaryotic Cells/metabolism , RNA/genetics , Base Pairing , Computational Biology/methods , Gene Expression Profiling , Genomics , Nucleic Acid Conformation , RNA/chemistry , Transcriptome

17.

TSSAR: TSS annotation regime for dRNA-seq data.

Amman, Fabian; Wolfinger, Michael T; Lorenz, Ronny; Hofacker, Ivo L; Stadler, Peter F; Findeiß, Sven.

BMC Bioinformatics ; 15: 89, 2014 Mar 27.

Article in English | MEDLINE | ID: mdl-24674136

ABSTRACT

BACKGROUND: Differential RNA sequencing (dRNA-seq) is a high-throughput screening technique designed to examine the architecture of bacterial operons in general and the precise position of transcription start sites (TSS) in particular. Hitherto, dRNA-seq data were analyzed by visualizing the sequencing reads mapped to the reference genome and manually annotating reliable positions. This is very labor intensive and, due to the subjectivity, biased. RESULTS: Here, we present TSSAR, a tool for automated de novo TSS annotation from dRNA-seq data that respects the statistics of dRNA-seq libraries. TSSAR uses the premise that the number of sequencing reads starting at a certain genomic position within a transcriptional active region follows a Poisson distribution with a parameter that depends on the local strength of expression. The differences of two dRNA-seq library counts thus follow a Skellam distribution. This provides a statistical basis to identify significantly enriched primary transcripts.We assessed the performance by analyzing a publicly available dRNA-seq data set using TSSAR and two simple approaches that utilize user-defined score cutoffs. We evaluated the power of reproducing the manual TSS annotation. Furthermore, the same data set was used to reproduce 74 experimentally validated TSS in H. pylori from reliable techniques such as RACE or primer extension. Both analyses showed that TSSAR outperforms the static cutoff-dependent approaches. CONCLUSIONS: Having an automated and efficient tool for analyzing dRNA-seq data facilitates the use of the dRNA-seq technique and promotes its application to more sophisticated analysis. For instance, monitoring the plasticity and dynamics of the transcriptomal architecture triggered by different stimuli and growth conditions becomes possible.The main asset of a novel tool for dRNA-seq analysis that reaches out to a broad user community is usability. As such, we provide TSSAR both as intuitive RESTful Web service ( http://rna.tbi.univie.ac.at/TSSAR) together with a set of post-processing and analysis tools, as well as a stand-alone version for use in high-throughput dRNA-seq data analysis pipelines.

Subject(s)

High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, RNA/methods , Base Sequence , Databases, Nucleic Acid , Genome , Genomics/methods , Helicobacter pylori/genetics , Humans , Software , Stenotrophomonas maltophilia/genetics

18.

Identification of new protein coding sequences and signal peptidase cleavage sites of Helicobacter pylori strain 26695 by proteogenomics.

Müller, Stephan A; Findeiß, Sven; Pernitzsch, Sandy R; Wissenbach, Dirk K; Stadler, Peter F; Hofacker, Ivo L; von Bergen, Martin; Kalkhof, Stefan.

J Proteomics ; 86: 27-42, 2013 Jun 28.

Article in English | MEDLINE | ID: mdl-23665149

ABSTRACT

Correct annotation of protein coding genes is the basis of conventional data analysis in proteomic studies. Nevertheless, most protein sequence databases almost exclusively rely on gene finding software and inevitably also miss protein annotations or possess errors. Proteogenomics tries to overcome these issues by matching MS data directly against a genome sequence database. Here we report an in-depth proteogenomics study of Helicobacter pylori strain 26695. MS data was searched against a combined database of the NCBI annotations and a six-frame translation of the genome. Database searches with Mascot and X! Tandem revealed 1115 proteins identified by at least two peptides with a peptide false discovery rate below 1%. This represents 71% of the predicted proteome. So far this is the most extensive proteome study of Helicobacter pylori. Our proteogenomic approach unambiguously identified four previously missed annotations and furthermore allowed us to correct sequences of six annotated proteins. Since secreted proteins are often involved in pathogenic processes we further investigated signal peptidase cleavage sites. By applying a database search that accommodates the identification of semi-specific cleaved peptides, 63 previously unknown signal peptides were detected. The motif LXA showed to be the predominant recognition sequence for signal peptidases. BIOLOGICAL SIGNIFICANCE: The results of MS-based proteomic studies highly rely on correct annotation of protein coding genes which is the basis of conventional data analysis. However, the annotation of protein coding sequences in genomic data is usually based on gene finding software. These tools are limited in their prediction accuracy such as the problematic determination of exact gene boundaries. Thus, protein databases own partly erroneous or incomplete sequences. Additionally, some protein sequences might also be missing in the databases. Proteogenomics, a combination of proteomic and genomic data analyses, is well suited to detect previously not annotated proteins and to correct erroneous sequences. For this purpose, the existing database of the investigated species is typically supplemented with a six-frame translation of the genome. Here, we studied the proteome of the major human pathogen Helicobacter pylori that is responsible for many gastric diseases such as duodenal ulcers and gastric cancer. Our in-depth proteomic study highly reliably identified 1115 proteins (FDR<0.01%) by at least two peptides (FDR<1%) which represent 71% of the predicted proteome deposited at NCBI. The proteogenomic data analysis of our data set resulted in the unambiguous identification of four previously missed annotations, the correction of six annotated proteins as well as the detection of 63 previously unknown signal peptides. We have annotated proteins of particular biological interest like the ferrous iron transport protein A, the coiled-coil-rich protein HP0058 and the lipopolysaccharide biosynthesis protein HP0619. For instance, the protein HP0619 could be a drug target for the inhibition of the LPS synthesis pathway. Furthermore it has been proven that the motif "LXA" is the predominant recognition sequence for the signal peptidase I of H. pylori. Signal peptidases are essential enzymes for the viability of bacterial cells and are involved in pathogenesis. Therefore signal peptidases could be novel targets for antibiotics. The inclusion of the corrected and new annotated proteins as well as the information of signal peptide cleavage sites will help in the study of biological pathways involved in pathogenesis or drug response of H. pylori.

Subject(s)

Databases, Protein , Genomics/methods , Helicobacter pylori/genetics , Membrane Proteins/metabolism , Protein Sorting Signals/genetics , Proteomics/methods , Serine Endopeptidases/metabolism , Frameshift Mutation , Mass Spectrometry , Molecular Sequence Annotation , Protein Sorting Signals/physiology

19.

Mapping the RNA-Seq trash bin: unusual transcripts in prokaryotic transcriptome sequencing data.

Doose, Gero; Alexis, Maria; Kirsch, Rebecca; Findeiß, Sven; Langenberger, David; Machné, Rainer; Mörl, Mario; Hoffmann, Steve; Stadler, Peter F.

RNA Biol ; 10(7): 1204-10, 2013 Jul.

Article in English | MEDLINE | ID: mdl-23702463

ABSTRACT

Prokaryotic transcripts constitute almost always uninterrupted intervals when mapped back to the genome. Split reads, i.e., RNA-seq reads consisting of parts that only map to discontiguous loci, are thus disregarded in most analysis pipelines. There are, however, some well-known exceptions, in particular, tRNA splicing and circularized small RNAs in Archaea as well as self-splicing introns. Here, we reanalyze a series of published RNA-seq data sets, screening them specifically for non-contiguously mapping reads. We recover most of the known cases together with several novel archaeal ncRNAs associated with circularized products. In Eubacteria, only a handful of interesting candidates were obtained beyond a few previously described group I and group II introns. Most of the atypically mapping reads do not appear to correspond to well-defined, specifically processed products. Whether this diffuse background is, at least in part, an incidental by-product of prokaryotic RNA processing or whether it consists entirely of technical artifacts of reverse transcription or amplification remains unknown.

Subject(s)

Computational Biology/methods , Prokaryotic Cells/metabolism , RNA/chemistry , Sequence Analysis, RNA , Transcriptome , Archaea/genetics , Bacteria/genetics , Genomics/methods , Molecular Sequence Annotation , RNA/genetics

20.

De novo design of a synthetic riboswitch that regulates transcription termination.

Wachsmuth, Manja; Findeiß, Sven; Weissheimer, Nadine; Stadler, Peter F; Mörl, Mario.

Nucleic Acids Res ; 41(4): 2541-51, 2013 Feb 01.

Article in English | MEDLINE | ID: mdl-23275562

ABSTRACT

Riboswitches are regulatory RNA elements typically located in the 5'-untranslated region of certain mRNAs and control gene expression at the level of transcription or translation. These elements consist of a sensor and an adjacent actuator domain. The sensor usually is an aptamer that specifically interacts with a ligand. The actuator contains an intrinsic terminator or a ribosomal binding site for transcriptional or translational regulation, respectively. Ligand binding leads to structural rearrangements of the riboswitch and to presentation or masking of these regulatory elements. Based on this modular organization, riboswitches are an ideal target for constructing synthetic regulatory systems for gene expression. Although riboswitches for translational control have been designed successfully, attempts to construct synthetic elements regulating transcription have failed so far. Here, we present an in silico pipeline for the rational design of synthetic riboswitches that regulate gene expression at the transcriptional level. Using the well-characterized theophylline aptamer as sensor, we designed the actuator part as RNA sequences that can fold into functional intrinsic terminator structures. In the biochemical characterization, several of the designed constructs show ligand-dependent control of gene expression in Escherichia coli, demonstrating that it is possible to engineer riboswitches not only for translational but also for transcriptional regulation.

Subject(s)

Gene Expression Regulation , Riboswitch , Terminator Regions, Genetic , Transcription Termination, Genetic , Aptamers, Nucleotide/chemistry , RNA/chemistry , Riboswitch/drug effects , Theophylline/pharmacology , Transcription Termination, Genetic/drug effects

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL