Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 7 de 7
Filter
Add more filters










Database
Language
Publication year range
1.
Bioinformatics ; 35(22): 4553-4559, 2019 11 01.
Article in English | MEDLINE | ID: mdl-30993337

ABSTRACT

MOTIVATION: MicroRNAs form an important class of RNA regulators that has been studied extensively. The miRBase and Rfam database provide rich, frequently updated information on both pre-miRNAs and their mature forms. These data sources, however, rely on individual data submission and thus are neither complete nor consistent in their coverage across different miRNA families. Quantitative studies of miRNA evolution therefore are difficult or impossible on this basis. RESULTS: We present here a workflow and a corresponding implementation, MIRfix, that automatically curates miRNA datasets by improving alignments of their precursors, the consistency of the annotation of mature miR and miR* sequence, and the phylogenetic coverage. MIRfix produces alignments that are comparable across families and sets the stage for improved homology search as well as quantitative analyses. AVAILABILITY AND IMPLEMENTATION: MIRfix can be downloaded from https://github.com/Bierinformatik/MIRfix. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
MicroRNAs/genetics , Animals , Information Storage and Retrieval , Phylogeny
2.
RNA Biol ; 15(2): 158-164, 2018 02 01.
Article in English | MEDLINE | ID: mdl-29219696

ABSTRACT

The 7SK RNA is a small nuclear RNA that is involved in the regulation of Pol-II transcription. It is very well conserved in vertebrates, but shows extensive variations in both sequence and structure across invertebrates. A systematic homology search extended the collection of 7SK genes in both Arthropods and Lophotrochozoa making use of the large number of recently published invertebrate genomes. The extended data set made it possible to infer complete consensus structures for invertebrate 7SK RNAs. These show that not only the well-conserved 5'- and 3'- domains but all the interior Stem A domain is universally conserved. In contrast, Stem B region exhibits substantial structural variation and does not adhere to a common structural model beyond phylum level.


Subject(s)
Invertebrates/genetics , RNA, Small Cytoplasmic/chemistry , RNA, Small Cytoplasmic/genetics , Signal Recognition Particle/chemistry , Signal Recognition Particle/genetics , Animals , Conserved Sequence , Evolution, Molecular , Invertebrates/chemistry , Models, Molecular , Nucleic Acid Conformation , Phylogeny , Sequence Homology, Nucleic Acid
3.
Algorithms Mol Biol ; 12: 22, 2017.
Article in English | MEDLINE | ID: mdl-28852417

ABSTRACT

BACKGROUND: Genomic DNA frequently undergoes rearrangement of the gene order that can be localized by comparing the two DNA sequences. In mitochondrial genomes different mechanisms are likely at work, at least some of which involve the duplication of sequence around the location of the apparent breakpoints. We hypothesize that these different mechanisms of genome rearrangement leave distinctive sequence footprints. In order to study such effects it is important to locate the breakpoint positions with precision. RESULTS: We define a partially local sequence alignment problem that assumes that following a rearrangement of a sequence F, two fragments L, and R are produced that may exactly fit together to match F, leave a gap of deleted DNA between L and R, or overlap with each other. We show that this alignment problem can be solved by dynamic programming in cubic space and time. We apply the new method to evaluate rearrangements of animal mitogenomes and find that a surprisingly large fraction of these events involved local sequence duplications. CONCLUSIONS: The partially local sequence alignment method is an effective way to investigate the mechanism of genomic rearrangement events. While applied here only to mitogenomes there is no reason why the method could not be used to also consider rearrangements in nuclear genomes.

4.
J Integr Bioinform ; 14(1)2017 Jun 05.
Article in English | MEDLINE | ID: mdl-28637930

ABSTRACT

The miRBase currently reports more than 25,000 microRNAs in several hundred genomes that belong to more than 1000 families of homologous sequences. Quantitative investigations of miRNA gene evolution requires the construction of data sets that are consistent in their coverage and include those genomes that are of interest in a given study. Given the size and structure of data, this can be achieved only with the help of a fully automatic pipeline that improves the available seed alignments, extends the set of available sequences by homology search, and reliably identifies true positive homology search results. Here we describe the current progress towards such a system, emphasizing the task of improving and completing the initial seed alignment.


Subject(s)
Evolution, Molecular , MicroRNAs/genetics , Sequence Alignment/methods , Sequence Homology , Animals , Automation , Databases, Genetic , Datasets as Topic , Genome/genetics , Humans
5.
Mol Phylogenet Evol ; 106: 209-216, 2017 01.
Article in English | MEDLINE | ID: mdl-27693569

ABSTRACT

Mitochondrial genome sequences are available in large number and new sequences become published nowadays with increasing pace. Fast, automatic, consistent, and high quality annotations are a prerequisite for downstream analyses. Therefore, we present an automated pipeline for fast de novo annotation of mitochondrial protein-coding genes. The annotation is based on enhanced phylogeny-aware hidden Markov models (HMMs). The pipeline builds taxon-specific enhanced multiple sequence alignments (MSA) of already annotated sequences and corresponding HMMs using an approximation of the phylogeny. The MSAs are enhanced by fixing unannotated frameshifts, purging of wrong sequences, and removal of non-conserved columns from both ends. A comparison with reference annotations highlights the high quality of the results. The frameshift correction method predicts a large number of frameshifts, many of which are unknown. A detailed analysis of the frameshifts in nad3 of the Archosauria-Testudines group has been conducted.


Subject(s)
Genome, Mitochondrial , Animals , Base Sequence , Birds/classification , DNA, Mitochondrial/chemistry , DNA, Mitochondrial/classification , DNA, Mitochondrial/metabolism , Databases, Genetic , Frameshift Mutation , Markov Chains , Mitochondrial Proteins/chemistry , Mitochondrial Proteins/classification , Mitochondrial Proteins/genetics , Mitochondrial Proteins/metabolism , Molecular Sequence Data , Phylogeny , Reptiles/classification , Sequence Alignment
6.
Nucleic Acids Res ; 43(16): 8044-56, 2015 Sep 18.
Article in English | MEDLINE | ID: mdl-26227972

ABSTRACT

Remolding of tRNAs is a well-documented process in mitochondrial genomes that changes the identity of a tRNA. It involves a duplication of a tRNA gene, a mutation that changes the anticodon and the loss of the ancestral tRNA gene. The net effect is a functional tRNA that is more closely related to tRNAs of a different alloacceptor family than to tRNAs with the same anticodon in related species. Beyond being of interest for understanding mitochondrial tRNA function and evolution, tRNA remolding events can lead to artifacts in the annotation of mitogenomes and thus in studies of mitogenomic evolution. Therefore, it is important to identify and catalog these events. Here we describe novel methods to detect tRNA remolding in large-scale data sets and apply them to survey tRNA remolding throughout animal evolution. We identify several novel remolding events in addition to the ones previously mentioned in the literature. A detailed analysis of these remoldings showed that many of them are derived from ancestral events.


Subject(s)
Evolution, Molecular , Genome, Mitochondrial , RNA, Transfer/genetics , Animals , Anticodon , Codon , Crustacea/genetics , Mutation , Porifera/genetics , RNA, Transfer, Leu/genetics , Sequence Alignment
7.
Mitochondrion ; 17: 56-66, 2014 Jul.
Article in English | MEDLINE | ID: mdl-24911382

ABSTRACT

The comprehensive understanding of mitochondrial genome evolution requires a detailed mechanistic picture of mitogenomic replication. Despite many previous efforts it has remained a non-trivial problem to determine the origins of replication and trace their fate across rearrangements of the gene order even in the small genomes of animal mitochondria. We elaborate here on the observation that the GC skew is correlated with the distance from the replication origins. This effect has been explained as a consequence of the standard model of mitochondrial DNA replication, i.e. the strand displacement model. According to this model chemical damage accumulates proportional to the duration that DNA is exposed in single-stranded form during replication (Dssh) which depends on the relative position with respect to the replication origins. Based on this model we developed a computational method to infer the positions of both the heavy strand and the light strand origin from nucleotide skew data. In a comprehensive survey of deuterostome mitochondria we infer conserved replication origins for the vast majority of vertebrates and cephalochordates. Deviations from the consensus picture are presumably associated with genome rearrangements.


Subject(s)
DNA Replication , DNA, Mitochondrial/chemistry , DNA, Mitochondrial/genetics , Replication Origin , Animals , Base Composition , Computational Biology/methods , Humans , Vertebrates
SELECTION OF CITATIONS
SEARCH DETAIL
...