Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 1.076
Filtrar
1.
Nat Protoc ; 16(9): 4355-4381, 2021 09.
Artigo em Inglês | MEDLINE | ID: mdl-34331052

RESUMO

Elongation, splicing and polyadenylation are fundamental steps of transcription, and studying their coordination requires simultaneous monitoring of these dynamic processes on one transcript. We recently developed a full-length nascent RNA sequencing method in the model plant Arabidopsis that simultaneously detects RNA polymerase II position, splicing status, polyadenylation site and poly(A) tail length at genome-wide scale. This method allows calculation of the kinetics of cotranscriptional splicing and detects polyadenylated transcripts with unspliced introns retained at specific positions posttranscriptionally. Here we describe a detailed protocol for this method called FLEP-seq (full-length elongating and polyadenylated RNA sequencing) that is applicable to plants. Library production requires as little as one nanogram of nascent RNA (after rRNA/tRNA removal), and either Nanopore or PacBio platforms can be used for sequencing. We also provide a complete bioinformatic pipeline from raw data processing to downstream analysis. The minimum time required for FLEP-seq, including RNA extraction and library preparation, is 36 h. The subsequent long-read sequencing and initial data analysis ranges between 31 and 40 h, depending on the sequencing platform.


Assuntos
Poli A/análise , RNA Polimerase II/análise , Splicing de RNA , Análise de Sequência de RNA/métodos , Software , Arabidopsis , Genômica/métodos , Poliadenilação
2.
Mol Hum Reprod ; 27(7)2021 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-34152407

RESUMO

Mammalian oocytes and embryos rely exclusively on maternal mRNAs to accomplish early developmental processes. Since oocytes and early embryos are transcriptionally silent after meiotic resumption, most of the synthesised maternal mRNA does not undergo immediate translation but is instead stored in the oocyte. Quantitative RT-PCR is commonly used to quantify mRNA levels, and correct quantification relies on reverse transcription and the choice of reference genes. Different methods for reverse transcription may affect gene expression determination in oocytes. In this study, we examined the suitability of either random or oligo(dT) primers for reverse transcription to be used for quantitative RT-PCR. We further looked for changes in poly(A) length of the maternal mRNAs during oocyte maturation. Our data indicate that depending on the method of reverse transcription, the optimal combination of reference genes for normalisation differed. Surprisingly, we observed a shortening of the poly(A) tail lengths of maternal mRNA as oocytes progressed from germinal vesicle to metaphase II. Overall, our findings suggest dynamic maternal regulation of mRNA structure and gene expression during oocyte maturation and early embryo development.


Assuntos
Blastômeros/metabolismo , Primers do DNA , Regulação da Expressão Gênica no Desenvolvimento , Mórula/metabolismo , Oócitos/metabolismo , RNA Mensageiro/metabolismo , Reação em Cadeia da Polimerase Via Transcriptase Reversa/métodos , Transcrição Reversa , Zigoto/metabolismo , Animais , Bovinos , Primers do DNA/síntese química , DNA Complementar/genética , Técnicas de Cultura Embrionária , Genes , Poli A/análise , RNA Mensageiro/genética , RNA Mensageiro/isolamento & purificação , Padrões de Referência , Criação de Embriões para Pesquisa , Alinhamento de Sequência , Homologia de Sequência do Ácido Nucleico
3.
Methods Mol Biol ; 2284: 543-567, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33835463

RESUMO

The poly(A) tail is a homopolymeric stretch of adenosine at the 3'-end of mature RNA transcripts and its length plays an important role in nuclear export, stability, and translational regulation of mRNA. Existing techniques for genome-wide estimation of poly(A) tail length are based on short-read sequencing. These methods are limited because they sequence a synthetic DNA copy of mRNA instead of the native transcripts. Furthermore, they can identify only a short segment of the transcript proximal to the poly(A) tail which makes it difficult to assign the measured poly(A) length uniquely to a single transcript isoform. With the introduction of native RNA sequencing by Oxford Nanopore Technologies, it is now possible to sequence full-length native RNA. A single long read contains both the transcript and the associated poly(A) tail, thereby making transcriptome-wide isoform-specific poly(A) tail length assessment feasible. We developed tailfindr-an R-based package for estimating poly(A) tail length from Oxford Nanopore sequencing data. In this chapter, we describe in detail the pipeline for transcript isoform-specific poly(A) tail profiling based on native RNA Nanopore sequencing-from library preparation to downstream data analysis with tailfindr.


Assuntos
Sequenciamento por Nanoporos/métodos , Poli A/análise , RNA/análise , Análise de Sequência de RNA/métodos , Animais , Estudos de Viabilidade , Perfilação da Expressão Gênica/métodos , Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Poli A/genética , Isoformas de Proteínas/análise , Isoformas de Proteínas/genética , RNA/química , RNA/genética , Processamento Pós-Transcricional do RNA , RNA Mensageiro/análise , RNA Mensageiro/genética , Transcriptoma , Peixe-Zebra/genética
4.
Biochemistry ; 59(22): 2041-2046, 2020 06 09.
Artigo em Inglês | MEDLINE | ID: mdl-32412236

RESUMO

Homopolymeric adenosine RNA plays numerous roles in both cells and noncellular genetic material. We report herein an unusual poly(A) signature in chemical mapping data generated by the Eterna Massive Open Laboratory. Poly(A) sequences of length seven or more show unexpected results in the selective 2'-hydroxyl acylation read out by primer extension (SHAPE) and dimethyl sulfate (DMS) chemical probing. This unusual signature first appears in poly(A) sequences of length seven and grows to its maximum strength at length ∼10. In a long poly(A) sequence, the substitution of a single A by any other nucleotide disrupts the signature, but only for the 6 or so nucleotides on the 5' side of the substitution.


Assuntos
Ensaios de Triagem em Larga Escala , Poli A/análise , RNA/análise
5.
Nucleic Acids Res ; 48(11): 5926-5938, 2020 06 19.
Artigo em Inglês | MEDLINE | ID: mdl-32421815

RESUMO

Alternative polyadenylation (APA) produces isoforms with distinct 3'-ends, yet their functional differences remain largely unknown. Here, we introduce the APA-seq method to detect the expression levels of APA isoforms from 3'-end RNA-Seq data by exploiting both paired-end reads for gene isoform identification and quantification. We detected the expression levels of APA isoforms in individual Caenorhabditis elegans embryos at different stages throughout embryogenesis. Examining the correlation between the temporal profiles of isoforms led us to distinguish two classes of genes: those with highly correlated isoforms (HCI) and those with lowly correlated isoforms (LCI) across time. We hypothesized that variants with similar expression profiles may be the product of biological noise, while the LCI variants may be under tighter selection and consequently their distinct 3' UTR isoforms are more likely to have functional consequences. Supporting this notion, we found that LCI genes have significantly more miRNA binding sites, more correlated expression profiles with those of their targeting miRNAs and a relative lack of correspondence between their transcription and protein abundances. Collectively, our results suggest that a lack of coherence among the regulation of 3' UTR isoforms is a proxy for selective pressures acting upon APA usage and consequently for their functional relevance.


Assuntos
Regiões 3' não Traduzidas/genética , Caenorhabditis elegans/embriologia , Caenorhabditis elegans/genética , Regulação da Expressão Gênica no Desenvolvimento , Poli A/análise , Poliadenilação , Animais , Drosophila melanogaster , Desenvolvimento Embrionário/genética , MicroRNAs/genética , MicroRNAs/metabolismo , Conformação de Ácido Nucleico , Xenopus laevis
6.
Nucleic Acids Res ; 48(D1): D174-D179, 2020 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-31617559

RESUMO

Generated by 3' end cleavage and polyadenylation at alternative polyadenylation (poly(A)) sites, alternative terminal exons account for much of the variation between human transcript isoforms. More than a dozen protocols have been developed so far for capturing and sequencing RNA 3' ends from a variety of cell types and species. In previous studies, we have used these data to uncover novel regulatory signals and cell type-specific isoforms. Here we present an update of the PolyASite (https://polyasite.unibas.ch) resource of poly(A) sites, constructed from publicly available human, mouse and worm 3' end sequencing datasets by enforcing uniform quality measures, including the flagging of putative internal priming sites. Through integrated processing of all data, we identified and clustered sites that are closely spaced and share polyadenylation signals, as these are likely the result of stochastic variations in processing. For each cluster, we identified the representative - most frequently processed - site and estimated the relative use in the transcriptome across all samples. We have established a modern web portal for efficient finding, exploration and export of data. Database generation is fully automated, greatly facilitating incorporation of new datasets and the updating of underlying genome resources.


Assuntos
Bases de Dados de Ácidos Nucleicos , Poliadenilação , Animais , Caenorhabditis elegans/genética , Humanos , Camundongos , Poli A/análise , Análise de Sequência de RNA
7.
Nat Commun ; 10(1): 5292, 2019 11 22.
Artigo em Inglês | MEDLINE | ID: mdl-31757970

RESUMO

Message RNA poly(A) tails are vital for their function and regulation. However, the full-length sequence of mRNA isoforms with their poly(A) tails remains undetermined. Here, we develop a method at single-cell level sensitivity that enables quantification of poly(A) tails along with the full-length cDNA while reading non-adenosine residues within poly(A) tails precisely, which we name poly(A) inclusive RNA isoform sequencing (PAIso-seq). Using this method, we can quantify isoform specific poly(A) tail length. More interestingly, we find that 17% of the mRNAs harbor non-A residues within the body of poly(A) tails in mouse GV oocytes. We show that PAIso-seq is sensitive enough to analyze single GV oocytes. These findings will not only provide an accurate and sensitive tool in studying poly(A) tails, but also open a door for the function and regulation of non-adenosine modifications within the body of poly(A) tails.


Assuntos
Nucleotídeos de Citosina/química , Nucleotídeos de Guanina/química , Oócitos/metabolismo , Poli A/química , RNA Mensageiro/química , Análise de Sequência de RNA/métodos , Nucleotídeos de Uracila/química , Animais , Proteínas de Ciclo Celular/genética , Ciclina B1/genética , Nucleotídeos de Citosina/análise , DNA (Citosina-5-)-Metiltransferase 1/genética , Nucleotídeos de Guanina/análise , Camundongos , Poli A/análise , Biossíntese de Proteínas , Isoformas de RNA , RNA Mensageiro/análise , Análise de Célula Única , Ativador de Plasminogênio Tecidual/genética , Nucleotídeos de Uracila/análise
8.
Nucleic Acids Res ; 47(5): 2263-2275, 2019 03 18.
Artigo em Inglês | MEDLINE | ID: mdl-30576549

RESUMO

Recent studies have shown that tissue-specific transcriptomes contain multiple types of RNAs that are transcribed from intronic and intergenic sequences. The current study presents a tool for the discovery of transcribed, unannotated sequence elements from RNA-seq libraries. This RNA Element (RE) discovery algorithm (REDa) was applied to a spectrum of tissues and cells representing germline, embryonic, and somatic tissues and examined as a function of differentiation through the first set of cell divisions of human development. This highlighted extensive transcription throughout the genome, yielding previously unidentified human spermatogenic RNAs. Both exonic and novel X-chromosome REs were subject to robust meiotic sex chromosome inactivation, although an extensive de-repression occurred in the post-meiotic stages of spermatogenesis. Surprisingly, 2.4% of the 10,395 X chromosome exonic REs were present in mature sperm. Transcribed genomic repetitive sequences, including simple centromeric repeats, HERVE and HSAT1, were also shown to be associated with RE expression during spermatogenesis. These results suggest that pervasive intergenic repetitive sequence expression during human spermatogenesis may play a role in regulating chromatin dynamics. Repetitive REs switching repeat classes during differentiation upon fertilization and embryonic genome activation was evident.


Assuntos
Algoritmos , Blastocisto/metabolismo , Oócitos/metabolismo , RNA Mensageiro/análise , RNA Mensageiro/genética , Sequências Reguladoras de Ácido Ribonucleico/genética , Análise de Sequência de RNA , Espermatozoides/metabolismo , Blastocisto/citologia , Diferenciação Celular , Linhagem Celular , Cromatina/genética , Cromatina/metabolismo , Cromossomos Humanos X/genética , Desenvolvimento Embrionário/genética , Éxons/genética , Feminino , Fertilização , Regulação da Expressão Gênica no Desenvolvimento , Genômica , Humanos , Fígado/citologia , Fígado/metabolismo , Masculino , Meiose/genética , Oócitos/citologia , Poli A/análise , Poli A/genética , Poli A/isolamento & purificação , RNA Mensageiro/isolamento & purificação , Sequências Repetitivas de Ácido Nucleico , Espermatogênese/genética , Espermatozoides/citologia , Transcrição Gênica , Inativação do Cromossomo X
9.
Langmuir ; 34(49): 14940-14945, 2018 12 11.
Artigo em Inglês | MEDLINE | ID: mdl-30462509

RESUMO

RNA sensing is of vital significance to advance our comprehension of gene expression and to further benefit medical diagnostics. Taking advantage of the excellent sensing capability of the aerolysin nanopore as a single-biomolecule interface, we for the first time achieved the direct characterization of single native RNA of Poly(A)4 and Poly(U)4. Poly(A)4 induces ∼10% larger blockade current amplitude than Poly(U)4. The statistical duration of Poly(A)4 is 18.83 ± 1.08 ms, which is 100 times longer than that of Poly(U)4. Our results demonstrated that the capture of RNA homopolymers is restricted by the biased diffusion. The translocation of RNA needs to overcome a lower free-energy barrier than that of DNA. Moreover, the strong RNA-aerolysin interaction is attributed to the hydroxyl in pentose, which prolongs the translocation time. This study opens an avenue for aerolysin nanopores to directly achieve RNA sensing, including discrimination of RNA epigenetic modification and selective detection of miRNA.


Assuntos
Toxinas Bacterianas/química , Nanoporos , Proteínas Citotóxicas Formadoras de Poros/química , RNA/análise , Aeromonas hydrophila/química , Técnicas Eletroquímicas/métodos , Bicamadas Lipídicas/química , Fosfatidilcolinas/química , Poli A/análise , Poli U/análise
10.
Anal Bioanal Chem ; 410(6): 1667-1677, 2018 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-29313076

RESUMO

The 3'-polyadenosine (poly A) tail of in vitro transcribed (IVT) mRNA was studied using liquid chromatography coupled to mass spectrometry (LC-MS). Poly A tails were cleaved from the mRNA using ribonuclease T1 followed by isolation with dT magnetic beads. Extracted tails were then analyzed by LC-MS which provided tail length information at single-nucleotide resolution. A 2100-nt mRNA with plasmid-encoded poly A tail lengths of either 27, 64, 100, or 117 nucleotides was used for these studies as enzymatically added poly A tails showed significant length heterogeneity. The number of As observed in the tails closely matched Sanger sequencing results of the DNA template, and even minor plasmid populations with sequence variations were detected. When the plasmid sequence contained a discreet number of poly As in the tail, analysis revealed a distribution that included tails longer than the encoded tail lengths. These observations were consistent with transcriptional slippage of T7 RNAP taking place within a poly A sequence. The type of RNAP did not alter the observed tail distribution, and comparison of T3, T7, and SP6 showed all three RNAPs produced equivalent tail length distributions. The addition of a sequence at the 3' end of the poly A tail did, however, produce narrower tail length distributions which supports a previously described model of slippage where the 3' end can be locked in place by having a G or C after the poly nucleotide region. Graphical abstract Determination of mRNA poly A tail length using magnetic beads and LC-MS.


Assuntos
Cromatografia Líquida de Alta Pressão/métodos , Espectrometria de Massas/métodos , Poli A/análise , RNA Mensageiro/química , Animais , Linhagem Celular , Camundongos , Plasmídeos/química , Plasmídeos/genética , Poli A/genética , Poliadenilação , RNA Mensageiro/genética , Transcrição Gênica
11.
Molecules ; 22(12)2017 Dec 13.
Artigo em Inglês | MEDLINE | ID: mdl-29236076

RESUMO

We demonstrate here for the first time that a guanidiniocarbonyl-pyrrole (GCP) unit can be applied for the fine recognition of single stranded RNA sequences-an intuitively unexpected result since so far binding of the GCP unit to ds-DNA or ds-RNA relied strongly on minor or major groove interactions, as shown in previous work. Two novel nucleobase-GCP isosteric conjugates differing in the flexibility of GCP unit revealed a fluorimetric recognition of various single stranded RNA, which could be additionally regulated by pH. The more rigid conjugate showed a specific fluorescence increase for poly A only at pH 7, whereby this response could be reversibly switched-off at pH 5. The more flexible derivative revealed selective fluorescence quenching by poly G at pH 7 but no change for poly A, whereas its recognition of poly AH⁺ can be switched-on at pH 5. The computational analysis confirmed the important role of the GCP fragment and its protonation states in the sensing of polynucleotides and revealed that it is affected by the intrinsic dynamical features of conjugates themselves. Both conjugates showed a negligible response to uracil and cytosine ss-RNA as well as ds-RNA at pH 7, and only weak interactions with ds-DNA. Thus, nucleobase-GCP conjugates can be considered as novel lead compounds for the design of ss-RNA or ss-DNA selective fluorimetric probes.


Assuntos
Fluorometria/métodos , Guanidinas/química , Sondas Moleculares/química , Pirróis/química , RNA/análise , DNA/química , Concentração de Íons de Hidrogênio , Poli A/análise , Poli G/análise , Soluções
12.
Nucleic Acids Res ; 45(12): e112, 2017 Jul 07.
Artigo em Inglês | MEDLINE | ID: mdl-28449108

RESUMO

The recent emergence of alternative polyadenylation (APA) as an engine driving transcriptomic diversity has stimulated the development of sequencing methodologies designed to assess genome-wide polyadenylation events. The goal of these approaches is to enrich, partition, capture and ultimately sequence poly(A) site junctions. However, these methods often require poly(A) enrichment, 3΄ linker ligation steps, and RNA fragmentation, which can necessitate higher levels of starting RNA, increase experimental error and potentially introduce bias. We recently reported a click-chemistry based method for generating RNAseq libraries called 'ClickSeq'. Here, we adapt this method to direct the cDNA synthesis specifically toward the 3΄UTR/poly(A) tail junction of cellular RNA. With this novel approach, we demonstrate sensitive and specific enrichment for poly(A) site junctions without the need for complex sample preparation, fragmentation or purification. Poly(A)-ClickSeq (PAC-seq) is therefore a simple procedure that generates high-quality RNA-seq poly(A) libraries. As a proof-of-principle, we utilized PAC-seq to explore the poly(A) landscape of both human and Drosophila cells in culture and observed outstanding overlap with existing poly(A) databases and also identified previously unannotated poly(A) sites. Moreover, we utilize PAC-seq to quantify and analyze APA events regulated by CFIm25 illustrating how this technology can be harnessed to identify alternatively polyadenylated RNA.


Assuntos
Regiões 3' não Traduzidas , Química Click/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Poli A/análise , RNA Mensageiro/análise , Transcriptoma , Animais , Sequência de Bases , DNA Complementar/genética , DNA Complementar/metabolismo , Bases de Dados Genéticas , Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , Perfilação da Expressão Gênica , Biblioteca Gênica , Células HeLa , Humanos , Anotação de Sequência Molecular , Poli A/genética , Poli A/metabolismo , Poliadenilação , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Análise de Sequência de RNA , Fatores de Poliadenilação e Clivagem de mRNA/genética , Fatores de Poliadenilação e Clivagem de mRNA/metabolismo
13.
Methods Mol Biol ; 1463: 93-102, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-27734350

RESUMO

mRNA regulation by poly(A) tail length variations plays an important role in many developmental processes. Recent advances have shown that, in particular, deadenylation (the shortening of mRNA poly(A) tails) is essential for germ-line stem cell biology in the Drosophila ovary. Therefore, a rapid and accurate method to analyze poly(A) tail lengths of specific mRNAs in this tissue is valuable. Several methods of poly(A) test (PAT) assays have been reported to measure mRNA poly(A) tail lengths in vivo. Here, we describe two of these methods (PAT and ePAT) that we have adapted for Drosophila ovarian germ cells and germ-line stem cells.


Assuntos
Drosophila/genética , Ovário/química , Poli A/análise , RNA Mensageiro/química , Animais , Feminino , Regulação da Expressão Gênica , Ovário/citologia , Poliadenilação , Nicho de Células-Tronco , Células-Tronco/química , Células-Tronco/citologia
14.
BMC Genomics ; 16: 511, 2015 Jul 09.
Artigo em Inglês | MEDLINE | ID: mdl-26155789

RESUMO

BACKGROUND: Messenger RNA polyadenylation is an essential step for the maturation of most eukaryotic mRNAs. Accurate determination of poly(A) sites helps define the 3'-ends of genes, which is important for genome annotation and gene function research. Genomic studies have revealed the presence of poly(A) sites in intergenic regions, which may be attributed to 3'-UTR extensions and novel transcript units. However, there is no systematically evaluation of intergenic poly(A) sites in plants. RESULTS: Approximately 16,000 intergenic poly(A) site clusters (IPAC) in Arabidopsis thaliana were discovered and evaluated at the whole genome level. Based on the distributions of distance from IPACs to nearby sense and antisense genes, these IPACs were classified into three categories. About 70 % of them were from previously unannotated 3'-UTR extensions to known genes, which would extend 6985 transcripts of TAIR10 genome annotation beyond their 3'-ends, with a mean extension of 134 nucleotides. 1317 IPACs were originated from novel intergenic transcripts, 37 of which were likely to be associated with protein coding transcripts. 2957 IPACs corresponded to antisense transcripts for genes on the reverse strand, which might affect 2265 protein coding genes and 39 non-protein-coding genes, including long non-coding RNA genes. The rest of IPACs could be originated from transcriptional read-through or gene mis-annotations. CONCLUSIONS: The identified IPACs corresponding to novel transcripts, 3'-UTR extensions, and antisense transcription should be incorporated into current Arabidopsis genome annotation. Comprehensive characterization of IPACs from this study provides insights of alternative polyadenylation and antisense transcription in plants.


Assuntos
Proteínas de Arabidopsis/genética , Arabidopsis/genética , Poli A/análise , Regiões 3' não Traduzidas , Regulação da Expressão Gênica de Plantas , Genoma de Planta , RNA Antissenso/genética , RNA de Plantas/genética
15.
Bioinformatics ; 31(10): 1671-3, 2015 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-25583118

RESUMO

MOTIVATION: Polyadenylation is an essential process during eukaryotic gene expression. Prediction of poly(A) sites helps to define the 3' end of genes, which is important for gene annotation and elucidating gene regulation mechanisms. However, due to limited knowledge of poly(A) signals, it is still challenging to predict poly(A) sites in plants and algae. PASPA is a web server for P: oly( A: ) S: ite prediction in P: lants and A: lgae, which integrates many in-house tools as add-ons to facilitate poly(A) site prediction, visualization and mining. This server can predict poly(A) sites for ten species, including seven previously poly(A) signal non-characterized species, with sensitivity and specificity in a range between 0.80 and 0.95.


Assuntos
Regiões 3' não Traduzidas , Poli A/análise , Poliadenilação , RNA de Plantas/química , Software , Algoritmos , Clorófitas/genética , Diatomáceas/genética , Internet , Rodófitas/genética
16.
Comput Biol Chem ; 54: 49-56, 2015 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-25594576

RESUMO

Polyadenylation is the process of addition of poly(A) tail to mRNA 3' ends. Identification of motifs controlling polyadenylation plays an essential role in improving genome annotation accuracy and better understanding of the mechanisms governing gene regulation. The bioinformatics methods used for poly(A) motifs recognition have demonstrated that information extracted from sequences surrounding the candidate motifs can differentiate true motifs from the false ones greatly. However, these methods depend on either domain features or string kernels. To date, methods combining information from different sources have not been found yet. Here, we proposed an improved poly(A) motifs recognition method by combing different sources based on decision level fusion. First of all, two novel prediction methods was proposed based on support vector machine (SVM): one method is achieved by using the domain-specific features and principle component analysis (PCA) method to eliminate the redundancy (PCA-SVM); the other method is based on Oligo string kernel (Oligo-SVM). Then we proposed a novel machine-learning method for poly(A) motif prediction by marrying four poly(A) motifs recognition methods, including two state-of-the-art methods (Random Forest (RF) and HMM-SVM), and two novel proposed methods (PCA-SVM and Oligo-SVM). A decision level information fusion method was employed to combine the decision values of different classifiers by applying the DS evidence theory. We evaluated our method on a comprehensive poly(A) dataset that consists of 14,740 samples on 12 variants of poly(A) motifs and 2750 samples containing none of these motifs. Our method has achieved accuracy up to 86.13%. Compared with the four classifiers, our evidence theory based method reduces the average error rate by about 30%, 27%, 26% and 16%, respectively. The experimental results suggest that the proposed method is more effective for poly(A) motif recognition.


Assuntos
Poli A/química , Poliadenilação , RNA Mensageiro/química , Máquina de Vetores de Suporte , Biologia Computacional/métodos , Biologia Computacional/estatística & dados numéricos , Dados de Sequência Molecular , Motivos de Nucleotídeos , Poli A/análise , Análise de Componente Principal , RNA Mensageiro/análise
17.
Nucleic Acids Res ; 43(Database issue): D59-67, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25378337

RESUMO

Increasing amounts of genes have been shown to utilize alternative polyadenylation (APA) 3'-processing sites depending on the cell and tissue type and/or physiological and pathological conditions at the time of processing, and the construction of genome-wide database regarding APA is urgently needed for better understanding poly(A) site selection and APA-directed gene expression regulation for a given biology. Here we present a web-accessible database, named APASdb (http://mosas.sysu.edu.cn/utr), which can visualize the precise map and usage quantification of different APA isoforms for all genes. The datasets are deeply profiled by the sequencing alternative polyadenylation sites (SAPAS) method capable of high-throughput sequencing 3'-ends of polyadenylated transcripts. Thus, APASdb details all the heterogeneous cleavage sites downstream of poly(A) signals, and maintains near complete coverage for APA sites, much better than the previous databases using conventional methods. Furthermore, APASdb provides the quantification of a given APA variant among transcripts with different APA sites by computing their corresponding normalized-reads, making our database more useful. In addition, APASdb supports URL-based retrieval, browsing and display of exon-intron structure, poly(A) signals, poly(A) sites location and usage reads, and 3'-untranslated regions (3'-UTRs). Currently, APASdb involves APA in various biological processes and diseases in human, mouse and zebrafish.


Assuntos
Bases de Dados de Ácidos Nucleicos , Poliadenilação , Animais , Expressão Gênica , Humanos , Internet , Camundongos , Poli A/análise , Clivagem do RNA , Peixe-Zebra/genética
18.
PLoS One ; 9(11): e111780, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25369024

RESUMO

The localisation of poly(A) RNA in plant cells containing either reticular (Allium cepa) or chromocentric (Lupinus luteus, Arabidopsis thaliana) nuclei was studied through in situ hybridisation. In both types of nuclei, the amount of poly(A) RNA was much greater in the nucleus than in the cytoplasm. In the nuclei, poly(A) RNA was present in structures resembling nuclear bodies. The molecular composition as well as the characteristic ultrastructure of the bodies containing poly(A) RNA demonstrated that they were Cajal bodies. We showed that some poly(A) RNAs in Cajal bodies code for proteins. However, examination of the localisation of active RNA polymerase II and in situ run-on transcription assays both demonstrated that CBs are not sites of transcription and that BrU-containing RNA accumulates in these structures long after synthesis. In addition, it was demonstrated that accumulation of poly(A) RNA occurs in the nuclei and CBs of hypoxia-treated cells. Our findings indicated that CBs may be involved in the later stages of poly(A) RNA metabolism, playing a role storage or retention.


Assuntos
Arabidopsis/citologia , Corpos Enovelados/ultraestrutura , Lupinus/citologia , Cebolas/citologia , Poli A/análise , RNA de Plantas/análise , Corpos Enovelados/química , RNA Mensageiro/análise , Ribonucleoproteínas Nucleares Pequenas/análise
19.
Nature ; 508(7494): 66-71, 2014 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-24476825

RESUMO

Poly(A) tails enhance the stability and translation of most eukaryotic messenger RNAs, but difficulties in globally measuring poly(A)-tail lengths have impeded greater understanding of poly(A)-tail function. Here we describe poly(A)-tail length profiling by sequencing (PAL-seq) and apply it to measure tail lengths of millions of individual RNAs isolated from yeasts, cell lines, Arabidopsis thaliana leaves, mouse liver, and zebrafish and frog embryos. Poly(A)-tail lengths were conserved between orthologous mRNAs, with mRNAs encoding ribosomal proteins and other 'housekeeping' proteins tending to have shorter tails. As expected, tail lengths were coupled to translational efficiencies in early zebrafish and frog embryos. However, this strong coupling diminished at gastrulation and was absent in non-embryonic samples, indicating a rapid developmental switch in the nature of translational control. This switch complements an earlier switch to zygotic transcriptional control and explains why the predominant effect of microRNA-mediated deadenylation concurrently shifts from translational repression to mRNA destabilization.


Assuntos
Regulação da Expressão Gênica no Desenvolvimento/genética , Poli A/análise , Biossíntese de Proteínas/genética , RNA Mensageiro/genética , Animais , Arabidopsis/genética , Sequência de Bases , Linhagem Celular , Drosophila melanogaster/embriologia , Drosophila melanogaster/genética , Gastrulação/genética , Humanos , Fígado/metabolismo , Camundongos , MicroRNAs/genética , MicroRNAs/metabolismo , Modelos Genéticos , Folhas de Planta/genética , Poli A/genética , Estabilidade de RNA/genética , RNA Mensageiro/metabolismo , Ribossomos/metabolismo , Análise de Sequência de RNA , Especificidade da Espécie , Transcrição Gênica , Xenopus/embriologia , Xenopus/genética , Leveduras/genética , Peixe-Zebra/embriologia , Peixe-Zebra/genética , Zigoto/metabolismo
20.
Artigo em Inglês | MEDLINE | ID: mdl-25571055

RESUMO

Polyadenylation including the cleavage of pre-mRNA and addition of a stretch of adenosines to the 3'-end is an essential step of pre-mRNA processing in eukayotes. The known regulatory role of polyadenylation in mRNA localization, stability, and translation and the emerging link between poly(A) and disease states underline the necessary to fully characterize polyadenylation sites. Several artificial intelligence methods have been proposed for poly(A) sites recognition. However, these methods are suitable to small subsets of genome sequences. It is necessary to propose a method for genome-wide recognition of poly(A) sites. Recent efforts have found a lot of poly(A) related factors on DNA level. Here, we proposed a novel genome-wide poly(A) recognition method based on the Condition Random Field (CRF) by integrating multiple features. Compared with the polya_svm (the most accurate program for prediction of poly(A) sites till date), our method had a higher performance with the area under ROC curve(0.8621 versus 0.6796). The result suggests that our method is an effective method in genome wide poly(A) sites recognition.


Assuntos
Genoma Humano , Poli A/análise , Regiões 3' não Traduzidas , Algoritmos , Área Sob a Curva , Inteligência Artificial , Sequência de Bases , Ilhas de CpG , Histonas/genética , Humanos , MicroRNAs/genética , MicroRNAs/metabolismo , Curva ROC , Máquina de Vetores de Suporte
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...