Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 10 de 10
Filter
Add more filters










Publication year range
1.
Bioinformatics ; 34(11): 1939-1940, 2018 06 01.
Article in English | MEDLINE | ID: mdl-29390042

ABSTRACT

Summary: JABAWS 2.2 is a computational framework that simplifies the deployment of web services for Bioinformatics. In addition to the five multiple sequence alignment (MSA) algorithms in JABAWS 1.0, JABAWS 2.2 includes three additional MSA programs (Clustal Omega, MSAprobs, GLprobs), four protein disorder prediction methods (DisEMBL, IUPred, Ronn, GlobPlot), 18 measures of protein conservation as implemented in AACon, and RNA secondary structure prediction by the RNAalifold program. JABAWS 2.2 can be deployed on a variety of in-house or hosted systems. JABAWS 2.2 web services may be accessed from the Jalview multiple sequence analysis workbench (Version 2.8 and later), as well as directly via the JABAWS command line interface (CLI) client. JABAWS 2.2 can be deployed on a local virtual server as a Virtual Appliance (VA) or simply as a Web Application Archive (WAR) for private use. Improvements in JABAWS 2.2 also include simplified installation and a range of utility tools for usage statistics collection, and web services querying and monitoring. The JABAWS CLI client has been updated to support all the new services and allow integration of JABAWS 2.2 services into conventional scripts. A public JABAWS 2 server has been in production since December 2011 and served over 800 000 analyses for users worldwide. Availability and implementation: JABAWS 2.2 is made freely available under the Apache 2 license and can be obtained from: http://www.compbio.dundee.ac.uk/jabaws. Contact: g.j.barton@dundee.ac.uk.


Subject(s)
Computational Biology/methods , Nucleic Acid Conformation , RNA/metabolism , Software , Algorithms , Internet , Models, Molecular , Proteostasis Deficiencies , RNA/chemistry , Sequence Alignment , Sequence Analysis, Protein/methods , Sequence Analysis, RNA/methods
3.
RNA ; 22(6): 839-51, 2016 06.
Article in English | MEDLINE | ID: mdl-27022035

ABSTRACT

RNA-seq is now the technology of choice for genome-wide differential gene expression experiments, but it is not clear how many biological replicates are needed to ensure valid biological interpretation of the results or which statistical tools are best for analyzing the data. An RNA-seq experiment with 48 biological replicates in each of two conditions was performed to answer these questions and provide guidelines for experimental design. With three biological replicates, nine of the 11 tools evaluated found only 20%-40% of the significantly differentially expressed (SDE) genes identified with the full set of 42 clean replicates. This rises to >85% for the subset of SDE genes changing in expression by more than fourfold. To achieve >85% for all SDE genes regardless of fold change requires more than 20 biological replicates. The same nine tools successfully control their false discovery rate at ≲5% for all numbers of replicates, while the remaining two tools fail to control their FDR adequately, particularly for low numbers of replicates. For future RNA-seq experiments, these results suggest that at least six biological replicates should be used, rising to at least 12 when it is important to identify SDE genes for all fold changes. If fewer than 12 replicates are used, a superior combination of true positive and false positive performances makes edgeR and DESeq2 the leading tools. For higher replicate numbers, minimizing false positives is more important and DESeq marginally outperforms the other tools.


Subject(s)
Sequence Analysis, RNA/methods , Gene Expression Profiling , RNA, Fungal/genetics , Reproducibility of Results , Saccharomyces cerevisiae/genetics
4.
Bioinformatics ; 31(22): 3625-30, 2015 Nov 15.
Article in English | MEDLINE | ID: mdl-26206307

ABSTRACT

MOTIVATION: High-throughput RNA sequencing (RNA-seq) is now the standard method to determine differential gene expression. Identifying differentially expressed genes crucially depends on estimates of read-count variability. These estimates are typically based on statistical models such as the negative binomial distribution, which is employed by the tools edgeR, DESeq and cuffdiff. Until now, the validity of these models has usually been tested on either low-replicate RNA-seq data or simulations. RESULTS: A 48-replicate RNA-seq experiment in yeast was performed and data tested against theoretical models. The observed gene read counts were consistent with both log-normal and negative binomial distributions, while the mean-variance relation followed the line of constant dispersion parameter of ∼0.01. The high-replicate data also allowed for strict quality control and screening of 'bad' replicates, which can drastically affect the gene read-count distribution. AVAILABILITY AND IMPLEMENTATION: RNA-seq data have been submitted to ENA archive with project ID PRJEB5348. CONTACT: g.j.barton@dundee.ac.uk.


Subject(s)
Models, Statistical , Sequence Analysis, RNA/methods , Base Sequence , Binomial Distribution , Gene Expression Profiling , Reproducibility of Results , Saccharomyces cerevisiae/genetics
5.
Sci Rep ; 5: 8771, 2015 Mar 05.
Article in English | MEDLINE | ID: mdl-25740547

ABSTRACT

Using whole-cell phenotypic assays, the GlaxoSmithKline high-throughput screening (HTS) diversity set of 1.8 million compounds was screened against the three kinetoplastids most relevant to human disease, i.e. Leishmania donovani, Trypanosoma cruzi and Trypanosoma brucei. Secondary confirmatory and orthogonal intracellular anti-parasiticidal assays were conducted, and the potential for non-specific cytotoxicity determined. Hit compounds were chemically clustered and triaged for desirable physicochemical properties. The hypothetical biological target space covered by these diversity sets was investigated through bioinformatics methodologies. Consequently, three anti-kinetoplastid chemical boxes of ~200 compounds each were assembled. Functional analyses of these compounds suggest a wide array of potential modes of action against kinetoplastid kinases, proteases and cytochromes as well as potential host-pathogen targets. This is the first published parallel high throughput screening of a pharma compound collection against kinetoplastids. The compound sets are provided as an open resource for future lead discovery programs, and to address important research questions.


Subject(s)
Drug Evaluation, Preclinical/methods , High-Throughput Screening Assays , Kinetoplastida/drug effects , Parasitic Sensitivity Tests/methods , Small Molecule Libraries , Animals , Antiprotozoal Agents/pharmacology , Cell Line , Genome, Protozoan , Humans , Kinetoplastida/classification , Kinetoplastida/genetics , Mice , Phylogeny
6.
J Allergy Clin Immunol ; 134(1): 82-91, 2014 Jul.
Article in English | MEDLINE | ID: mdl-24880632

ABSTRACT

BACKGROUND: Atopic dermatitis (AD; eczema) is characterized by a widespread abnormality in cutaneous barrier function and propensity to inflammation. Filaggrin is a multifunctional protein and plays a key role in skin barrier formation. Loss-of-function mutations in the gene encoding filaggrin (FLG) are a highly significant risk factor for atopic disease, but the molecular mechanisms leading to dermatitis remain unclear. OBJECTIVE: We sought to interrogate tissue-specific variations in the expressed genome in the skin of children with AD and to investigate underlying pathomechanisms in atopic skin. METHODS: We applied single-molecule direct RNA sequencing to analyze the whole transcriptome using minimal tissue samples. Uninvolved skin biopsy specimens from 26 pediatric patients with AD were compared with site-matched samples from 10 nonatopic teenage control subjects. Cases and control subjects were screened for FLG genotype to stratify the data set. RESULTS: Two thousand four hundred thirty differentially expressed genes (false discovery rate, P < .05) were identified, of which 211 were significantly upregulated and 490 downregulated by greater than 2-fold. Gene ontology terms for "extracellular space" and "defense response" were enriched, whereas "lipid metabolic processes" were downregulated. The subset of FLG wild-type cases showed dysregulation of genes involved with lipid metabolism, whereas filaggrin haploinsufficiency affected global gene expression and was characterized by a type 1 interferon-mediated stress response. CONCLUSION: These analyses demonstrate the importance of extracellular space and lipid metabolism in atopic skin pathology independent of FLG genotype, whereas an aberrant defense response is seen in subjects with FLG mutations. Genotype stratification of the large data set has facilitated functional interpretation and might guide future therapy development.


Subject(s)
Dermatitis, Atopic/genetics , Intermediate Filament Proteins/genetics , Skin/metabolism , Transcription, Genetic/immunology , Adolescent , Case-Control Studies , Child , Dermatitis, Atopic/immunology , Dermatitis, Atopic/metabolism , Dermatitis, Atopic/pathology , Extracellular Space/immunology , Female , Filaggrin Proteins , Gene Expression Regulation , High-Throughput Nucleotide Sequencing , Humans , Intermediate Filament Proteins/immunology , Lipid Metabolism/immunology , Male , Skin/immunology , Skin/pathology , Young Adult
7.
PLoS One ; 9(4): e94270, 2014.
Article in English | MEDLINE | ID: mdl-24722185

ABSTRACT

The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct and complete annotation in addition to the underlying genomic sequence is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental system can lead to incorrect interpretation of the effect on RNA expression of an experimental treatment or mutation in the system under study. Until recently, the genome-wide annotation of 3' untranslated regions received less attention than coding regions and the delineation of intron/exon boundaries. In this paper, data produced for samples in Human, Chicken and A. thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing technology from Helicos Biosciences which locates 3' polyadenylation sites to within +/- 2 nt, were combined with archival EST and RNA-Seq data. Nine examples are illustrated where this combination of data allowed: (1) gene and 3' UTR re-annotation (including extension of one 3' UTR by 5.9 kb); (2) disentangling of gene expression in complex regions; (3) clearer interpretation of small RNA expression and (4) identification of novel genes. While the specific examples displayed here may become obsolete as genome sequences and their annotations are refined, the principles laid out in this paper will be of general use both to those annotating genomes and those seeking to interpret existing publically available annotations in the context of their own experimental data.


Subject(s)
3' Untranslated Regions , Computational Biology/methods , Expressed Sequence Tags , Sequence Analysis, RNA/methods , Animals , Arabidopsis/genetics , Cell Cycle Proteins/genetics , Chickens/genetics , Genetic Loci/genetics , Genome , Genome, Human , Genome, Plant , Genomics , Humans , Methyltransferases/genetics , Models, Statistical , RNA, Messenger/metabolism , Ribosomal Proteins/genetics , Skin/metabolism
8.
PLoS Genet ; 9(10): e1003867, 2013 Oct.
Article in English | MEDLINE | ID: mdl-24204292

ABSTRACT

Alternative cleavage and polyadenylation influence the coding and regulatory potential of mRNAs and where transcription termination occurs. Although widespread, few regulators of this process are known. The Arabidopsis thaliana protein FPA is a rare example of a trans-acting regulator of poly(A) site choice. Analysing fpa mutants therefore provides an opportunity to reveal generic consequences of disrupting this process. We used direct RNA sequencing to quantify shifts in RNA 3' formation in fpa mutants. Here we show that specific chimeric RNAs formed between the exons of otherwise separate genes are a striking consequence of loss of FPA function. We define intergenic read-through transcripts resulting from defective RNA 3' end formation in fpa mutants and detail cryptic splicing and antisense transcription associated with these read-through RNAs. We identify alternative polyadenylation within introns that is sensitive to FPA and show FPA-dependent shifts in IBM1 poly(A) site selection that differ from those recently defined in mutants defective in intragenic heterochromatin and DNA methylation. Finally, we show that defective termination at specific loci in fpa mutants is shared with dicer-like 1 (dcl1) or dcl4 mutants, leading us to develop alternative explanations for some silencing roles of these proteins. We relate our findings to the impact that altered patterns of 3' end formation can have on gene and genome organisation.


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis/genetics , RNA, Messenger/biosynthesis , RNA-Binding Proteins/genetics , Transcription Termination, Genetic , Alternative Splicing/genetics , Arabidopsis Proteins/biosynthesis , Arabidopsis Proteins/metabolism , Cell Cycle Proteins/genetics , Cell Cycle Proteins/metabolism , DNA Methylation/genetics , Exons , Gene Silencing , Heterochromatin/genetics , Jumonji Domain-Containing Histone Demethylases/genetics , Jumonji Domain-Containing Histone Demethylases/metabolism , Mutation , RNA, Messenger/genetics , RNA-Binding Proteins/biosynthesis , RNA-Binding Proteins/metabolism , Ribonuclease III/genetics , Ribonuclease III/metabolism
9.
Sci Rep ; 3: 2866, 2013 Oct 09.
Article in English | MEDLINE | ID: mdl-24104185

ABSTRACT

RNA-binding proteins (RBPs) play an important role in plant host-microbe interactions. In this study, we show that the plant RBP known as FPA, which regulates 3'-end mRNA polyadenylation, negatively regulates basal resistance to bacterial pathogen Pseudomonas syringae in Arabidopsis. A custom microarray analysis reveals that flg22, a peptide derived from bacterial flagellins, induces expression of alternatively polyadenylated isoforms of mRNA encoding the defence-related transcriptional repressor ETHYLENE RESPONSE FACTOR 4 (ERF4), which is regulated by FPA. Flg22 induces expression of a novel isoform of ERF4 that lacks the ERF-associated amphiphilic repression (EAR) motif, while FPA inhibits this induction. The EAR-lacking isoform of ERF4 acts as a transcriptional activator in vivo and suppresses the flg22-dependent reactive oxygen species burst. We propose that FPA controls use of proximal polyadenylation sites of ERF4, which quantitatively limit the defence response output.


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis Proteins/metabolism , Flagellin/pharmacology , Gene Expression Regulation, Plant , Plant Diseases/microbiology , Polyadenylation/genetics , RNA Processing, Post-Transcriptional , RNA-Binding Proteins/metabolism , Repressor Proteins/genetics , Arabidopsis/drug effects , Arabidopsis/genetics , Arabidopsis/growth & development , Immunity, Innate/genetics , Peptide Fragments/pharmacology , Plant Diseases/genetics , Plants, Genetically Modified/drug effects , Plants, Genetically Modified/genetics , Plants, Genetically Modified/growth & development , Pseudomonas syringae/genetics , Pseudomonas syringae/growth & development , Pseudomonas syringae/pathogenicity , RNA, Messenger/genetics , RNA, Plant/genetics , RNA-Binding Proteins/genetics , Reactive Oxygen Species/metabolism , Real-Time Polymerase Chain Reaction , Reverse Transcriptase Polymerase Chain Reaction
10.
Nat Struct Mol Biol ; 19(8): 845-52, 2012 Aug.
Article in English | MEDLINE | ID: mdl-22820990

ABSTRACT

It has recently been shown that RNA 3'-end formation plays a more widespread role in controlling gene expression than previously thought. To examine the impact of regulated 3'-end formation genome-wide, we applied direct RNA sequencing to A. thaliana. Here we show the authentic transcriptome in unprecedented detail and describe the effects of 3'-end formation on genome organization. We reveal extreme heterogeneity in RNA 3' ends, discover previously unrecognized noncoding RNAs and propose widespread reannotation of the genome. We explain the origin of most poly(A)(+) antisense RNAs and identify cis elements that control 3'-end formation in different registers. These findings are essential to understanding what the genome actually encodes, how it is organized and how regulated 3'-end formation affects these processes.


Subject(s)
Arabidopsis/genetics , Arabidopsis/metabolism , RNA, Plant/genetics , RNA, Plant/metabolism , Base Sequence , Chromosome Mapping , Gene Expression Regulation, Plant , Genes, Plant , Molecular Sequence Data , Nucleic Acid Conformation , Oligodeoxyribonucleotides/metabolism , Polyadenylation , RNA 3' End Processing , RNA, Antisense/genetics , RNA, Antisense/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA, Plant/chemistry , RNA, Small Nucleolar/genetics , RNA, Small Nucleolar/metabolism , RNA, Untranslated/chemistry , RNA, Untranslated/genetics , RNA, Untranslated/metabolism , Sequence Analysis, RNA
SELECTION OF CITATIONS
SEARCH DETAIL
...