Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Nat Biotechnol ; 35(4): 350-353, 2017 04.
Artigo em Inglês | MEDLINE | ID: mdl-28263295

RESUMO

We present SplashRNA, a sequential classifier to predict potent microRNA-based short hairpin RNAs (shRNAs). Trained on published and novel data sets, SplashRNA outperforms previous algorithms and reliably predicts the most efficient shRNAs for a given gene. Combined with an optimized miR-E backbone, >90% of high-scoring SplashRNA predictions trigger >85% protein knockdown when expressed from a single genomic integration. SplashRNA can significantly improve the accuracy of loss-of-function genetics studies and facilitates the generation of compact shRNA libraries.


Assuntos
Algoritmos , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas/genética , Inativação Gênica , Aprendizado de Máquina , RNA Interferente Pequeno/genética , Software , Sistemas CRISPR-Cas/genética , Mapeamento Cromossômico/métodos , Análise de Sequência de RNA/métodos
2.
Bioinformatics ; 33(1): 139-141, 2017 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-27634950

RESUMO

MOTIVATION: Deep sequencing based ribosome footprint profiling can provide novel insights into the regulatory mechanisms of protein translation. However, the observed ribosome profile is fundamentally confounded by transcriptional activity. In order to decipher principles of translation regulation, tools that can reliably detect changes in translation efficiency in case-control studies are needed. RESULTS: We present a statistical framework and an analysis tool, RiboDiff, to detect genes with changes in translation efficiency across experimental treatments. RiboDiff uses generalized linear models to estimate the over-dispersion of RNA-Seq and ribosome profiling measurements separately, and performs a statistical test for differential translation efficiency using both mRNA abundance and ribosome occupancy. AVAILABILITY AND IMPLEMENTATION: RiboDiff webpage http://bioweb.me/ribodiff Source code including scripts for preprocessing the FASTQ data are available at http://github.com/ratschlab/ribodiff CONTACTS: zhongy@cbio.mskcc.org or raetsch@inf.ethz.chSupplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Biossíntese de Proteínas , RNA Mensageiro/metabolismo , Ribossomos/metabolismo , Análise de Sequência de RNA/métodos , Software , Regulação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala/métodos
3.
Bioinformatics ; 30(9): 1300-1, 2014 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-24413671

RESUMO

We present Oqtans, an open-source workbench for quantitative transcriptome analysis, that is integrated in Galaxy. Its distinguishing features include customizable computational workflows and a modular pipeline architecture that facilitates comparative assessment of tool and data quality. Oqtans integrates an assortment of machine learning-powered tools into Galaxy, which show superior or equal performance to state-of-the-art tools. Implemented tools comprise a complete transcriptome analysis workflow: short-read alignment, transcript identification/quantification and differential expression analysis. Oqtans and Galaxy facilitate persistent storage, data exchange and documentation of intermediate results and analysis workflows. We illustrate how Oqtans aids the interpretation of data from different experiments in easy to understand use cases. Users can easily create their own workflows and extend Oqtans by integrating specific tools. Oqtans is available as (i) a cloud machine image with a demo instance at cloud.oqtans.org, (ii) a public Galaxy instance at galaxy.cbio.mskcc.org, (iii) a git repository containing all installed software (oqtans.org/git); most of which is also available from (iv) the Galaxy Toolshed and (v) a share string to use along with Galaxy CloudMan.


Assuntos
RNA/genética , Análise de Sequência de RNA/métodos , Transcriptoma , Sequência de Bases , Internet , Software
4.
Bioinformatics ; 29(20): 2529-38, 2013 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-23980025

RESUMO

MOTIVATION: High-throughput sequencing of mRNA (RNA-Seq) has led to tremendous improvements in the detection of expressed genes and reconstruction of RNA transcripts. However, the extensive dynamic range of gene expression, technical limitations and biases, as well as the observed complexity of the transcriptional landscape, pose profound computational challenges for transcriptome reconstruction. RESULTS: We present the novel framework MITIE (Mixed Integer Transcript IdEntification) for simultaneous transcript reconstruction and quantification. We define a likelihood function based on the negative binomial distribution, use a regularization approach to select a few transcripts collectively explaining the observed read data and show how to find the optimal solution using Mixed Integer Programming. MITIE can (i) take advantage of known transcripts, (ii) reconstruct and quantify transcripts simultaneously in multiple samples, and (iii) resolve the location of multi-mapping reads. It is designed for genome- and assembly-based transcriptome reconstruction. We present an extensive study based on realistic simulated RNA-Seq data. When compared with state-of-the-art approaches, MITIE proves to be significantly more sensitive and overall more accurate. Moreover, MITIE yields substantial performance gains when used with multiple samples. We applied our system to 38 Drosophila melanogaster modENCODE RNA-Seq libraries and estimated the sensitivity of reconstructing omitted transcript annotations and the specificity with respect to annotated transcripts. Our results corroborate that a well-motivated objective paired with appropriate optimization techniques lead to significant improvements over the state-of-the-art in transcriptome reconstruction. AVAILABILITY: MITIE is implemented in C++ and is available from http://bioweb.me/mitie under the GPL license.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , RNA/análise , Análise de Sequência de RNA/métodos , Software , Transcrição Gênica , Animais , Drosophila melanogaster , Humanos , Internet , RNA/genética
5.
PLoS Genet ; 8(8): e1002856, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22912589

RESUMO

Cohesin is a protein complex that forms a ring around sister chromatids thus holding them together. The ring is composed of three proteins: Smc1, Smc3 and Scc1. The roles of three additional proteins that associate with the ring, Scc3, Pds5 and Wpl1, are not well understood. It has been proposed that these three factors form a complex that stabilizes the ring and prevents it from opening. This activity promotes sister chromatid cohesion but at the same time poses an obstacle for the initial entrapment of sister DNAs. This hindrance to cohesion establishment is overcome during DNA replication via acetylation of the Smc3 subunit by the Eco1 acetyltransferase. However, the full mechanistic consequences of Smc3 acetylation remain unknown. In the current work, we test the requirement of Scc3 and Pds5 for the stable association of cohesin with DNA. We investigated the consequences of Scc3 and Pds5 depletion in vivo using degron tagging in budding yeast. The previously described DHFR-based N-terminal degron as well as a novel Eco1-derived C-terminal degron were employed in our study. Scc3 and Pds5 associate with cohesin complexes independently of each other and require the Scc1 "core" subunit for their association with chromosomes. Contrary to previous data for Scc1 downregulation, depletion of either Scc3 or Pds5 had a strong effect on sister chromatid cohesion but not on cohesin binding to DNA. Quantity, stability and genome-wide distribution of cohesin complexes remained mostly unchanged after the depletion of Scc3 and Pds5. Our findings are inconsistent with a previously proposed model that Scc3 and Pds5 are cohesin maintenance factors required for cohesin ring stability or for maintaining its association with DNA. We propose that Scc3 and Pds5 specifically function during cohesion establishment in S phase.


Assuntos
Proteínas de Ciclo Celular/genética , Proteínas Cromossômicas não Histona/genética , Cromossomos Fúngicos , DNA Fúngico/metabolismo , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Acetiltransferases/genética , Acetiltransferases/metabolismo , Proteínas de Ciclo Celular/deficiência , Proteínas de Ciclo Celular/metabolismo , Cromátides/genética , Cromátides/metabolismo , Proteínas Cromossômicas não Histona/metabolismo , Segregação de Cromossomos/genética , DNA Fúngico/genética , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Fase S/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Coesinas
6.
Nature ; 477(7365): 419-23, 2011 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-21874022

RESUMO

Genetic differences between Arabidopsis thaliana accessions underlie the plant's extensive phenotypic variation, and until now these have been interpreted largely in the context of the annotated reference accession Col-0. Here we report the sequencing, assembly and annotation of the genomes of 18 natural A. thaliana accessions, and their transcriptomes. When assessed on the basis of the reference annotation, one-third of protein-coding genes are predicted to be disrupted in at least one accession. However, re-annotation of each genome revealed that alternative gene models often restore coding potential. Gene expression in seedlings differed for nearly half of expressed genes and was frequently associated with cis variants within 5 kilobases, as were intron retention alternative splicing events. Sequence and expression variation is most pronounced in genes that respond to the biotic environment. Our data further promote evolutionary and functional studies in A. thaliana, especially the MAGIC genetic reference population descended from these accessions.


Assuntos
Arabidopsis/genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas/genética , Genoma de Planta/genética , Transcrição Gênica/genética , Arabidopsis/classificação , Proteínas de Arabidopsis/genética , Sequência de Bases , Genes de Plantas/genética , Genômica , Haplótipos/genética , Mutação INDEL/genética , Anotação de Sequência Molecular , Filogenia , Polimorfismo de Nucleotídeo Único/genética , Proteoma/genética , Plântula/genética , Análise de Sequência de DNA
7.
Genome Res ; 21(2): 325-41, 2011 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-21177967

RESUMO

The C. elegans genome has been completely sequenced, and the developmental anatomy of this model organism is described at single-cell resolution. Here we utilize strategies that exploit this precisely defined architecture to link gene expression to cell type. We obtained RNAs from specific cells and from each developmental stage using tissue-specific promoters to mark cells for isolation by FACS or for mRNA extraction by the mRNA-tagging method. We then generated gene expression profiles of more than 30 different cells and developmental stages using tiling arrays. Machine-learning-based analysis detected transcripts corresponding to established gene models and revealed novel transcriptionally active regions (TARs) in noncoding domains that comprise at least 10% of the total C. elegans genome. Our results show that about 75% of transcripts with detectable expression are differentially expressed among developmental stages and across cell types. Examination of known tissue- and cell-specific transcripts validates these data sets and suggests that newly identified TARs may exercise cell-specific functions. Additionally, we used self-organizing maps to define groups of coregulated transcripts and applied regulatory element analysis to identify known transcription factor- and miRNA-binding sites, as well as novel motifs that likely function to control subsets of these genes. By using cell-specific, whole-genome profiling strategies, we have detected a large number of novel transcripts and produced high-resolution gene expression maps that provide a basis for establishing the roles of individual genes in cellular differentiation.


Assuntos
Caenorhabditis elegans/genética , Regulação da Expressão Gênica no Desenvolvimento , Animais , Biologia Computacional , Bases de Dados Genéticas , Perfilação da Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento/genética , Masculino , Meiose/genética , Dados de Sequência Molecular , Oogênese/genética , Fases de Leitura Aberta/genética , Transcrição Gênica , Regiões não Traduzidas/genética , Inativação do Cromossomo X/genética
8.
Curr Protoc Bioinformatics ; Chapter 11: Unit 11.6, 2010 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-21154708

RESUMO

Next-generation sequencing technologies have revolutionized genome and transcriptome sequencing. RNA-Seq experiments are able to generate huge amounts of transcriptome sequence reads at a fraction of the cost of Sanger sequencing. Reads produced by these technologies are relatively short and error prone. To utilize such reads for transcriptome reconstruction and gene-structure identification, one needs to be able to accurately align the sequence reads over intron boundaries. In this unit, we describe PALMapper, a fast and easy-to-use tool that is designed to accurately compute both unspliced and spliced alignments for millions of RNA-Seq reads. It combines the efficient read mapper GenomeMapper with the spliced aligner QPALMA, which exploits read-quality information and predictions of splice sites to improve the alignment accuracy. The PALMapper package is available as a command-line tool running on Unix or Mac OS X systems or through a Web interface based on Galaxy tools.


Assuntos
Genômica/métodos , RNA/química , Alinhamento de Sequência/métodos , Análise de Sequência de RNA/métodos , Software , Sequência de Bases , Perfilação da Expressão Gênica , Genoma , Splicing de RNA
9.
Genomics ; 94(1): 48-54, 2009 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-19285128

RESUMO

We have developed AspAlt-a web-based comparative analytical platform for exploring the variations in alternative transcription (AT) events and alternative splicing (AS) events in eukaryotes. AspAlt provides integrated access to 2.1 million AT-AS annotations from 1,58,876 multi-isoform genes and has the following user-friendly analytical features: (1) advanced graphical display to visualize and analyze AT-AS events in 46 eukaryotic genomes; (2) compare and identify the differences in AT-AS patterns among a group of genes specified by the user or among homologous gene groups; (3) inter-database comparative viewer to analyze the differences in the AT-AS annotations for the same gene among Ensembl, RefSeq and AceView databases; (4) dynamically classify and generate graphical plots of AT-AS events from mRNA annotations submitted by the user; and (5) download genomic AT-AS annotations of 46 eukaryotes in XML and tab-delimited formats. The AspAlt resource is available at http://66.170.16.154/AspAlt.


Assuntos
Processamento Alternativo/genética , Biologia Computacional/métodos , Bases de Dados de Ácidos Nucleicos , Software , Transcrição Gênica , Gráficos por Computador , Células Eucarióticas , Internet , RNA Mensageiro/genética , Alinhamento de Sequência
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...