Búsqueda | Portal Regional de la BVS

Multivariate segmentation in the analysis of transcription tiling array data.

Piccolboni, Antonio.

J Comput Biol ; 15(7): 845-56, 2008 Sep.

Artículo en Inglés | MEDLINE | ID: mdl-18707537

RESUMEN

Tiling DNA microarrays extend current microarray technology by probing the non-repeat portion of a genome at regular intervals in an unbiased fashion. A fundamental problem in the analysis of these data is the detection of genomic regions that are differentially transcribed across multiple conditions. We propose a linear time algorithm based on segmentation techniques and linear modeling that can work at a user-selected false discovery rate (FDR). It also attains a fourfold sensitivity gain over the only competing algorithm when applied to a whole genome transcription data set spanning the embryonic development of Drosophila melanogaster.

Asunto(s)

Algoritmos , Drosophila melanogaster/genética , Perfilación de la Expresión Génica/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Transcripción Genética , Animales , Bases de Datos Genéticas , Drosophila melanogaster/embriología , Genoma , Modelos Genéticos , Análisis Multivariante , Análisis de Secuencia de ADN/métodos

RNA maps reveal new RNA classes and a possible function for pervasive transcription.

Kapranov, Philipp; Cheng, Jill; Dike, Sujit; Nix, David A; Duttagupta, Radharani; Willingham, Aarron T; Stadler, Peter F; Hertel, Jana; Hackermüller, Jörg; Hofacker, Ivo L; Bell, Ian; Cheung, Evelyn; Drenkow, Jorg; Dumais, Erica; Patel, Sandeep; Helt, Gregg; Ganesh, Madhavan; Ghosh, Srinka; Piccolboni, Antonio; Sementchenko, Victor; Tammana, Hari; Gingeras, Thomas R.

Science ; 316(5830): 1484-8, 2007 Jun 08.

Artículo en Inglés | MEDLINE | ID: mdl-17510325

RESUMEN

Significant fractions of eukaryotic genomes give rise to RNA, much of which is unannotated and has reduced protein-coding potential. The genomic origins and the associations of human nuclear and cytosolic polyadenylated RNAs longer than 200 nucleotides (nt) and whole-cell RNAs less than 200 nt were investigated in this genome-wide study. Subcellular addresses for nucleotides present in detected RNAs were assigned, and their potential processing into short RNAs was investigated. Taken together, these observations suggest a novel role for some unannotated RNAs as primary transcripts for the production of short RNAs. Three potentially functional classes of RNAs have been identified, two of which are syntenically conserved and correlate with the expression state of protein-coding genes. These data support a highly interleaved organization of the human transcriptome.

Asunto(s)

Genoma Humano , Precursores del ARN/genética , ARN Mensajero/genética , ARN Mensajero/metabolismo , ARN/genética , Transcripción Genética , Animales , Línea Celular Tumoral , Núcleo Celular/metabolismo , Citosol/metabolismo , Exones , Expresión Génica , Genoma , Células HeLa , Humanos , Ratones , Regiones Promotoras Genéticas , ARN/metabolismo , Precursores del ARN/metabolismo , Sintenía , Regiones Terminadoras Genéticas

Biological function of unannotated transcription during the early development of Drosophila melanogaster.

Manak, J Robert; Dike, Sujit; Sementchenko, Victor; Kapranov, Philipp; Biemar, Frederic; Long, Jeff; Cheng, Jill; Bell, Ian; Ghosh, Srinka; Piccolboni, Antonio; Gingeras, Thomas R.

Nat Genet ; 38(10): 1151-8, 2006 Oct.

Artículo en Inglés | MEDLINE | ID: mdl-16951679

RESUMEN

Many animal and plant genomes are transcribed much more extensively than current annotations predict. However, the biological function of these unannotated transcribed regions is largely unknown. Approximately 7% and 23% of the detected transcribed nucleotides during D. melanogaster embryogenesis map to unannotated intergenic and intronic regions, respectively. Based on computational analysis of coordinated transcription, we conservatively estimate that 29% of all unannotated transcribed sequences function as missed or alternative exons of well-characterized protein-coding genes. We estimate that 15.6% of intergenic transcribed regions function as missed or alternative transcription start sites (TSS) used by 11.4% of the expressed protein-coding genes. Identification of P element mutations within or near newly identified 5' exons provides a strategy for mapping previously uncharacterized mutations to their respective genes. Collectively, these data indicate that at least 85% of the fly genome is transcribed and processed into mature transcripts representing at least 30% of the fly genome.

Asunto(s)

Drosophila melanogaster/embriología , Drosophila melanogaster/genética , Regulación del Desarrollo de la Expresión Génica , Transcripción Genética , Secuencia de Aminoácidos , Animales , ADN Intergénico , Proteínas de Drosophila/genética , Embrión no Mamífero , Exones , Genoma de los Insectos , Datos de Secuencia Molecular , Mutación , Análisis de Secuencia por Matrices de Oligonucleótidos , Sitio de Iniciación de la Transcripción

Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution.

Cheng, Jill; Kapranov, Philipp; Drenkow, Jorg; Dike, Sujit; Brubaker, Shane; Patel, Sandeep; Long, Jeffrey; Stern, David; Tammana, Hari; Helt, Gregg; Sementchenko, Victor; Piccolboni, Antonio; Bekiranov, Stefan; Bailey, Dione K; Ganesh, Madhavan; Ghosh, Srinka; Bell, Ian; Gerhard, Daniela S; Gingeras, Thomas R.

Science ; 308(5725): 1149-54, 2005 May 20.

Artículo en Inglés | MEDLINE | ID: mdl-15790807

RESUMEN

Sites of transcription of polyadenylated and nonpolyadenylated RNAs for 10 human chromosomes were mapped at 5-base pair resolution in eight cell lines. Unannotated, nonpolyadenylated transcripts comprise the major proportion of the transcriptional output of the human genome. Of all transcribed sequences, 19.4, 43.7, and 36.9% were observed to be polyadenylated, nonpolyadenylated, and bimorphic, respectively. Half of all transcribed sequences are found only in the nucleus and for the most part are unannotated. Overall, the transcribed portions of the human genome are predominantly composed of interlaced networks of both poly A+ and poly A- annotated transcripts and unannotated transcripts of unknown function. This organization has important implications for interpreting genotype-phenotype associations, regulation of gene expression, and the definition of a gene.

Asunto(s)

Cromosomas Humanos/genética , Genoma Humano , ARN Mensajero/análisis , Transcripción Genética , Línea Celular , Línea Celular Tumoral , Núcleo Celular/metabolismo , Cromosomas Humanos Par 13/genética , Cromosomas Humanos Par 14/genética , Cromosomas Humanos Par 19/genética , Cromosomas Humanos Par 20/genética , Cromosomas Humanos Par 21/genética , Cromosomas Humanos Par 22/genética , Cromosomas Humanos Par 6/genética , Cromosomas Humanos Par 7/genética , Cromosomas Humanos X/genética , Cromosomas Humanos Y/genética , Biología Computacional , Citosol/metabolismo , ADN Complementario , ADN Intergénico , Exones , Femenino , Humanos , Intrones , Masculino , Datos de Secuencia Molecular , Técnicas de Amplificación de Ácido Nucleico , Análisis de Secuencia por Matrices de Oligonucleótidos , Mapeo Físico de Cromosoma , Empalme del ARN

Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22.

Kampa, Dione; Cheng, Jill; Kapranov, Philipp; Yamanaka, Mark; Brubaker, Shane; Cawley, Simon; Drenkow, Jorg; Piccolboni, Antonio; Bekiranov, Stefan; Helt, Gregg; Tammana, Hari; Gingeras, Thomas R.

Genome Res ; 14(3): 331-42, 2004 Mar.

Artículo en Inglés | MEDLINE | ID: mdl-14993201

RESUMEN

In this report, we have achieved a richer view of the transcriptome for Chromosomes 21 and 22 by using high-density oligonucleotide arrays on cytosolic poly(A)(+) RNA. Conservatively, only 31.4% of the observed transcribed nucleotides correspond to well-annotated genes, whereas an additional 4.8% and 14.7% correspond to mRNAs and ESTs, respectively. Approximately 85% of the known exons were detected, and up to 21% of known genes have only a single isoform based on exon-skipping alternative expression. Overall, the expression of the well-characterized exons falls predominately into two categories, uniquely or ubiquitously expressed with an identifiable proportion of antisense transcripts. The remaining observed transcription (49.0%) was outside of any known annotation. These novel transcripts appear to be more cell-line-specific and have lower and less variation in expression than the well-characterized genes. Novel transcripts were further characterized based on their distance to annotations, transcript size, coding capacity, and identification as antisense to intronic sequences. By RT-PCR, 126 novel transcripts were independently verified, resulting in a 65% verification rate. These observations strongly support the argument for a re-evaluation of the total number of human genes and an alternative term for "gene" to encompass these growing, novel classes of RNA transcripts in the human genome.

Asunto(s)

Cromosomas Humanos Par 21/genética , Cromosomas Humanos Par 22/genética , ARN/genética , Transcripción Genética/genética , Línea Celular , Línea Celular Tumoral , Mapeo Cromosómico/métodos , ADN de Neoplasias/genética , Perfilación de la Expresión Génica/métodos , Genes/genética , Genes Relacionados con las Neoplasias/genética , Humanos , Células Jurkat/química , Células Jurkat/metabolismo , Datos de Secuencia Molecular , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Sondas de Oligonucleótidos/genética , ARN Mensajero/genética

Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs.

Cawley, Simon; Bekiranov, Stefan; Ng, Huck H; Kapranov, Philipp; Sekinger, Edward A; Kampa, Dione; Piccolboni, Antonio; Sementchenko, Victor; Cheng, Jill; Williams, Alan J; Wheeler, Raymond; Wong, Brant; Drenkow, Jorg; Yamanaka, Mark; Patel, Sandeep; Brubaker, Shane; Tammana, Hari; Helt, Gregg; Struhl, Kevin; Gingeras, Thomas R.

Cell ; 116(4): 499-509, 2004 Feb 20.

Artículo en Inglés | MEDLINE | ID: mdl-14980218

RESUMEN

Using high-density oligonucleotide arrays representing essentially all nonrepetitive sequences on human chromosomes 21 and 22, we map the binding sites in vivo for three DNA binding transcription factors, Sp1, cMyc, and p53, in an unbiased manner. This mapping reveals an unexpectedly large number of transcription factor binding site (TFBS) regions, with a minimal estimate of 12,000 for Sp1, 25,000 for cMyc, and 1600 for p53 when extrapolated to the full genome. Only 22% of these TFBS regions are located at the 5' termini of protein-coding genes while 36% lie within or immediately 3' to well-characterized genes and are significantly correlated with noncoding RNAs. A significant number of these noncoding RNAs are regulated in response to retinoic acid, and overlapping pairs of protein-coding and noncoding RNAs are often coregulated. Thus, the human genome contains roughly comparable numbers of protein-coding and noncoding genes that are bound by common transcription factors and regulated by common environmental signals.

Asunto(s)

Cromosomas Humanos Par 21 , Cromosomas Humanos Par 22 , Factores de Transcripción/metabolismo , Secuencias de Aminoácidos , Sitios de Unión , Línea Celular , Cromatina/metabolismo , Mapeo Cromosómico , Islas de CpG , Exones , Etiquetas de Secuencia Expresada , Genoma Humano , Humanos , Células Jurkat , Modelos Genéticos , Reacción en Cadena de la Polimerasa , Pruebas de Precipitina , Regiones Promotoras Genéticas , Unión Proteica , ARN/química , ARN/metabolismo , ARN Mensajero/metabolismo , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Tretinoina/metabolismo

On the complexity of fundamental computational problems in pedigree analysis.

Piccolboni, Antonio; Gusfield, Dan.

J Comput Biol ; 10(5): 763-73, 2003.

Artículo en Inglés | MEDLINE | ID: mdl-14633398

RESUMEN

Pedigree analysis is a central component of many current efforts to locate genes that contribute to diseases or to valuable traits. The analysis usually involves solving one of two very computation-intense problems. We analyze the complexity of these two problems. Surprisingly, we show that both problems are NP-hard even for pedigrees that contain no inbreeding loops.

Asunto(s)

Linaje , Biología Computacional/métodos , Modelos Genéticos , Probabilidad

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA