Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 7 de 7
Filter
Add more filters










Database
Language
Publication year range
1.
J Comput Biol ; 15(7): 845-56, 2008 Sep.
Article in English | MEDLINE | ID: mdl-18707537

ABSTRACT

Tiling DNA microarrays extend current microarray technology by probing the non-repeat portion of a genome at regular intervals in an unbiased fashion. A fundamental problem in the analysis of these data is the detection of genomic regions that are differentially transcribed across multiple conditions. We propose a linear time algorithm based on segmentation techniques and linear modeling that can work at a user-selected false discovery rate (FDR). It also attains a fourfold sensitivity gain over the only competing algorithm when applied to a whole genome transcription data set spanning the embryonic development of Drosophila melanogaster.


Subject(s)
Algorithms , Drosophila melanogaster/genetics , Gene Expression Profiling/methods , Oligonucleotide Array Sequence Analysis/methods , Transcription, Genetic , Animals , Databases, Genetic , Drosophila melanogaster/embryology , Genome , Models, Genetic , Multivariate Analysis , Sequence Analysis, DNA/methods
2.
Science ; 316(5830): 1484-8, 2007 Jun 08.
Article in English | MEDLINE | ID: mdl-17510325

ABSTRACT

Significant fractions of eukaryotic genomes give rise to RNA, much of which is unannotated and has reduced protein-coding potential. The genomic origins and the associations of human nuclear and cytosolic polyadenylated RNAs longer than 200 nucleotides (nt) and whole-cell RNAs less than 200 nt were investigated in this genome-wide study. Subcellular addresses for nucleotides present in detected RNAs were assigned, and their potential processing into short RNAs was investigated. Taken together, these observations suggest a novel role for some unannotated RNAs as primary transcripts for the production of short RNAs. Three potentially functional classes of RNAs have been identified, two of which are syntenically conserved and correlate with the expression state of protein-coding genes. These data support a highly interleaved organization of the human transcriptome.


Subject(s)
Genome, Human , RNA Precursors/genetics , RNA, Messenger/genetics , RNA, Messenger/metabolism , RNA/genetics , Transcription, Genetic , Animals , Cell Line, Tumor , Cell Nucleus/metabolism , Cytosol/metabolism , Exons , Gene Expression , Genome , HeLa Cells , Humans , Mice , Promoter Regions, Genetic , RNA/metabolism , RNA Precursors/metabolism , Synteny , Terminator Regions, Genetic
3.
Nat Genet ; 38(10): 1151-8, 2006 Oct.
Article in English | MEDLINE | ID: mdl-16951679

ABSTRACT

Many animal and plant genomes are transcribed much more extensively than current annotations predict. However, the biological function of these unannotated transcribed regions is largely unknown. Approximately 7% and 23% of the detected transcribed nucleotides during D. melanogaster embryogenesis map to unannotated intergenic and intronic regions, respectively. Based on computational analysis of coordinated transcription, we conservatively estimate that 29% of all unannotated transcribed sequences function as missed or alternative exons of well-characterized protein-coding genes. We estimate that 15.6% of intergenic transcribed regions function as missed or alternative transcription start sites (TSS) used by 11.4% of the expressed protein-coding genes. Identification of P element mutations within or near newly identified 5' exons provides a strategy for mapping previously uncharacterized mutations to their respective genes. Collectively, these data indicate that at least 85% of the fly genome is transcribed and processed into mature transcripts representing at least 30% of the fly genome.


Subject(s)
Drosophila melanogaster/embryology , Drosophila melanogaster/genetics , Gene Expression Regulation, Developmental , Transcription, Genetic , Amino Acid Sequence , Animals , DNA, Intergenic , Drosophila Proteins/genetics , Embryo, Nonmammalian , Exons , Genome, Insect , Molecular Sequence Data , Mutation , Oligonucleotide Array Sequence Analysis , Transcription Initiation Site
4.
Science ; 308(5725): 1149-54, 2005 May 20.
Article in English | MEDLINE | ID: mdl-15790807

ABSTRACT

Sites of transcription of polyadenylated and nonpolyadenylated RNAs for 10 human chromosomes were mapped at 5-base pair resolution in eight cell lines. Unannotated, nonpolyadenylated transcripts comprise the major proportion of the transcriptional output of the human genome. Of all transcribed sequences, 19.4, 43.7, and 36.9% were observed to be polyadenylated, nonpolyadenylated, and bimorphic, respectively. Half of all transcribed sequences are found only in the nucleus and for the most part are unannotated. Overall, the transcribed portions of the human genome are predominantly composed of interlaced networks of both poly A+ and poly A- annotated transcripts and unannotated transcripts of unknown function. This organization has important implications for interpreting genotype-phenotype associations, regulation of gene expression, and the definition of a gene.


Subject(s)
Chromosomes, Human/genetics , Genome, Human , RNA, Messenger/analysis , Transcription, Genetic , Cell Line , Cell Line, Tumor , Cell Nucleus/metabolism , Chromosomes, Human, Pair 13/genetics , Chromosomes, Human, Pair 14/genetics , Chromosomes, Human, Pair 19/genetics , Chromosomes, Human, Pair 20/genetics , Chromosomes, Human, Pair 21/genetics , Chromosomes, Human, Pair 22/genetics , Chromosomes, Human, Pair 6/genetics , Chromosomes, Human, Pair 7/genetics , Chromosomes, Human, X/genetics , Chromosomes, Human, Y/genetics , Computational Biology , Cytosol/metabolism , DNA, Complementary , DNA, Intergenic , Exons , Female , Humans , Introns , Male , Molecular Sequence Data , Nucleic Acid Amplification Techniques , Oligonucleotide Array Sequence Analysis , Physical Chromosome Mapping , RNA Splicing
5.
Genome Res ; 14(3): 331-42, 2004 Mar.
Article in English | MEDLINE | ID: mdl-14993201

ABSTRACT

In this report, we have achieved a richer view of the transcriptome for Chromosomes 21 and 22 by using high-density oligonucleotide arrays on cytosolic poly(A)(+) RNA. Conservatively, only 31.4% of the observed transcribed nucleotides correspond to well-annotated genes, whereas an additional 4.8% and 14.7% correspond to mRNAs and ESTs, respectively. Approximately 85% of the known exons were detected, and up to 21% of known genes have only a single isoform based on exon-skipping alternative expression. Overall, the expression of the well-characterized exons falls predominately into two categories, uniquely or ubiquitously expressed with an identifiable proportion of antisense transcripts. The remaining observed transcription (49.0%) was outside of any known annotation. These novel transcripts appear to be more cell-line-specific and have lower and less variation in expression than the well-characterized genes. Novel transcripts were further characterized based on their distance to annotations, transcript size, coding capacity, and identification as antisense to intronic sequences. By RT-PCR, 126 novel transcripts were independently verified, resulting in a 65% verification rate. These observations strongly support the argument for a re-evaluation of the total number of human genes and an alternative term for "gene" to encompass these growing, novel classes of RNA transcripts in the human genome.


Subject(s)
Chromosomes, Human, Pair 21/genetics , Chromosomes, Human, Pair 22/genetics , RNA/genetics , Transcription, Genetic/genetics , Cell Line , Cell Line, Tumor , Chromosome Mapping/methods , DNA, Neoplasm/genetics , Gene Expression Profiling/methods , Genes/genetics , Genes, Neoplasm/genetics , Humans , Jurkat Cells/chemistry , Jurkat Cells/metabolism , Molecular Sequence Data , Oligonucleotide Array Sequence Analysis/methods , Oligonucleotide Probes/genetics , RNA, Messenger/genetics
6.
Cell ; 116(4): 499-509, 2004 Feb 20.
Article in English | MEDLINE | ID: mdl-14980218

ABSTRACT

Using high-density oligonucleotide arrays representing essentially all nonrepetitive sequences on human chromosomes 21 and 22, we map the binding sites in vivo for three DNA binding transcription factors, Sp1, cMyc, and p53, in an unbiased manner. This mapping reveals an unexpectedly large number of transcription factor binding site (TFBS) regions, with a minimal estimate of 12,000 for Sp1, 25,000 for cMyc, and 1600 for p53 when extrapolated to the full genome. Only 22% of these TFBS regions are located at the 5' termini of protein-coding genes while 36% lie within or immediately 3' to well-characterized genes and are significantly correlated with noncoding RNAs. A significant number of these noncoding RNAs are regulated in response to retinoic acid, and overlapping pairs of protein-coding and noncoding RNAs are often coregulated. Thus, the human genome contains roughly comparable numbers of protein-coding and noncoding genes that are bound by common transcription factors and regulated by common environmental signals.


Subject(s)
Chromosomes, Human, Pair 21 , Chromosomes, Human, Pair 22 , Transcription Factors/metabolism , Amino Acid Motifs , Binding Sites , Cell Line , Chromatin/metabolism , Chromosome Mapping , CpG Islands , Exons , Expressed Sequence Tags , Genome, Human , Humans , Jurkat Cells , Models, Genetic , Polymerase Chain Reaction , Precipitin Tests , Promoter Regions, Genetic , Protein Binding , RNA/chemistry , RNA/metabolism , RNA, Messenger/metabolism , Reverse Transcriptase Polymerase Chain Reaction , Tretinoin/metabolism
7.
J Comput Biol ; 10(5): 763-73, 2003.
Article in English | MEDLINE | ID: mdl-14633398

ABSTRACT

Pedigree analysis is a central component of many current efforts to locate genes that contribute to diseases or to valuable traits. The analysis usually involves solving one of two very computation-intense problems. We analyze the complexity of these two problems. Surprisingly, we show that both problems are NP-hard even for pedigrees that contain no inbreeding loops.


Subject(s)
Pedigree , Computational Biology/methods , Models, Genetic , Probability
SELECTION OF CITATIONS
SEARCH DETAIL
...