Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 3 de 3
Filter
Add more filters










Database
Language
Publication year range
1.
PLoS Comput Biol ; 8(4): e1002464, 2012.
Article in English | MEDLINE | ID: mdl-22496636

ABSTRACT

High-throughput RNA sequencing enables quantification of transcripts (both known and novel), exon/exon junctions and fusions of exons from different genes. Discovery of gene fusions-particularly those expressed with low abundance- is a challenge with short- and medium-length sequencing reads. To address this challenge, we implemented an RNA-Seq mapping pipeline within the LifeScope software. We introduced new features including filter and junction mapping, annotation-aided pairing rescue and accurate mapping quality values. We combined this pipeline with a Suffix Array Spliced Read (SASR) aligner to detect chimeric transcripts. Performing paired-end RNA-Seq of the breast cancer cell line MCF-7 using the SOLiD system, we called 40 gene fusions among over 120,000 splicing junctions. We validated 36 of these 40 fusions with TaqMan assays, of which 25 were expressed in MCF-7 but not the Human Brain Reference. An intra-chromosomal gene fusion involving the estrogen receptor alpha gene ESR1, and another involving the RPS6KB1 (Ribosomal protein S6 kinase beta-1) were recurrently expressed in a number of breast tumor cell lines and a clinical tumor sample.


Subject(s)
Algorithms , Gene Fusion/genetics , Oligonucleotide Array Sequence Analysis/methods , Sequence Analysis, RNA/methods , Software , Base Sequence , Molecular Sequence Data
2.
PLoS One ; 5(2): e9317, 2010 Feb 19.
Article in English | MEDLINE | ID: mdl-20174472

ABSTRACT

Due to growing throughput and shrinking cost, massively parallel sequencing is rapidly becoming an attractive alternative to microarrays for the genome-wide study of gene expression and copy number alterations in primary tumors. The sequencing of transcripts (RNA-Seq) should offer several advantages over microarray-based methods, including the ability to detect somatic mutations and accurately measure allele-specific expression. To investigate these advantages we have applied a novel, strand-specific RNA-Seq method to tumors and matched normal tissue from three patients with oral squamous cell carcinomas. Additionally, to better understand the genomic determinants of the gene expression changes observed, we have sequenced the tumor and normal genomes of one of these patients. We demonstrate here that our RNA-Seq method accurately measures allelic imbalance and that measurement on the genome-wide scale yields novel insights into cancer etiology. As expected, the set of genes differentially expressed in the tumors is enriched for cell adhesion and differentiation functions, but, unexpectedly, the set of allelically imbalanced genes is also enriched for these same cancer-related functions. By comparing the transcriptomic perturbations observed in one patient to his underlying normal and tumor genomes, we find that allelic imbalance in the tumor is associated with copy number mutations and that copy number mutations are, in turn, strongly associated with changes in transcript abundance. These results support a model in which allele-specific deletions and duplications drive allele-specific changes in gene expression in the developing tumor.


Subject(s)
Carcinoma, Squamous Cell/genetics , Gene Expression Profiling , Mouth Neoplasms/genetics , Sequence Analysis, DNA/methods , Allelic Imbalance , Cluster Analysis , Gene Deletion , Gene Dosage , Gene Duplication , Gene Expression Regulation, Neoplastic , Genome-Wide Association Study/methods , Humans , Mutation , Oligonucleotide Array Sequence Analysis , Polymorphism, Single Nucleotide , Reverse Transcriptase Polymerase Chain Reaction
3.
Genomics ; 88(4): 513-20, 2006 Oct.
Article in English | MEDLINE | ID: mdl-16890402

ABSTRACT

We have discovered a family of small secreted proteins in Homo sapiens and Mus musculus. The IGF-like (IGFL) genes encode proteins of approximately 100 amino acids that contain 11 conserved cysteine residues at fixed positions, including two CC motifs. In H. sapiens, the family is composed of four genes and two pseudogenes that are referred as IGFL1 to IGFL4 and IGFL1P1 and IGFL1P2, respectively. Human IGFL genes are clustered together on chromosome 19 within a 35-kb interval. M. musculus has a single IGFL family member that is located on chromosome 7. Further, evolutionary analysis shows a lack of direct orthology between any of the four human members and the mouse gene. This relationship between the mouse and the human family members suggests that the multiple members in the human complement have arisen from recent duplication events that appear limited to the primate lineage. Structural considerations and sequence comparisons would suggest that IGFL proteins are distantly related to the IGF superfamily of growth factors. IGFL mRNAs display specific expression patterns; they are expressed in fetal tissues, breast, and prostate, and in many cancers as well, and this pattern is consistent with that of the IGF family members.


Subject(s)
Gene Expression Regulation , Multigene Family , Somatomedins/genetics , Somatomedins/metabolism , Amino Acid Sequence , Animals , Chromosomes, Human, Pair 19 , Conserved Sequence , Cysteine , Evolution, Molecular , Exons , Female , Humans , Male , Mice , Molecular Sequence Data , Pseudogenes , Sequence Homology, Amino Acid , Somatomedins/chemistry
SELECTION OF CITATIONS
SEARCH DETAIL
...