Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 16 de 16
Filter
Add more filters










Publication year range
1.
Genome Res ; 31(12): 2236-2248, 2021 Dec.
Article in English | MEDLINE | ID: mdl-34799402

ABSTRACT

Alu are high copy number interspersed repeats that have accumulated near genes during primate and human evolution. They are a pervasive source of structural variation in modern humans. Impacts that Alu insertions may have on gene expression are not well understood, although some have been associated with expression quantitative trait loci (eQTLs). Here, we directly test regulatory effects of polymorphic Alu insertions in isolation of other variants on the same haplotype. To screen insertion variants for those with such effects, we used ectopic luciferase reporter assays and evaluated 110 Alu insertion variants, including more than 40 with a potential role in disease risk. We observed a continuum of effects with significant outliers that up- or down-regulate luciferase activity. Using a series of reporter constructs, which included genomic context surrounding the Alu, we can distinguish between instances in which the Alu disrupts another regulator and those in which the Alu introduces new regulatory sequence. We next focused on three polymorphic Alu loci associated with breast cancer that display significant effects in the reporter assay. We used CRISPR to modify the endogenous sequences, establishing cell lines varying in the Alu genotype. Our findings indicate that Alu genotype can alter expression of genes implicated in cancer risk, including PTHLH, RANBP9, and MYC These data show that commonly occurring polymorphic Alu elements can alter transcript levels and potentially contribute to disease risk.

2.
Genome Biol ; 21(1): 185, 2020 07 27.
Article in English | MEDLINE | ID: mdl-32718348

ABSTRACT

BACKGROUND: Mobile elements are a major source of structural variants in the human genome, and some mobile elements can regulate gene expression and transcript splicing. However, the impact of polymorphic mobile element insertions (pMEIs) on gene expression and splicing in diverse human tissues has not been thoroughly studied. The multi-tissue gene expression and whole genome sequencing data generated by the Genotype-Tissue Expression (GTEx) project provide a great opportunity to systematically evaluate the role of pMEIs in regulating gene expression in human tissues. RESULTS: Using the GTEx whole genome sequencing data, we identify 20,545 high-quality pMEIs from 639 individuals. Coupling pMEI genotypes with gene expression profiles, we identify pMEI-associated expression quantitative trait loci (eQTLs) and splicing quantitative trait loci (sQTLs) in 48 tissues. Using joint analyses of pMEIs and other genomic variants, pMEIs are predicted to be the potential causal variant for 3522 eQTLs and 3717 sQTLs. The pMEI-associated eQTLs and sQTLs show a high level of tissue specificity, and these pMEIs are enriched in the proximity of affected genes and in regulatory elements. Using reporter assays, we confirm that several pMEIs associated with eQTLs and sQTLs can alter gene expression levels and isoform proportions, respectively. CONCLUSION: Overall, our study shows that pMEIs are associated with thousands of gene expression and splicing variations, indicating that pMEIs could have a significant role in regulating tissue-specific gene expression and transcript splicing. Detailed mechanisms for the role of pMEIs in gene regulation in different tissues will be an important direction for future studies.


Subject(s)
Alternative Splicing , Datasets as Topic , Gene Expression , Interspersed Repetitive Sequences , Quantitative Trait Loci , Humans
3.
Philos Trans R Soc Lond B Biol Sci ; 375(1795): 20190335, 2020 03 30.
Article in English | MEDLINE | ID: mdl-32075555

ABSTRACT

Long interspersed element-1 (LINE-1, L1) sequences, which comprise about 17% of human genome, are the product of one of the most active types of mobile DNAs in modern humans. LINE-1 insertion alleles can cause inherited and de novo genetic diseases, and LINE-1-encoded proteins are highly expressed in some cancers. Genome-wide LINE-1 mapping in single cells could be useful for defining somatic and germline retrotransposition rates, and for enabling studies to characterize tumour heterogeneity, relate insertions to transcriptional and epigenetic effects at the cellular level, or describe cellular phylogenies in development. Our laboratories have reported a genome-wide LINE-1 insertion site mapping method for bulk DNA, named transposon insertion profiling by sequencing (TIPseq). There have been significant barriers applying LINE-1 mapping to single cells, owing to the chimeric artefacts and features of repetitive sequences. Here, we optimize a modified TIPseq protocol and show its utility for LINE-1 mapping in single lymphoblastoid cells. Results from single-cell TIPseq experiments compare well to known LINE-1 insertions found by whole-genome sequencing and TIPseq on bulk DNA. Among the several approaches we tested, whole-genome amplification by multiple displacement amplification followed by restriction enzyme digestion, vectorette ligation and LINE-1-targeted PCR had the best assay performance. This article is part of a discussion meeting issue 'Crossroads between transposons and gene regulation'.


Subject(s)
DNA Transposable Elements/genetics , Long Interspersed Nucleotide Elements/genetics , Mutagenesis, Insertional , Cell Line , Humans , Sequence Analysis, DNA , Single-Cell Analysis
4.
Nat Struct Mol Biol ; 27(2): 168-178, 2020 02.
Article in English | MEDLINE | ID: mdl-32042151

ABSTRACT

LINE-1 retrotransposon overexpression is a hallmark of human cancers. We identified a colorectal cancer wherein a fast-growing tumor subclone downregulated LINE-1, prompting us to examine how LINE-1 expression affects cell growth. We find that nontransformed cells undergo a TP53-dependent growth arrest and activate interferon signaling in response to LINE-1. TP53 inhibition allows LINE-1+ cells to grow, and genome-wide-knockout screens show that these cells require replication-coupled DNA-repair pathways, replication-stress signaling and replication-fork restart factors. Our findings demonstrate that LINE-1 expression creates specific molecular vulnerabilities and reveal a retrotransposition-replication conflict that may be an important determinant of cancer growth.


Subject(s)
DNA/genetics , Long Interspersed Nucleotide Elements , Neoplasms/genetics , Cell Line, Tumor , Cell Proliferation , DNA Replication , G1 Phase Cell Cycle Checkpoints , Gene Expression Regulation, Neoplastic , HEK293 Cells , HeLa Cells , Humans , Signal Transduction , Tumor Suppressor Protein p53/genetics , Tumor Suppressor Protein p53/metabolism
5.
Mob DNA ; 11: 1, 2020.
Article in English | MEDLINE | ID: mdl-31892958

ABSTRACT

BACKGROUND: Long interspersed element-1 (LINE-1, L1) is the major driver of mobile DNA activity in modern humans. When expressed, LINE-1 loci produce bicistronic transcripts encoding two proteins essential for retrotransposition, ORF1p and ORF2p. Many types of human cancers are characterized by L1 promoter hypomethylation, L1 transcription, L1 ORF1p protein expression, and somatic L1 retrotransposition. ORF2p encodes the endonuclease and reverse transcriptase activities required for L1 retrotransposition. Its expression is poorly characterized in human tissues and cell lines. RESULTS: We report mass spectrometry-based tumor proteome profiling studies wherein ORF2p eludes detection. To test whether ORF2p could be detected with specific reagents, we developed and validated five rabbit monoclonal antibodies with immunoreactivity for specific epitopes on the protein. These reagents readily detect ectopic ORF2p expressed from bicistronic L1 constructs. However, endogenous ORF2p is not detected in human tumor samples or cell lines by western blot, immunoprecipitation, or immunohistochemistry despite high levels of ORF1p expression. Moreover, we report endogenous ORF1p-associated interactomes, affinity isolated from colorectal cancers, wherein we similarly fail to detect ORF2p. These samples include primary tumors harboring hundreds of somatically acquired L1 insertions. The new data are available via ProteomeXchange with identifier PXD013743. CONCLUSIONS: Although somatic retrotransposition provides unequivocal genetic evidence for the expression of ORF2p in human cancers, we are unable to directly measure its presence using several standard methods. Experimental systems have previously indicated an unequal stoichiometry between ORF1p and ORF2p, but in vivo, the expression of these two proteins may be more strikingly uncoupled. These findings are consistent with observations that ORF2p is not tolerable for cell growth.

6.
Mob DNA ; 10: 8, 2019.
Article in English | MEDLINE | ID: mdl-30899333

ABSTRACT

BACKGROUND: Transposable elements make up a significant portion of the human genome. Accurately locating these mobile DNAs is vital to understand their role as a source of structural variation and somatic mutation. To this end, laboratories have developed strategies to selectively amplify or otherwise enrich transposable element insertion sites in genomic DNA. RESULTS: Here we describe a technique, Transposon Insertion Profiling by sequencing (TIPseq), to map Long INterspersed Element 1 (LINE-1, L1) retrotransposon insertions in the human genome. This method uses vectorette PCR to amplify species-specific L1 (L1PA1) insertion sites followed by paired-end Illumina sequencing. In addition to providing a step-by-step molecular biology protocol, we offer users a guide to our pipeline for data analysis, TIPseqHunter. Our recent studies in pancreatic and ovarian cancer demonstrate the ability of TIPseq to identify invariant (fixed), polymorphic (inherited variants), as well as somatically-acquired L1 insertions that distinguish cancer genomes from a patient's constitutional make-up. CONCLUSIONS: TIPseq provides an approach for amplifying evolutionarily young, active transposable element insertion sites from genomic DNA. Our rationale and variations on this protocol may be useful to those mapping L1 and other mobile elements in complex genomes.

7.
Nucleic Acids Res ; 47(1): 421-431, 2019 01 10.
Article in English | MEDLINE | ID: mdl-30418605

ABSTRACT

RNA splicing is a highly regulated process dependent on sequences near splice sites. Insertions of Alu retrotransposons can disrupt splice sites or bind splicing regulators. We hypothesized that some common inherited polymorphic Alu insertions are responsible for splicing QTLs (sQTL). We focused on intronic Alu variants mapping within 100 bp of an alternatively used exon and screened for those that alter splicing. We identify five loci, 21.7% of those assayed, where the polymorphic Alu alters splicing. While in most cases the Alu promotes exon skipping, at one locus the Alu increases exon inclusion. Of particular interest is an Alu polymorphism in the CD58 gene. Reduced CD58 expression is associated with risk for developing multiple sclerosis. We show that the Alu insertion promotes skipping of CD58 exon 3 and results in a frameshifted transcript, indicating that the Alu may be the causative variant for increased MS risk at this locus. Using RT-PCR analysis at the endogenous locus, we confirm that the Alu variant is a sQTL for CD58. In summary, altered splicing efficiency is a common functional consequence of Alu polymorphisms including at least one instance where the variant is implicated in disease risk. This work broadens our understanding of splicing regulatory sequences around exons.


Subject(s)
Alu Elements/genetics , CD58 Antigens/genetics , Quantitative Trait Loci/genetics , RNA Splicing/genetics , Alternative Splicing/genetics , Exons/genetics , Genetic Variation , Humans , Introns/genetics , RNA Splice Sites/genetics , RNA, Messenger/genetics
8.
Proc Natl Acad Sci U S A ; 114(20): E3984-E3992, 2017 05 16.
Article in English | MEDLINE | ID: mdl-28465436

ABSTRACT

Interspersed repeat sequences comprise much of our DNA, although their functional effects are poorly understood. The most commonly occurring repeat is the Alu short interspersed element. New Alu insertions occur in human populations, and have been responsible for several instances of genetic disease. In this study, we sought to determine if there are instances of polymorphic Alu insertion variants that function in a common variant, common disease paradigm. We cataloged 809 polymorphic Alu elements mapping to 1,159 loci implicated in disease risk by genome-wide association study (GWAS) (P < 10-8). We found that Alu insertion variants occur disproportionately at GWAS loci (P = 0.013). Moreover, we identified 44 of these Alu elements in linkage disequilibrium (r2 > 0.7) with the trait-associated SNP. This figure represents a >20-fold increase in the number of polymorphic Alu elements associated with human phenotypes. This work provides a broader perspective on how structural variants in repetitive DNAs may contribute to human disease.


Subject(s)
Alu Elements , Disease/genetics , Case-Control Studies , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Linkage Disequilibrium , Polymorphism, Single Nucleotide
9.
Mob DNA ; 8: 6, 2017.
Article in English | MEDLINE | ID: mdl-28450901

ABSTRACT

BACKGROUND: Polymorphic Alu elements account for 17% of structural variants in the human genome. The majority of these belong to the youngest AluY subfamilies, and most structural variant discovery efforts have focused on identifying Alu polymorphisms from these currently retrotranspositionally active subfamilies. In this report we analyze polymorphisms from the evolutionarily older AluS subfamily, whose peak activity was tens of millions of years ago. We annotate the AluS polymorphisms, assess their likely mechanism of origin, and evaluate their contribution to structural variation in the human genome. RESULTS: Of 52 previously reported polymorphic AluS elements ascertained for this study, 48 were confirmed to belong to the AluS subfamily using high stringency subfamily classification criteria. Of these, the majority (77%, 37/48) appear to be deletion polymorphisms. Two polymorphic AluS elements (4%) have features of non-classical Alu insertions and one polymorphic AluS element (2%) likely inserted by a mechanism involving internal priming. Seven AluS polymorphisms (15%) appear to have arisen by the classical target-primed reverse transcription (TPRT) retrotransposition mechanism. These seven TPRT products are 3' intact with 3' poly-A tails, and are flanked by target site duplications; L1 ORF2p endonuclease cleavage sites were also observed, providing additional evidence that these are L1 ORF2p endonuclease-mediated TPRT insertions. Further sequence analysis showed strong conservation of both the RNA polymerase III promoter and SRP9/14 binding sites, important for mediating transcription and interaction with retrotransposition machinery, respectively. This conservation of functional features implies that some of these are fairly recent insertions since they have not diverged significantly from their respective retrotranspositionally competent source elements. CONCLUSIONS: Of the polymorphic AluS elements evaluated in this report, 15% (7/48) have features consistent with TPRT-mediated insertion, thus suggesting that some AluS elements have been more active recently than previously thought, or that fixation of AluS insertion alleles remains incomplete. These data expand the potential significance of polymorphic AluS elements in contributing to structural variation in the human genome. Future discovery efforts focusing on polymorphic AluS elements are likely to identify more such polymorphisms, and approaches tailored to identify deletion alleles may be warranted.

10.
Proc Natl Acad Sci U S A ; 114(5): E733-E740, 2017 01 31.
Article in English | MEDLINE | ID: mdl-28096347

ABSTRACT

Mammalian genomes are replete with interspersed repeats reflecting the activity of transposable elements. These mobile DNAs are self-propagating, and their continued transposition is a source of both heritable structural variation as well as somatic mutation in human genomes. Tailored approaches to map these sequences are useful to identify insertion alleles. Here, we describe in detail a strategy to amplify and sequence long interspersed element-1 (LINE-1, L1) retrotransposon insertions selectively in the human genome, transposon insertion profiling by next-generation sequencing (TIPseq). We also report the development of a machine-learning-based computational pipeline, TIPseqHunter, to identify insertion sites with high precision and reliability. We demonstrate the utility of this approach to detect somatic retrotransposition events in high-grade ovarian serous carcinoma.


Subject(s)
Long Interspersed Nucleotide Elements/genetics , Ovarian Neoplasms/genetics , Algorithms , Female , Genome, Human , Humans , Machine Learning , Polymerase Chain Reaction/methods
11.
Mob DNA ; 7: 22, 2016.
Article in English | MEDLINE | ID: mdl-27843500

ABSTRACT

BACKGROUND: Gliomas are the most common primary brain tumors in adults. We sought to understand the roles of endogenous transposable elements in these malignancies by identifying evidence of somatic retrotransposition in glioblastomas (GBM). We performed transposon insertion profiling of the active subfamily of Long INterspersed Element-1 (LINE-1) elements by deep sequencing (TIPseq) on genomic DNA of low passage oncosphere cell lines derived from 7 primary GBM biopsies, 3 secondary GBM tissue samples, and matched normal intravenous blood samples from the same individuals. RESULTS: We found and PCR validated one somatically acquired tumor-specific insertion in a case of secondary GBM. No LINE-1 insertions present in primary GBM oncosphere cultures were missing from corresponding blood samples. However, several copies of the element (11) were found in genomic DNA from blood and not in the oncosphere cultures. SNP 6.0 microarray analysis revealed deletions or loss of heterozygosity in the tumor genomes over the intervals corresponding to these LINE-1 insertions. CONCLUSIONS: These findings indicate that LINE-1 retrotransposon can act as an infrequent insertional mutagen in secondary GBM, but that retrotransposition is uncommon in these central nervous system tumors as compared to other neoplasias.

12.
Nat Med ; 21(9): 1060-4, 2015 Sep.
Article in English | MEDLINE | ID: mdl-26259033

ABSTRACT

Pancreatic ductal adenocarcinoma (PDAC) is typically diagnosed after the disease has metastasized; it is among the most lethal forms of cancer. We recently described aberrant expression of an open reading frame 1 protein, ORF1p, encoded by long interspersed element-1 (LINE-1; L1) retrotransposon, in PDAC. To test whether LINE-1 expression leads to somatic insertions of this mobile DNA, we used a targeted method to sequence LINE-1 insertion sites in matched PDAC and normal samples. We found evidence of 465 somatic LINE-1 insertions in 20 PDAC genomes, which were absent from corresponding normal samples. In cases in which matched normal tissue, primary PDAC and metastatic disease sites were available, insertions were found in primary and metastatic tissues in differing proportions. Two adenocarcinomas secondarily involving the pancreas, but originating in the stomach and duodenum, acquired insertions with a similar discordance between primary and metastatic sites. Together, our findings show that LINE-1 contributes to the genetic evolution of PDAC and suggest that somatic insertions are acquired discontinuously in gastrointestinal neoplasms.


Subject(s)
Carcinoma, Pancreatic Ductal/genetics , Clonal Evolution , Long Interspersed Nucleotide Elements/physiology , Pancreatic Neoplasms/genetics , Apoptotic Protease-Activating Factor 1/analysis , Humans
13.
Bioinformatics ; 30(11): 1514-21, 2014 Jun 01.
Article in English | MEDLINE | ID: mdl-24501098

ABSTRACT

MOTIVATION: Repetitive sequences account for approximately half of the human genome. Accurately ascertaining sequences in these regions with next generation sequencers is challenging, and requires a different set of analytical techniques than for reads originating from unique sequences. Complicating the matter are repetitive regions subject to programmed rearrangements, as is the case with the antigen-binding domains in the Immunoglobulin (Ig) and T-cell receptor (TCR) loci. RESULTS: We developed a probability-based score and visualization method to aid in distinguishing true structural variants from alignment artifacts. We demonstrate the usefulness of this method in its ability to separate real structural variants from false positives generated with existing upstream analysis tools. We validated our approach using both target-capture and whole-genome experiments. Capture sequencing reads were generated from primary lymphoid tumors, cancer cell lines and an EBV-transformed lymphoblast cell line over the Ig and TCR loci. Whole-genome sequencing reads were from a lymphoblastoid cell-line. AVAILABILITY: We implement our method as an R package available at https://github.com/Eitan177/targetSeqView. Code to reproduce the figures and results are also available.


Subject(s)
Genomic Structural Variation , High-Throughput Nucleotide Sequencing/methods , Sequence Alignment/methods , Sequence Analysis, DNA/methods , Cell Line, Transformed , Cell Line, Tumor , DNA/chemistry , Genome, Human , Genomics/methods , Humans , Probability , Repetitive Sequences, Nucleic Acid
14.
BMC Genomics ; 14: 565, 2013 Aug 19.
Article in English | MEDLINE | ID: mdl-23957733

ABSTRACT

BACKGROUND: Lymphocytes achieve diversity in antigen recognition in part by rearranging genomic DNA at loci encoding antibodies and cell surface receptors. The process, termed V(D)J recombination, juxtaposes modular coding sequences for antigen binding. Erroneous recombination events causing chromosomal translocations are recognized causes of lymphoid malignancies. Here we show a hybridization based method for sequence enrichment can be used to efficiently and selectively capture genomic DNA adjacent to V(D)J recombination breakpoints for massively parallel sequencing. The approach obviates the need for PCR amplification of recombined sequences. RESULTS: Using tailored informatics analyses to resolve alignment and assembly issues in these repetitive regions, we were able to detect numerous recombination events across a panel of cancer cell lines and primary lymphoid tumors, and an EBV transformed lymphoblast line. With reassembly, breakpoints could be defined to single base pair resolution. The observed events consist of canonical V(D)J or V-J rearrangements, non-canonical rearrangements, and putatively oncogenic reciprocal chromosome translocations. We validated non-canonical and chromosome translocation junctions by PCR and Sanger sequencing. The translocations involved the MYC and BCL-2 loci, and activation of these was consistent with histopathologic features of the respective B-cell tumors. We also show an impressive prevalence of novel erroneous V-V recombination events at sites not incorporated with other downstream coding segments. CONCLUSIONS: Our results demonstrate the ability of next generation sequencing to describe human V(D)J recombinase activity and provide a scalable means to chronicle off-target, unexpressed, and non-amplifiable recombinations occurring in the development of lymphoid cancers.


Subject(s)
Gene Rearrangement , Leukemia, Lymphoid/genetics , VDJ Recombinases/physiology , B-Lymphocytes/enzymology , Base Sequence , Carcinogenesis/genetics , Cell Line, Tumor , Chromosome Breakpoints , Chromosome Mapping , High-Throughput Nucleotide Sequencing , Humans , Leukemia, Lymphoid/enzymology , Sequence Analysis, DNA , Translocation, Genetic
15.
PLoS One ; 6(6): e20874, 2011.
Article in English | MEDLINE | ID: mdl-21698104

ABSTRACT

A newly discovered gammaretrovirus, termed XMRV, was recently reported to be present in the prostate cancer cell line CWR22Rv1. Using a combination of both immunohistochemistry with broadly-reactive murine leukemia virus (MLV) anti-sera and PCR, we determined if additional prostate cancer or other cell lines contain XMRV or MLV-related viruses. Our study included a total of 72 cell lines, which included 58 of the 60 human cancer cell lines used in anticancer drug screens and maintained at the NCI-Frederick (NCI-60). We have identified gammaretroviruses in two additional prostate cancer cell lines: LAPC4 and VCaP, and show that these viruses are replication competent. Viral genome sequencing identified the virus in LAPC4 and VCaP as nearly identical to another known xenotropic MLV, Bxv-1. We also identified a gammaretrovirus in the non-small-cell lung carcinoma cell line EKVX. Prostate cancer cell lines appear to have a propensity for infection with murine gammaretroviruses, and we propose that this may be in part due to cell line establishment by xenograft passage in immunocompromised mice. It is unclear if infection with these viruses is necessary for cell line establishment, or what confounding role they may play in experiments performed with these commonly used lines. Importantly, our results suggest a need for regular screening of cancer cell lines for retroviral "contamination", much like routine mycoplasma testing.


Subject(s)
Gammaretrovirus/physiology , Prostatic Neoplasms/virology , Virus Replication , Animals , Base Sequence , Cell Line, Tumor , DNA Primers , Gammaretrovirus/genetics , Genome, Viral , Humans , Male , Mice , Polymerase Chain Reaction
16.
Cell ; 141(7): 1171-82, 2010 Jun 25.
Article in English | MEDLINE | ID: mdl-20602999

ABSTRACT

Characterizing structural variants in the human genome is of great importance, but a genome wide analysis to detect interspersed repeats has not been done. Thus, the degree to which mobile DNAs contribute to genetic diversity, heritable disease, and oncogenesis remains speculative. We perform transposon insertion profiling by microarray (TIP-chip) to map human L1(Ta) retrotransposons (LINE-1 s) genome-wide. This identified numerous novel human L1(Ta) insertional polymorphisms with highly variant allelic frequencies. We also explored TIP-chip's usefulness to identify candidate alleles associated with different phenotypes in clinical cohorts. Our data suggest that the occurrence of new insertions is twice as high as previously estimated, and that these repeats are under-recognized as sources of human genomic and phenotypic diversity. We have just begun to probe the universe of human L1(Ta) polymorphisms, and as TIP-chip is applied to other insertions such as Alu SINEs, it will expand the catalog of genomic variants even further.


Subject(s)
DNA Transposable Elements , Genome, Human , Genome-Wide Association Study , Oligonucleotide Array Sequence Analysis , Chromosomes, Human, X , DNA Restriction Enzymes/metabolism , Genetic Diseases, X-Linked/genetics , Humans , Male
SELECTION OF CITATIONS
SEARCH DETAIL
...