Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 22
Filter
Add more filters










Publication year range
1.
Genome Res ; 32(7): 1298-1314, 2022 07.
Article in English | MEDLINE | ID: mdl-35728967

ABSTRACT

The retrotransposon LINE-1 (L1) is central to the recent evolutionary history of the human genome and continues to drive genetic diversity and germline pathogenesis. However, the spatiotemporal extent and biological significance of somatic L1 activity are poorly defined and are virtually unexplored in other primates. From a single L1 lineage active at the divergence of apes and Old World monkeys, successive L1 subfamilies have emerged in each descendant primate germline. As revealed by case studies, the presently active human L1 subfamily can also mobilize during embryonic and brain development in vivo. It is unknown whether nonhuman primate L1s can similarly generate somatic insertions in the brain. Here we applied approximately 40× single-cell whole-genome sequencing (scWGS), as well as retrotransposon capture sequencing (RC-seq), to 20 hippocampal neurons from two rhesus macaques (Macaca mulatta). In one animal, we detected and PCR-validated a somatic L1 insertion that generated target site duplications, carried a short 5' transduction, and was present in ∼7% of hippocampal neurons but absent from cerebellum and nonbrain tissues. The corresponding donor L1 allele was exceptionally mobile in vitro and was embedded in PRDM4, a gene expressed throughout development and in neural stem cells. Nanopore long-read methylome and RNA-seq transcriptome analyses indicated young retrotransposon subfamily activation in the early embryo, followed by repression in adult tissues. These data highlight endogenous macaque L1 retrotransposition potential, provide prototypical evidence of L1-mediated somatic mosaicism in a nonhuman primate, and allude to L1 mobility in the brain over the past 30 million years of human evolution.


Subject(s)
Brain , Long Interspersed Nucleotide Elements , Retroelements , Animals , DNA-Binding Proteins/genetics , Macaca mulatta/genetics , Neurons , Retroelements/genetics , Transcription Factors/genetics
2.
Nat Commun ; 9(1): 5398, 2018 12 19.
Article in English | MEDLINE | ID: mdl-30568248

ABSTRACT

This Article contains an error in the author affiliations. The correct affiliation for author Ruchi Shukla is 'MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Crewe Road, Edinburgh, EH4 2XU, UK', and is not 'Mater Research Institute - University of Queensland, TRI Building, Woolloongabba QLD 4102, Australia'.

3.
Genome Res ; 28(5): 639-653, 2018 05.
Article in English | MEDLINE | ID: mdl-29643204

ABSTRACT

The retrotransposon Long Interspersed Element 1 (LINE-1 or L1) is a continuing source of germline and somatic mutagenesis in mammals. Deregulated L1 activity is a hallmark of cancer, and L1 mutagenesis has been described in numerous human malignancies. We previously employed retrotransposon capture sequencing (RC-seq) to analyze hepatocellular carcinoma (HCC) samples from patients infected with hepatitis B or hepatitis C virus and identified L1 variants responsible for activating oncogenic pathways. Here, we have applied RC-seq and whole-genome sequencing (WGS) to an Abcb4 (Mdr2)-/- mouse model of hepatic carcinogenesis and demonstrated for the first time that L1 mobilization occurs in murine tumors. In 12 HCC nodules obtained from 10 animals, we validated four somatic L1 insertions by PCR and capillary sequencing, including TF subfamily elements, and one GF subfamily example. One of the TF insertions carried a 3' transduction, allowing us to identify its donor L1 and to demonstrate that this full-length TF element retained retrotransposition capacity in cultured cancer cells. Using RC-seq, we also identified eight tumor-specific L1 insertions from 25 HCC patients with a history of alcohol abuse. Finally, we used RC-seq and WGS to identify three tumor-specific L1 insertions among 10 intra-hepatic cholangiocarcinoma (ICC) patients, including one insertion traced to a donor L1 on Chromosome 22 known to be highly active in other cancers. This study reveals L1 mobilization as a common feature of hepatocarcinogenesis in mammals, demonstrating that the phenomenon is not restricted to human viral HCC etiologies and is encountered in murine liver tumors.


Subject(s)
Carcinoma, Hepatocellular/genetics , Liver Neoplasms/genetics , Long Interspersed Nucleotide Elements/genetics , Retroelements/genetics , ATP Binding Cassette Transporter, Subfamily B/genetics , Adult , Aged , Aged, 80 and over , Animals , Cell Transformation, Neoplastic/genetics , Female , Humans , Liver/metabolism , Liver/pathology , Male , Mammals/genetics , Mice, Knockout , Middle Aged , Mutagenesis, Insertional , ATP-Binding Cassette Sub-Family B Member 4
4.
Genome Res ; 27(8): 1395-1405, 2017 08.
Article in English | MEDLINE | ID: mdl-28483779

ABSTRACT

LINE-1 (L1) retrotransposons are a noted source of genetic diversity and disease in mammals. To expand its genomic footprint, L1 must mobilize in cells that will contribute their genetic material to subsequent generations. Heritable L1 insertions may therefore arise in germ cells and in pluripotent embryonic cells, prior to germline specification, yet the frequency and predominant developmental timing of such events remain unclear. Here, we applied mouse retrotransposon capture sequencing (mRC-seq) and whole-genome sequencing (WGS) to pedigrees of C57BL/6J animals, and uncovered an L1 insertion rate of ≥1 event per eight births. We traced heritable L1 insertions to pluripotent embryonic cells and, strikingly, to early primordial germ cells (PGCs). New L1 insertions bore structural hallmarks of target-site primed reverse transcription (TPRT) and mobilized efficiently in a cultured cell retrotransposition assay. Together, our results highlight the rate and evolutionary impact of heritable L1 retrotransposition and reveal retrotransposition-mediated genomic diversification as a fundamental property of pluripotent embryonic cells in vivo.


Subject(s)
Embryo, Mammalian/metabolism , Long Interspersed Nucleotide Elements , Animals , Embryo, Mammalian/cytology , Female , Genomics/methods , Germ Cells , HeLa Cells , Humans , Male , Mice , Mice, Inbred C57BL , Mosaicism , Whole Genome Sequencing/methods
5.
Mob DNA ; 7: 21, 2016.
Article in English | MEDLINE | ID: mdl-27843499

ABSTRACT

BACKGROUND: LINE-1 (L1) retrotransposons are a notable endogenous source of mutagenesis in mammals. Notably, cancer cells can support unusual L1 retrotransposition and L1-associated sequence rearrangement mechanisms following DNA damage. Recent reports suggest that L1 is mobile in epithelial tumours and neural cells but, paradoxically, not in brain cancers. RESULTS: Here, using retrotransposon capture sequencing (RC-seq), we surveyed L1 mutations in 14 tumours classified as glioblastoma multiforme (GBM) or as a lower grade glioma. In four GBM tumours, we characterised one probable endonuclease-independent L1 insertion, two L1-associated rearrangements and one likely Alu-Alu recombination event adjacent to an L1. These mutations included PCR validated intronic events in MeCP2 and EGFR. Despite sequencing L1 integration sites at up to 250× depth by RC-seq, we found no tumour-specific, endonuclease-dependent L1 insertions. Whole genome sequencing analysis of the tumours carrying the MeCP2 and EGFR L1 mutations also revealed no endonuclease-dependent L1 insertions. In a complementary in vitro assay, wild-type and endonuclease mutant L1 reporter constructs each mobilised very inefficiently in four cultured GBM cell lines. CONCLUSIONS: These experiments altogether highlight the consistent absence of canonical L1 retrotransposition in GBM tumours and cultured cell lines, as well as atypical L1-associated sequence rearrangements following DNA damage in vivo.

6.
Nat Commun ; 7: 10286, 2016 Jan 08.
Article in English | MEDLINE | ID: mdl-26743714

ABSTRACT

Human induced pluripotent stem cells (hiPSCs) are capable of unlimited proliferation and can differentiate in vitro to generate derivatives of the three primary germ layers. Genetic and epigenetic abnormalities have been reported by Wissing and colleagues to occur during hiPSC derivation, including mobilization of engineered LINE-1 (L1) retrotransposons. However, incidence and functional impact of endogenous retrotransposition in hiPSCs are yet to be established. Here we apply retrotransposon capture sequencing to eight hiPSC lines and three human embryonic stem cell (hESC) lines, revealing endogenous L1, Alu and SINE-VNTR-Alu (SVA) mobilization during reprogramming and pluripotent stem cell cultivation. Surprisingly, 4/7 de novo L1 insertions are full length and 6/11 retrotransposition events occurred in protein-coding genes expressed in pluripotent stem cells. We further demonstrate that an intronic L1 insertion in the CADPS2 gene is acquired during hiPSC cultivation and disrupts CADPS2 expression. These experiments elucidate endogenous retrotransposition, and its potential consequences, in hiPSCs and hESCs.


Subject(s)
Alu Elements/genetics , Cell Proliferation/genetics , Cellular Reprogramming/genetics , Embryonic Stem Cells/metabolism , Induced Pluripotent Stem Cells/metabolism , Long Interspersed Nucleotide Elements/genetics , Calcium-Binding Proteins/genetics , Cell Line , Cellular Reprogramming Techniques , Epigenesis, Genetic , Humans , Minisatellite Repeats , Retroelements/genetics , Vesicular Transport Proteins/genetics
7.
Cell ; 161(2): 228-39, 2015 Apr 09.
Article in English | MEDLINE | ID: mdl-25860606

ABSTRACT

Somatic LINE-1 (L1) retrotransposition during neurogenesis is a potential source of genotypic variation among neurons. As a neurogenic niche, the hippocampus supports pronounced L1 activity. However, the basal parameters and biological impact of L1-driven mosaicism remain unclear. Here, we performed single-cell retrotransposon capture sequencing (RC-seq) on individual human hippocampal neurons and glia, as well as cortical neurons. An estimated 13.7 somatic L1 insertions occurred per hippocampal neuron and carried the sequence hallmarks of target-primed reverse transcription. Notably, hippocampal neuron L1 insertions were specifically enriched in transcribed neuronal stem cell enhancers and hippocampus genes, increasing their probability of functional relevance. In addition, bias against intronic L1 insertions sense oriented relative to their host gene was observed, perhaps indicating moderate selection against this configuration in vivo. These experiments demonstrate pervasive L1 mosaicism at genomic loci expressed in hippocampal neurons.


Subject(s)
Hippocampus/cytology , Long Interspersed Nucleotide Elements , Mosaicism , Neurons/cytology , Genetic Variation , Humans , Neurogenesis , Polymerase Chain Reaction , Tissue Banks
8.
Plant J ; 79(6): 993-1008, 2014 Sep.
Article in English | MEDLINE | ID: mdl-24947485

ABSTRACT

Switchgrass (Panicum virgatum) is a polyploid, outcrossing grass species native to North America and has recently been recognized as a potential biofuel feedstock crop. Significant phenotypic variation including ploidy is present across the two primary ecotypes of switchgrass, referred to as upland and lowland switchgrass. The tetraploid switchgrass genome is approximately 1400 Mbp, split between two subgenomes, with significant repetitive sequence content limiting the efficiency of re-sequencing approaches for determining genome diversity. To characterize genetic diversity in upland and lowland switchgrass as a first step in linking genotype to phenotype, we designed an exome capture probe set based on transcript assemblies that represent approximately 50 Mb of annotated switchgrass exome sequences. We then evaluated and optimized the probe set using solid phase comparative genome hybridization and liquid phase exome capture followed by next-generation sequencing. Using the optimized probe set, we assessed variation in the exomes of eight switchgrass genotypes representing tetraploid lowland and octoploid upland cultivars to benchmark our exome capture probe set design. We identified ample variation in the switchgrass genome including 1,395,501 single nucleotide polymorphisms (SNPs), 8173 putative copy number variants and 3336 presence/absence variants. While the majority of the SNPs (84%) detected was bi-allelic, a substantial number was tri-allelic with limited occurrence of tetra-allelic polymorphisms consistent with the heterozygous and polyploid nature of the switchgrass genome. Collectively, these data demonstrate the efficacy of exome capture for discovery of genome variation in a polyploid species with a large, repetitive and heterozygous genome.


Subject(s)
DNA Copy Number Variations/genetics , Exome/genetics , Genetic Variation , Genome, Plant/genetics , Panicum/genetics , Alleles , Base Sequence , Ecotype , Genotype , High-Throughput Nucleotide Sequencing , Polymorphism, Single Nucleotide , Polyploidy , Sequence Analysis, DNA
9.
Nat Protoc ; 9(5): 989-1009, 2014 May.
Article in English | MEDLINE | ID: mdl-24705597

ABSTRACT

RNA sequencing (RNAseq) samples the majority of expressed genes infrequently, owing to the large size, complex splicing and wide dynamic range of eukaryotic transcriptomes. This results in sparse sequencing coverage that can hinder robust isoform assembly and quantification. RNA capture sequencing (CaptureSeq) addresses this challenge by using oligonucleotide probes to capture selected genes or regions of interest for targeted sequencing. Targeted RNAseq provides enhanced coverage for sensitive gene discovery, robust transcript assembly and accurate gene quantification. Here we describe a detailed protocol for all stages of RNA CaptureSeq, from initial probe design considerations and capture of targeted genes to final assembly and quantification of captured transcripts. Initial probe design and final analysis can take less than 1 d, whereas the central experimental capture stage requires ∼7 d.


Subject(s)
Genetic Association Studies/methods , Oligonucleotide Probes/genetics , Sequence Analysis, RNA/methods , Software , Exons/genetics , Nucleic Acid Amplification Techniques/methods
10.
PLoS One ; 8(12): e81992, 2013.
Article in English | MEDLINE | ID: mdl-24312619

ABSTRACT

Targeted genomic selection methodologies, or sequence capture, allow for DNA enrichment and large-scale resequencing and characterization of natural genetic variation in species with complex genomes, such as rapeseed canola (Brassica napus L., AACC, 2n=38). The main goal of this project was to combine sequence capture with next generation sequencing (NGS) to discover single nucleotide polymorphisms (SNPs) in specific areas of the B. napus genome historically associated (via quantitative trait loci -QTL- analysis) to traits of agronomical and nutritional importance. A 2.1 million feature sequence capture platform was designed to interrogate DNA sequence variation across 47 specific genomic regions, representing 51.2 Mb of the Brassica A and C genomes, in ten diverse rapeseed genotypes. All ten genotypes were sequenced using the 454 Life Sciences chemistry and to assess the effect of increased sequence depth, two genotypes were also sequenced using Illumina HiSeq chemistry. As a result, 589,367 potentially useful SNPs were identified. Analysis of sequence coverage indicated a four-fold increased representation of target regions, with 57% of the filtered SNPs falling within these regions. Sixty percent of discovered SNPs corresponded to transitions while 40% were transversions. Interestingly, fifty eight percent of the SNPs were found in genic regions while 42% were found in intergenic regions. Further, a high percentage of genic SNPs was found in exons (65% and 64% for the A and C genomes, respectively). Two different genotyping assays were used to validate the discovered SNPs. Validation rates ranged from 61.5% to 84% of tested SNPs, underpinning the effectiveness of this SNP discovery approach. Most importantly, the discovered SNPs were associated with agronomically important regions of the B. napus genome generating a novel data resource for research and breeding this crop species.


Subject(s)
Brassica napus/genetics , DNA, Plant/genetics , Genomics/methods , High-Throughput Nucleotide Sequencing/methods , Oligonucleotide Array Sequence Analysis , Polymorphism, Single Nucleotide , Sequence Analysis, DNA/methods , Genome, Plant/genetics , Genotype , Introns/genetics , Quantitative Trait Loci/genetics , Reproducibility of Results
11.
Genome Biol ; 14(9): R96, 2013.
Article in English | MEDLINE | ID: mdl-24025402

ABSTRACT

BACKGROUND: We have used a sensitized ENU mutagenesis screen to produce mouse lines that carry mutations in genes required for epigenetic regulation. We call these lines Modifiers of murine metastable epialleles (Mommes). RESULTS: We report a basic molecular and phenotypic characterization for twenty of the Momme mouse lines, and in each case we also identify the causative mutation. Three of the lines carry a mutation in a novel epigenetic modifier, Rearranged L-myc fusion (Rlf), and one gene, Rap-interacting factor 1 (Rif1), has not previously been reported to be involved in transcriptional regulation in mammals. Many of the other lines are novel alleles of known epigenetic regulators. For two genes, Rlf and Widely-interspaced zinc finger (Wiz), we describe the first mouse mutants. All of the Momme mutants show some degree of homozygous embryonic lethality, emphasizing the importance of epigenetic processes. The penetrance of lethality is incomplete in a number of cases. Similarly ,abnormalities in phenotype seen in the heterozygous individuals of some lines occur with incomplete penetrance. CONCLUSIONS: Recent advances in sequencing enhance the power of sensitized mutagenesis screens to identify the function of previously uncharacterized factors and to discover additional functions for previously characterized proteins. The observation of incomplete penetrance of phenotypes in these inbred mutant mice, at various stages of development, is of interest. Overall, the Momme collection of mouse mutants provides a valuable resource for researchers across many disciplines.


Subject(s)
Epigenesis, Genetic , Ethylnitrosourea/pharmacology , Genes, Lethal , Mutagenesis , Mutagens/pharmacology , Mutation/drug effects , Alleles , Animals , Gene Expression Regulation , Genome-Wide Association Study , Genotype , Guanine Nucleotide Exchange Factors , Heterozygote , Homozygote , Kruppel-Like Transcription Factors/genetics , Mice , Nerve Tissue Proteins/genetics , Phenotype , Telomere-Binding Proteins/genetics , Transcription Factors/genetics
12.
Plant J ; 76(3): 494-505, 2013 Nov.
Article in English | MEDLINE | ID: mdl-23889683

ABSTRACT

Advanced resources for genome-assisted research in barley (Hordeum vulgare) including a whole-genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole-genome resequencing and in silico variant detection. However such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA-coding exome reduces barley genomic complexity more than 50-fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in-solution hybridization-based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full-length cDNAs and de novo assembled RNA-Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA-coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping-by-sequencing and genetic diversity analyzes.


Subject(s)
Exome , Genome, Plant , Genomics/methods , Hordeum/genetics , Genomics/trends , Ploidies , Polymorphism, Single Nucleotide , Triticum/genetics
13.
Cell ; 153(1): 101-11, 2013 Mar 28.
Article in English | MEDLINE | ID: mdl-23540693

ABSTRACT

LINE-1 (L1) retrotransposons are mobile genetic elements comprising ~17% of the human genome. New L1 insertions can profoundly alter gene function and cause disease, though their significance in cancer remains unclear. Here, we applied enhanced retrotransposon capture sequencing (RC-seq) to 19 hepatocellular carcinoma (HCC) genomes and elucidated two archetypal L1-mediated mechanisms enabling tumorigenesis. In the first example, 4/19 (21.1%) donors presented germline retrotransposition events in the tumor suppressor mutated in colorectal cancers (MCC). MCC expression was ablated in each case, enabling oncogenic ß-catenin/Wnt signaling. In the second example, suppression of tumorigenicity 18 (ST18) was activated by a tumor-specific L1 insertion. Experimental assays confirmed that the L1 interrupted a negative feedback loop by blocking ST18 repression of its enhancer. ST18 was also frequently amplified in HCC nodules from Mdr2(-/-) mice, supporting its assignment as a candidate liver oncogene. These proof-of-principle results substantiate L1-mediated retrotransposition as an important etiological factor in HCC.


Subject(s)
Carcinoma, Hepatocellular/genetics , DNA Mutational Analysis , Genes, Tumor Suppressor , Liver Neoplasms/genetics , Long Interspersed Nucleotide Elements , Mutagenesis, Insertional , ATP Binding Cassette Transporter, Subfamily B/genetics , Adult , Aged , Animals , Cell Line, Tumor , Cell Transformation, Neoplastic , Female , Humans , Male , Mice , Middle Aged , Repressor Proteins/genetics , Tumor Suppressor Proteins/genetics , ATP-Binding Cassette Sub-Family B Member 4
14.
Plant Physiol ; 159(4): 1295-308, 2012 Aug.
Article in English | MEDLINE | ID: mdl-22696021

ABSTRACT

Genome-wide structural and gene content variations are hypothesized to drive important phenotypic variation within a species. Structural and gene content variations were assessed among four soybean (Glycine max) genotypes using array hybridization and targeted resequencing. Many chromosomes exhibited relatively low rates of structural variation (SV) among genotypes. However, several regions exhibited both copy number and presence-absence variation, the most prominent found on chromosomes 3, 6, 7, 16, and 18. Interestingly, the regions most enriched for SV were specifically localized to gene-rich regions that harbor clustered multigene families. The most abundant classes of gene families associated with these regions were the nucleotide-binding and receptor-like protein classes, both of which are important for plant biotic defense. The colocalization of SV with plant defense response signal transduction pathways provides insight into the mechanisms of soybean resistance gene evolution and may inform the development of new approaches to resistance gene cloning.


Subject(s)
Genes, Plant/genetics , Glycine max/genetics , Glycine max/physiology , Multigene Family/genetics , Stress, Physiological/genetics , Chromosomes, Plant/genetics , Comparative Genomic Hybridization , DNA Copy Number Variations/genetics , Disease Resistance/genetics , Ecotype , Exome/genetics , Genotype , Genotyping Techniques , Polymorphism, Single Nucleotide/genetics , Reproducibility of Results , Sequence Analysis, DNA , Glycine max/immunology
15.
Plant Biotechnol J ; 10(6): 733-42, 2012 Aug.
Article in English | MEDLINE | ID: mdl-22703335

ABSTRACT

Bread wheat, Triticum aestivum, is an allohexaploid composed of the three distinct ancestral genomes, A, B and D. The polyploid nature of the wheat genome together with its large size has limited our ability to generate the significant amount of sequence data required for whole genome studies. Even with the advent of next-generation sequencing technology, it is still relatively expensive to generate whole genome sequences for more than a few wheat genomes at any one time. To overcome this problem, we have developed a targeted-capture re-sequencing protocol based upon NimbleGen array technology to capture and characterize 56.5 Mb of genomic DNA with sequence similarity to over 100 000 transcripts from eight different UK allohexaploid wheat varieties. Using this procedure in conjunction with a carefully designed bioinformatic procedure, we have identified more than 500 000 putative single-nucleotide polymorphisms (SNPs). While 80% of these were variants between the homoeologous genomes, A, B and D, a significant number (20%) were putative varietal SNPs between the eight varieties studied. A small number of these latter polymorphisms were experimentally validated using KASPar technology and 94% proved to be genuine. The procedures described here to sequence a large proportion of the wheat genome, and the various SNPs identified should be of considerable use to the wider wheat community.


Subject(s)
Exome , Genome, Plant , Polymorphism, Single Nucleotide , Sequence Analysis, DNA , Triticum/genetics , Alleles , Polyploidy , Species Specificity
16.
Plant J ; 72(3): 390-9, 2012 Nov.
Article in English | MEDLINE | ID: mdl-22731681

ABSTRACT

A careful analysis of two maize recombinant inbred lines (RILs) relative to their inbred parents revealed the presence of several hundred apparently de novo copy number variants (CNVs). These changes in genome content were validated via both PCR and whole exome-array capture-and-sequencing experiments. One hundred and eighty-five genomic regions, which overlap with 38 high-confidence genes, exhibited apparently de novo copy number variation (CNV) in these two RILs and in many instances the same apparently de novo CNV events were observed in multiple RILs. Further analyses revealed that these recurrent apparently de novo CNVs were caused by segregation of single-copy homologous sequences that are located in non-allelic positions in the two parental inbred lines. F(1) individuals derived from these inbred lines will be hemizygous for each of these non-allelic homologs but RIL genotypes will contain these sequences at zero, one or two genomic loci. Hence, the segregation of non-allelic homologs may contribute to transgressive segregation. Indeed, statistical associations between phenotypic quantitative trait loci and genomic losses were observed for two of 14 tested pairs of non-allelic homologs.


Subject(s)
Chromosome Segregation/genetics , DNA Copy Number Variations/genetics , DNA, Plant/genetics , Genome, Plant/genetics , Zea mays/genetics , Chromosome Mapping , Comparative Genomic Hybridization , Exome , Exons , Gene Dosage/genetics , Genotype , Models, Genetic , Oligonucleotide Array Sequence Analysis , Phenotype , Quantitative Trait Loci
17.
Nature ; 479(7374): 534-7, 2011 Oct 30.
Article in English | MEDLINE | ID: mdl-22037309

ABSTRACT

Retrotransposons are mobile genetic elements that use a germline 'copy-and-paste' mechanism to spread throughout metazoan genomes. At least 50 per cent of the human genome is derived from retrotransposons, with three active families (L1, Alu and SVA) associated with insertional mutagenesis and disease. Epigenetic and post-transcriptional suppression block retrotransposition in somatic cells, excluding early embryo development and some malignancies. Recent reports of L1 expression and copy number variation in the human brain suggest that L1 mobilization may also occur during later development. However, the corresponding integration sites have not been mapped. Here we apply a high-throughput method to identify numerous L1, Alu and SVA germline mutations, as well as 7,743 putative somatic L1 insertions, in the hippocampus and caudate nucleus of three individuals. Surprisingly, we also found 13,692 somatic Alu insertions and 1,350 SVA insertions. Our results demonstrate that retrotransposons mobilize to protein-coding genes differentially expressed and active in the brain. Thus, somatic genome mosaicism driven by retrotransposition may reshape the genetic circuitry that underpins normal and abnormal neurobiological processes.


Subject(s)
Brain/metabolism , Germ-Line Mutation/genetics , Mutagenesis, Insertional/genetics , Retroelements/genetics , Alu Elements/genetics , Base Sequence/genetics , Caudate Nucleus/metabolism , Clonal Evolution/genetics , DNA Copy Number Variations/genetics , Epistasis, Genetic , Genome, Human/genetics , Hippocampus/metabolism , Histone Deacetylase 1/genetics , Humans , Mosaicism , Nerve Tissue Proteins/genetics , Organ Specificity/genetics , Polymerase Chain Reaction , Trans-Activators , Transcription Factors/genetics
18.
Nat Biotechnol ; 30(1): 99-104, 2011 Nov 13.
Article in English | MEDLINE | ID: mdl-22081020

ABSTRACT

Transcriptomic analyses have revealed an unexpected complexity to the human transcriptome, whose breadth and depth exceeds current RNA sequencing capability. Using tiling arrays to target and sequence select portions of the transcriptome, we identify and characterize unannotated transcripts whose rare or transient expression is below the detection limits of conventional sequencing approaches. We use the unprecedented depth of coverage afforded by this technique to reach the deepest limits of the human transcriptome, exposing widespread, regulated and remarkably complex noncoding transcription in intergenic regions, as well as unannotated exons and splicing patterns in even intensively studied protein-coding loci such as p53 and HOX. The data also show that intermittent sequenced reads observed in conventional RNA sequencing data sets, previously dismissed as noise, are in fact indicative of unassembled rare transcripts. Collectively, these results reveal the range, depth and complexity of a human transcriptome that is far from fully characterized.


Subject(s)
DNA, Intergenic/genetics , Open Reading Frames/genetics , RNA, Untranslated/genetics , Transcriptome/genetics , Exons/genetics , Gene Expression , Humans , Molecular Sequence Annotation , Sequence Analysis, RNA/methods
19.
Genome Biol ; 12(9): R86, 2011 Sep 14.
Article in English | MEDLINE | ID: mdl-21917142

ABSTRACT

We report the development and optimization of reagents for in-solution, hybridization-based capture of the mouse exome. By validating this approach in a multiple inbred strains and in novel mutant strains, we show that whole exome sequencing is a robust approach for discovery of putative mutations, irrespective of strain background. We found strong candidate mutations for the majority of mutant exomes sequenced, including new models of orofacial clefting, urogenital dysmorphology, kyphosis and autoimmune hepatitis.


Subject(s)
DNA Mutational Analysis/methods , Exome , Genomics/methods , Mutation , Animals , Chromosome Mapping , Chromosomes, Mammalian/genetics , Collagen Type II/genetics , Exons , Gene Frequency , Genotype , INDEL Mutation , Indicators and Reagents/standards , MAP Kinase Kinase Kinases/genetics , Mice , Mice, Inbred Strains , Phenotype , Mitogen-Activated Protein Kinase Kinase Kinase 11
20.
Plant Physiol ; 156(1): 240-53, 2011 May.
Article in English | MEDLINE | ID: mdl-21321255

ABSTRACT

Mutagenized populations have become indispensable resources for introducing variation and studying gene function in plant genomics research. In this study, fast neutron (FN) radiation was used to induce deletion mutations in the soybean (Glycine max) genome. Approximately 120,000 soybean seeds were exposed to FN radiation doses of up to 32 Gray units to develop over 23,000 independent M2 lines. Here, we demonstrate the utility of this population for phenotypic screening and associated genomic characterization of striking and agronomically important traits. Plant variation was cataloged for seed composition, maturity, morphology, pigmentation, and nodulation traits. Mutants that showed significant increases or decreases in seed protein and oil content across multiple generations and environments were identified. The application of comparative genomic hybridization (CGH) to lesion-induced mutants for deletion mapping was validated on a midoleate x-ray mutant, M23, with a known FAD2-1A (for fatty acid desaturase) gene deletion. Using CGH, a subset of mutants was characterized, revealing deletion regions and candidate genes associated with phenotypes of interest. Exome resequencing and sequencing of PCR products confirmed FN-induced deletions detected by CGH. Beyond characterization of soybean FN mutants, this study demonstrates the utility of CGH, exome sequence capture, and next-generation sequencing approaches for analyses of mutant plant genomes. We present this FN mutant soybean population as a valuable public resource for future genetic screens and functional genomics research.


Subject(s)
Comparative Genomic Hybridization/methods , Genome, Plant/genetics , Genomics , Glycine max/genetics , Plant Proteins/genetics , Exome/genetics , Fast Neutrons , High-Throughput Nucleotide Sequencing , Seeds/genetics , Sequence Analysis, DNA , Sequence Deletion
SELECTION OF CITATIONS
SEARCH DETAIL
...