Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
Add more filters










Publication year range
1.
Cancer Cell ; 41(4): 807-817.e6, 2023 04 10.
Article in English | MEDLINE | ID: mdl-37037617

ABSTRACT

Patients with short telomere syndromes (STS) are predisposed to developing cancer, believed to stem from chromosome instability in neoplastic cells. We tested this hypothesis in a large cohort assembled over the last 20 years. We found that the only solid cancers to which patients with STS are predisposed are squamous cell carcinomas of the head and neck, anus, or skin, a spectrum reminiscent of cancers seen in patients with immunodeficiency. Whole-genome sequencing showed no increase in chromosome instability, such as translocations or chromothripsis. Moreover, STS-associated cancers acquired telomere maintenance mechanisms, including telomerase reverse transcriptase (TERT) promoter mutations. A detailed study of the immune status of patients with STS revealed a striking T cell immunodeficiency at the time of cancer diagnosis. A similar immunodeficiency that impaired tumor surveillance was documented in mice with short telomeres. We conclude that STS patients' predisposition to solid cancers is due to T cell exhaustion rather than autonomous defects in the neoplastic cells themselves.


Subject(s)
Carcinoma, Squamous Cell , Telomerase , Animals , Mice , Telomere/genetics , Telomere/metabolism , Carcinoma, Squamous Cell/genetics , Chromosomal Instability , Mutation , Telomerase/genetics , Telomerase/metabolism , T-Lymphocytes/metabolism
2.
Nat Genet ; 54(9): 1376-1389, 2022 09.
Article in English | MEDLINE | ID: mdl-36050548

ABSTRACT

Acute lymphoblastic leukemia (ALL) is the most common childhood cancer. Here, using whole-genome, exome and transcriptome sequencing of 2,754 childhood patients with ALL, we find that, despite a generally low mutation burden, ALL cases harbor a median of four putative somatic driver alterations per sample, with 376 putative driver genes identified varying in prevalence across ALL subtypes. Most samples harbor at least one rare gene alteration, including 70 putative cancer driver genes associated with ubiquitination, SUMOylation, noncoding transcripts and other functions. In hyperdiploid B-ALL, chromosomal gains are acquired early and synchronously before ultraviolet-induced mutation. By contrast, ultraviolet-induced mutations precede chromosomal gains in B-ALL cases with intrachromosomal amplification of chromosome 21. We also demonstrate the prognostic significance of genetic alterations within subtypes. Intriguingly, DUX4- and KMT2A-rearranged subtypes separate into CEBPA/FLT3- or NFATC4-expressing subgroups with potential clinical implications. Together, these results deepen understanding of the ALL genomic landscape and associated outcomes.


Subject(s)
Precursor Cell Lymphoblastic Leukemia-Lymphoma , Child , Chromosome Aberrations , Exome/genetics , Genomics , Humans , Mutation , Precursor Cell Lymphoblastic Leukemia-Lymphoma/genetics
4.
Cancer Cell ; 39(1): 83-95.e4, 2021 01 11.
Article in English | MEDLINE | ID: mdl-33434514

ABSTRACT

GenomePaint (https://genomepaint.stjude.cloud/) is an interactive visualization platform for whole-genome, whole-exome, transcriptome, and epigenomic data of tumor samples. Its design captures the inter-relatedness between DNA variations and RNA expression, supporting in-depth exploration of both individual cancer genomes and full cohorts. Regulatory non-coding variants can be inspected and analyzed along with coding variants, and their functional impact further explored by examining 3D genome data from cancer cell lines. Further, GenomePaint correlates mutation and expression patterns with patient outcomes, and supports custom data upload. We used GenomePaint to unveil aberrant splicing that disrupts the RING domain of CREBBP, discover cis activation of the MYC oncogene by duplication of the NOTCH1-MYC enhancer in B-lineage acute lymphoblastic leukemia, and explore the inter- and intra-tumor heterogeneity at EGFR in adult glioblastomas. These examples demonstrate that deep multi-omics exploration of individual cancer genomes enabled by GenomePaint can lead to biological insights for follow-up validation.


Subject(s)
Computational Biology/methods , Gene Expression Profiling/methods , Genetic Variation , Neoplasms/genetics , Adult , Cell Line, Tumor , Child , Databases, Genetic , Epigenesis, Genetic , Female , Gene Expression Regulation, Neoplastic , Humans , Male , User-Computer Interface , Exome Sequencing , Whole Genome Sequencing
5.
Nucleic Acids Res ; 48(3): 1146-1163, 2020 02 20.
Article in English | MEDLINE | ID: mdl-31853540

ABSTRACT

Long Interspersed Element-1 (LINE-1) retrotransposition contributes to inter- and intra-individual genetic variation and occasionally can lead to human genetic disorders. Various strategies have been developed to identify human-specific LINE-1 (L1Hs) insertions from short-read whole genome sequencing (WGS) data; however, they have limitations in detecting insertions in complex repetitive genomic regions. Here, we developed a computational tool (PALMER) and used it to identify 203 non-reference L1Hs insertions in the NA12878 benchmark genome. Using PacBio long-read sequencing data, we identified L1Hs insertions that were absent in previous short-read studies (90/203). Approximately 81% (73/90) of the L1Hs insertions reside within endogenous LINE-1 sequences in the reference assembly and the analysis of unique breakpoint junction sequences revealed 63% (57/90) of these L1Hs insertions could be genotyped in 1000 Genomes Project sequences. Moreover, we observed that amplification biases encountered in single-cell WGS experiments led to a wide variation in L1Hs insertion detection rates between four individual NA12878 cells; under-amplification limited detection to 32% (65/203) of insertions, whereas over-amplification increased false positive calls. In sum, these data indicate that L1Hs insertions are often missed using standard short-read sequencing approaches and long-read sequencing approaches can significantly improve the detection of L1Hs insertions present in individual genomes.


Subject(s)
Long Interspersed Nucleotide Elements , Sequence Analysis, DNA/methods , Cell Line , Genome, Human , Humans , Polymorphism, Genetic , Single-Cell Analysis , Software , Whole Genome Sequencing
6.
Blood ; 135(1): 41-55, 2020 01 02.
Article in English | MEDLINE | ID: mdl-31697823

ABSTRACT

To study the mechanisms of relapse in acute lymphoblastic leukemia (ALL), we performed whole-genome sequencing of 103 diagnosis-relapse-germline trios and ultra-deep sequencing of 208 serial samples in 16 patients. Relapse-specific somatic alterations were enriched in 12 genes (NR3C1, NR3C2, TP53, NT5C2, FPGS, CREBBP, MSH2, MSH6, PMS2, WHSC1, PRPS1, and PRPS2) involved in drug response. Their prevalence was 17% in very early relapse (<9 months from diagnosis), 65% in early relapse (9-36 months), and 32% in late relapse (>36 months) groups. Convergent evolution, in which multiple subclones harbor mutations in the same drug resistance gene, was observed in 6 relapses and confirmed by single-cell sequencing in 1 case. Mathematical modeling and mutational signature analysis indicated that early relapse resistance acquisition was frequently a 2-step process in which a persistent clone survived initial therapy and later acquired bona fide resistance mutations during therapy. In contrast, very early relapses arose from preexisting resistant clone(s). Two novel relapse-specific mutational signatures, one of which was caused by thiopurine treatment based on in vitro drug exposure experiments, were identified in early and late relapses but were absent from 2540 pan-cancer diagnosis samples and 129 non-ALL relapses. The novel signatures were detected in 27% of relapsed ALLs and were responsible for 46% of acquired resistance mutations in NT5C2, PRPS1, NR3C1, and TP53. These results suggest that chemotherapy-induced drug resistance mutations facilitate a subset of pediatric ALL relapses.


Subject(s)
Biomarkers, Tumor/genetics , Methotrexate/therapeutic use , Mutagenesis/drug effects , Mutation , Precursor Cell Lymphoblastic Leukemia-Lymphoma/genetics , Precursor Cell Lymphoblastic Leukemia-Lymphoma/pathology , 5'-Nucleotidase/genetics , Antimetabolites, Antineoplastic/therapeutic use , Child , DNA Mutational Analysis , Female , Follow-Up Studies , Genomics , High-Throughput Nucleotide Sequencing , Humans , Male , Precursor Cell Lymphoblastic Leukemia-Lymphoma/drug therapy , Prognosis , Receptors, Glucocorticoid/genetics , Survival Rate , Tumor Suppressor Protein p53/genetics
7.
Nature ; 572(7767): 74-79, 2019 08.
Article in English | MEDLINE | ID: mdl-31341285

ABSTRACT

Medulloblastoma is a malignant childhood cerebellar tumour type that comprises distinct molecular subgroups. Whereas genomic characteristics of these subgroups are well defined, the extent to which cellular diversity underlies their divergent biology and clinical behaviour remains largely unexplored. Here we used single-cell transcriptomics to investigate intra- and intertumoral heterogeneity in 25 medulloblastomas spanning all molecular subgroups. WNT, SHH and Group 3 tumours comprised subgroup-specific undifferentiated and differentiated neuronal-like malignant populations, whereas Group 4 tumours consisted exclusively of differentiated neuronal-like neoplastic cells. SHH tumours closely resembled granule neurons of varying differentiation states that correlated with patient age. Group 3 and Group 4 tumours exhibited a developmental trajectory from primitive progenitor-like to more mature neuronal-like cells, the relative proportions of which distinguished these subgroups. Cross-species transcriptomics defined distinct glutamatergic populations as putative cells-of-origin for SHH and Group 4 subtypes. Collectively, these data provide insights into the cellular and developmental states underlying subtype-specific medulloblastoma biology.


Subject(s)
Genomics , Medulloblastoma/genetics , Medulloblastoma/pathology , Single-Cell Analysis , Transcriptome , Adolescent , Adult , Animals , Cell Lineage , Cerebellum/metabolism , Cerebellum/pathology , Child , Child, Preschool , DNA Copy Number Variations , Gene Expression Regulation, Neoplastic , Glutamic Acid/metabolism , Humans , Infant , Medulloblastoma/classification , Mice , Neurons/metabolism , Neurons/pathology
8.
Cell ; 177(4): 837-851.e28, 2019 05 02.
Article in English | MEDLINE | ID: mdl-30955886

ABSTRACT

L1 retrotransposon-derived sequences comprise approximately 17% of the human genome. Darwinian selective pressures alter L1 genomic distributions during evolution, confounding the ability to determine initial L1 integration preferences. Here, we generated high-confidence datasets of greater than 88,000 engineered L1 insertions in human cell lines that act as proxies for cells that accommodate retrotransposition in vivo. Comparing these insertions to a null model, in which L1 endonuclease activity is the sole determinant dictating L1 integration preferences, demonstrated that L1 insertions are not significantly enriched in genes, transcribed regions, or open chromatin. By comparison, we provide compelling evidence that the L1 endonuclease disproportionately cleaves predominant lagging strand DNA replication templates, while lagging strand 3'-hydroxyl groups may prime endonuclease-independent L1 retrotransposition in a Fanconi anemia cell line. Thus, acquisition of an endonuclease domain, in conjunction with the ability to integrate into replicating DNA, allowed L1 to become an autonomous, interspersed retrotransposon.


Subject(s)
Long Interspersed Nucleotide Elements/genetics , Retroelements/genetics , Cell Line , Endonucleases/genetics , Endonucleases/metabolism , Genome, Human/genetics , Genome-Wide Association Study/methods , Genomics , HeLa Cells , Humans , Mutagenesis, Insertional/genetics
9.
Genome Biol ; 20(1): 50, 2019 03 14.
Article in English | MEDLINE | ID: mdl-30867008

ABSTRACT

BACKGROUND: Sequencing errors are key confounding factors for detecting low-frequency genetic variants that are important for cancer molecular diagnosis, treatment, and surveillance using deep next-generation sequencing (NGS). However, there is a lack of comprehensive understanding of errors introduced at various steps of a conventional NGS workflow, such as sample handling, library preparation, PCR enrichment, and sequencing. In this study, we use current NGS technology to systematically investigate these questions. RESULTS: By evaluating read-specific error distributions, we discover that the substitution error rate can be computationally suppressed to 10-5 to 10-4, which is 10- to 100-fold lower than generally considered achievable (10-3) in the current literature. We then quantify substitution errors attributable to sample handling, library preparation, enrichment PCR, and sequencing by using multiple deep sequencing datasets. We find that error rates differ by nucleotide substitution types, ranging from 10-5 for A>C/T>G, C>A/G>T, and C>G/G>C changes to 10-4 for A>G/T>C changes. Furthermore, C>T/G>A errors exhibit strong sequence context dependency, sample-specific effects dominate elevated C>A/G>T errors, and target-enrichment PCR led to ~ 6-fold increase of overall error rate. We also find that more than 70% of hotspot variants can be detected at 0.1 ~ 0.01% frequency with the current NGS technology by applying in silico error suppression. CONCLUSIONS: We present the first comprehensive analysis of sequencing error sources in conventional NGS workflows. The error profiles revealed by our study highlight new directions for further improving NGS analysis accuracy both experimentally and computationally, ultimately enhancing the precision of deep sequencing.


Subject(s)
High-Throughput Nucleotide Sequencing/standards , Neoplasms/genetics , Polymerase Chain Reaction/standards , Sequence Analysis, DNA/standards , Software , Case-Control Studies , Humans , Mutation , Quality Control
10.
Science ; 356(6336)2017 04 28.
Article in English | MEDLINE | ID: mdl-28450582

ABSTRACT

Neuropsychiatric disorders have a complex genetic architecture. Human genetic population-based studies have identified numerous heritable sequence and structural genomic variants associated with susceptibility to neuropsychiatric disease. However, these germline variants do not fully account for disease risk. During brain development, progenitor cells undergo billions of cell divisions to generate the ~80 billion neurons in the brain. The failure to accurately repair DNA damage arising during replication, transcription, and cellular metabolism amid this dramatic cellular expansion can lead to somatic mutations. Somatic mutations that alter subsets of neuronal transcriptomes and proteomes can, in turn, affect cell proliferation and survival and lead to neurodevelopmental disorders. The long life span of individual neurons and the direct relationship between neural circuits and behavior suggest that somatic mutations in small populations of neurons can significantly affect individual neurodevelopment. The Brain Somatic Mosaicism Network has been founded to study somatic mosaicism both in neurotypical human brains and in the context of complex neuropsychiatric disorders.


Subject(s)
Brain/abnormalities , Mental Disorders/genetics , Mosaicism , Nervous System Diseases/genetics , Neural Stem Cells/physiology , Neurons/physiology , Brain/metabolism , Cell Division/genetics , DNA Damage , DNA Mutational Analysis/methods , DNA Repair/genetics , DNA Replication , Genome, Human , Germ Cells/metabolism , Humans , Nerve Net/growth & development , Nerve Net/metabolism , Neural Stem Cells/metabolism , Neurons/metabolism
11.
Methods Mol Biol ; 1400: 339-55, 2016.
Article in English | MEDLINE | ID: mdl-26895063

ABSTRACT

Long INterspersed Element-1 (LINE-1 or L1) retrotransposons encode two proteins (ORF1p and ORF2p) that are required for retrotransposition. The L1 element amplification protocol (LEAP) assays the ability of L1 ORF2p to reverse transcribe L1 RNA in vitro. Ultracentrifugation or immunoprecipitation is used to isolate L1 ribonucleoprotein particle (RNP) complexes from cultured human cells transfected with an engineered L1 expression construct. The isolated RNPs are incubated with an oligonucleotide that contains a unique sequence at its 5' end and a thymidine-rich sequence at its 3' end. The addition of dNTPs to the reaction allows L1 ORF2p bound to L1 RNA to generate L1 cDNA. The resultant L1 cDNAs then are amplified using polymerase chain reaction (PCR) and the products are visualized by gel electrophoresis. Sequencing the resultant PCR products then allows product verification. The LEAP assay has been instrumental in determining how mutations in L1 ORF1p and ORF2p affect L1 reverse transcriptase (RT) activity. Furthermore, the LEAP assay has revealed that the L1 ORF2p RT can extend a DNA primer with mismatched 3' terminal bases when it is annealed to an L1 RNA template. As the LINE-1 biology field gravitates toward studying cellular proteins that regulate LINE-1, molecular genetic and biochemical approaches such as LEAP, in conjunction with the LINE-1-cultured cell retrotransposition assay, are essential to dissect the molecular mechanism of L1 retrotransposition.


Subject(s)
Long Interspersed Nucleotide Elements , Polymerase Chain Reaction , HeLa Cells , Humans , Immunoprecipitation/methods , Open Reading Frames , Polymerase Chain Reaction/methods , Ribonucleoproteins/genetics , Ribonucleoproteins/metabolism , Ribonucleoproteins/poisoning , Ultracentrifugation/methods
12.
Mob DNA ; 5: 26, 2014.
Article in English | MEDLINE | ID: mdl-30117500

ABSTRACT

The Mobile Genetic Elements and Genome Evolution conference was hosted by Keystone Symposia in Santa Fe, NM USA, 9 March through 14 March 2014. The goal of this conference was to bring together scientists from around the world who study transposable elements in diverse organisms and researchers who study the impact these elements have on genome evolution. The meeting included over 200 scientists who participated through poster presentations, short talks selected from abstracts, and invited speakers. The talks were organized into eight sessions and two workshops. The topics varied from diverse mechanisms of mobilization to the evolution of genomes and their defense strategies against transposable elements.

13.
J Bacteriol ; 194(22): 6056-65, 2012 Nov.
Article in English | MEDLINE | ID: mdl-22961847

ABSTRACT

Nitrosomonas europaea is a chemolithoautotroph that obtains energy by oxidizing ammonia in the presence of oxygen and fixes CO(2) via the Benson-Calvin cycle. Despite its environmental and evolutionary importance, very little is known about the regulation and metabolism of glycogen, a source of carbon and energy storage. Here, we cloned and heterologously expressed the genes coding for two major putative enzymes of the glycogen synthetic pathway in N. europaea, ADP-glucose pyrophosphorylase and glycogen synthase. In other bacteria, ADP-glucose pyrophosphorylase catalyzes the regulatory step of the synthetic pathway and glycogen synthase elongates the polymer. In starch synthesis in plants, homologous enzymes play similar roles. We purified to homogeneity the recombinant ADP-glucose pyrophosphorylase from N. europaea and characterized its kinetic, regulatory, and oligomeric properties. The enzyme was allosterically activated by pyruvate, oxaloacetate, and phosphoenolpyruvate and inhibited by AMP. It had a broad thermal and pH stability and used different divalent metal ions as cofactors. Depending on the cofactor, the enzyme was able to accept different nucleotides and sugar phosphates as alternative substrates. However, characterization of the recombinant glycogen synthase showed that only ADP-Glc elongates the polysaccharide, indicating that ATP and glucose-1-phosphate are the physiological substrates of the ADP-glucose pyrophosphorylase. The distinctive properties with respect to selectivity for substrates and activators of the ADP-glucose pyrophosphorylase were in good agreement with the metabolic routes operating in N. europaea, indicating an evolutionary adaptation. These unique properties place the enzyme in a category of its own within the family, highlighting the unique regulation in these organisms.


Subject(s)
Gene Expression Regulation, Bacterial/physiology , Glucose-1-Phosphate Adenylyltransferase/metabolism , Glycogen/metabolism , Nitrosomonas europaea/enzymology , Nitrosomonas europaea/metabolism , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Chemoautotrophic Growth , Cloning, Molecular , Enzyme Stability , Gene Expression Regulation, Enzymologic , Glucose-1-Phosphate Adenylyltransferase/genetics , Hydrogen-Ion Concentration , Kinetics , Metals/metabolism , Temperature
14.
In Silico Biol ; 8(5-6): 531-43, 2008.
Article in English | MEDLINE | ID: mdl-19374136

ABSTRACT

SIRE1 is a 2000-copy member of the Ty1/copia retroelement family found in the soybean genome and is closely related to sireviruses found in the genomes of other legumes. Although these elements closely resemble typical plant members of the Ty1/copia family, they are unusual in that they possess an envelope-like coding region immediately downstream of the reverse transcriptase gene. Despite its copy number, very few members of the SIRE1 family are currently present in publicly available genomic assemblies or draft contigs. However, fragments of family members are well-represented as BAC-ends in the GenBank Genome Survey Sequence database. This database was queried using the 5' and 3' ends of SIRE1 in order to catalog sequences into which SIRE1 members have integrated. Seven hundred and eighty-one unique SIRE1 insertions were identified and the majority of insertion sites constituted other repetitive elements, including Class I and Class II transposable elements and satellite DNAs. Ninety-four insertions were in single- or low-copy number sequences and three of these were homologous to characterized protein-coding genes. Examination of the ten bases flanking either side of SIRE1 revealed no clear consensus sequence, but the the distributions of A, C, G, and T at most of the positions were biased with strong statistical significance.


Subject(s)
DNA, Intergenic/genetics , Databases, Nucleic Acid , Retroelements/genetics , Sequence Analysis, DNA/methods , Glycine/genetics , Glycine/metabolism , Multigene Family/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...