Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 16 de 16
Filter
Add more filters










Publication year range
1.
Cancer Res Commun ; 2024 May 09.
Article in English | MEDLINE | ID: mdl-38722600

ABSTRACT

Immune-checkpoint therapy (ICB) has conferred significant and durable clinical benefit to some cancer patients. However, most patients do not respond to ICB, and reliable biomarkers of ICB response are needed to improve patient stratification. Here, we performed a transcriptome-wide meta-analysis across 1,486 tumors from ICB-treated patients and tumors with expected ICB outcomes based on microsatellite status. Using a robust transcriptome deconvolution approach, we inferred cancer and stroma-specific gene expression differences and identified cell-type specific features of ICB response across cancer types. Consistent with current knowledge, stromal expression of CXCL9, CXCL13, and IFNG were the top determinants of favorable ICB response. In addition, we identified a group of potential immune-suppressive genes, including FCER1A, associated with poor response to ICB. Strikingly, PD-L1 expression in stromal cells, but not cancer cells, is correlated with ICB response across cancer types. Furthermore, the unbiased transcriptome-wide analysis failed to identify cancer-cell intrinsic expression signatures of ICB response conserved across tumor types, suggesting that cancer cells lack tissue-agnostic transcriptomic features of ICB response.

2.
Genome Res ; 2024 May 22.
Article in English | MEDLINE | ID: mdl-38777607

ABSTRACT

Gastric cancer (GC) is the fifth most common cancer worldwide and is a heterogeneous disease. Among GC subtypes, the mesenchymal phenotype (Mes-like) is more invasive than the epithelial phenotype (Epi-like). While gene expression of the epithelial-to-mesenchymal transition (EMT) has been studied, the regulatory landscape shaping this process is not fully understood. Here we use ATAC-seq and RNA-seq from a compendium of gastric cancer cell lines and primary tumors to detect drivers of regulatory state changes and their transcriptional responses. Using the ATAC-seq, we developed a machine learning approach to determine the transcription factors (TFs) regulating the subtypes of GC. We identified TFs driving the mesenchymal (RUNX2, ZEB1, SNAI2, AP-1 dimer) as well as the epithelial states (GATA4, GATA6, KLF5, HNF4A, FOXA2, GRHL2) in gastric cancer. We identified DNA copy number alterations associated with dysregulation of these TFs, specifically deletion of GATA4 and amplification of MAPK9 Comparisons with bulk and single-cell RNA-seq datasets identified activation toward fibroblast-like epigenomic and expression signatures in Mes-like GC. The activation of this mesenchymal fibrotic program is associated with differentially accessible DNA cis-regulatory elements flanking up-regulated mesenchymal genes. These findings establish a map of TF activity in GC and highlight the role of copy number driven alterations in shaping epigenomic regulatory programs as potential drivers of gastric cancer heterogeneity and progression.

3.
Gut ; 72(9): 1651-1663, 2023 09.
Article in English | MEDLINE | ID: mdl-36918265

ABSTRACT

OBJECTIVE: Gastric cancer (GC) is a leading cause of cancer mortality, with ARID1A being the second most frequently mutated driver gene in GC. We sought to decipher ARID1A-specific GC regulatory networks and examine therapeutic vulnerabilities arising from ARID1A loss. DESIGN: Genomic profiling of GC patients including a Singapore cohort (>200 patients) was performed to derive mutational signatures of ARID1A inactivation across molecular subtypes. Single-cell transcriptomic profiles of ARID1A-mutated GCs were analysed to examine tumour microenvironmental changes arising from ARID1A loss. Genome-wide ARID1A binding and chromatin profiles (H3K27ac, H3K4me3, H3K4me1, ATAC-seq) were generated to identify gastric-specific epigenetic landscapes regulated by ARID1A. Distinct cancer hallmarks of ARID1A-mutated GCs were converged at the genomic, single-cell and epigenomic level, and targeted by pharmacological inhibition. RESULTS: We observed prevalent ARID1A inactivation across GC molecular subtypes, with distinct mutational signatures and linked to a NFKB-driven proinflammatory tumour microenvironment. ARID1A-depletion caused loss of H3K27ac activation signals at ARID1A-occupied distal enhancers, but unexpectedly gain of H3K27ac at ARID1A-occupied promoters in genes such as NFKB1 and NFKB2. Promoter activation in ARID1A-mutated GCs was associated with enhanced gene expression, increased BRD4 binding, and reduced HDAC1 and CTCF occupancy. Combined targeting of promoter activation and tumour inflammation via bromodomain and NFKB inhibitors confirmed therapeutic synergy specific to ARID1A-genomic status. CONCLUSION: Our results suggest a therapeutic strategy for ARID1A-mutated GCs targeting both tumour-intrinsic (BRD4-assocatiated promoter activation) and extrinsic (NFKB immunomodulation) cancer phenotypes.


Subject(s)
Stomach Neoplasms , Transcription Factors , Humans , Transcription Factors/genetics , Transcription Factors/metabolism , Stomach Neoplasms/genetics , Stomach Neoplasms/therapy , Stomach Neoplasms/pathology , Nuclear Proteins/genetics , Epigenomics , Mutation , Tumor Microenvironment/genetics , DNA-Binding Proteins/genetics , Cell Cycle Proteins/genetics
4.
Gut ; 72(2): 226-241, 2023 02.
Article in English | MEDLINE | ID: mdl-35817555

ABSTRACT

OBJECTIVE: Gastric cancer (GC) comprises multiple molecular subtypes. Recent studies have highlighted mesenchymal-subtype GC (Mes-GC) as a clinically aggressive subtype with few treatment options. Combining multiple studies, we derived and applied a consensus Mes-GC classifier to define the Mes-GC enhancer landscape revealing disease vulnerabilities. DESIGN: Transcriptomic profiles of ~1000 primary GCs and cell lines were analysed to derive a consensus Mes-GC classifier. Clinical and genomic associations were performed across >1200 patients with GC. Genome-wide epigenomic profiles (H3K27ac, H3K4me1 and assay for transposase-accessible chromatin with sequencing (ATAC-seq)) of 49 primary GCs and GC cell lines were generated to identify Mes-GC-specific enhancer landscapes. Upstream regulators and downstream targets of Mes-GC enhancers were interrogated using chromatin immunoprecipitation followed by sequencing (ChIP-seq), RNA sequencing, CRISPR/Cas9 editing, functional assays and pharmacological inhibition. RESULTS: We identified and validated a 993-gene cancer-cell intrinsic Mes-GC classifier applicable to retrospective cohorts or prospective single samples. Multicohort analysis of Mes-GCs confirmed associations with poor patient survival, therapy resistance and few targetable genomic alterations. Analysis of enhancer profiles revealed a distinctive Mes-GC epigenomic landscape, with TEAD1 as a master regulator of Mes-GC enhancers and Mes-GCs exhibiting preferential sensitivity to TEAD1 pharmacological inhibition. Analysis of Mes-GC super-enhancers also highlighted NUAK1 kinase as a downstream target, with synergistic effects observed between NUAK1 inhibition and cisplatin treatment. CONCLUSION: Our results establish a consensus Mes-GC classifier applicable to multiple transcriptomic scenarios. Mes-GCs exhibit a distinct epigenomic landscape, and TEAD1 inhibition and combinatorial NUAK1 inhibition/cisplatin may represent potential targetable options.


Subject(s)
Enhancer Elements, Genetic , Epigenesis, Genetic , Gene Expression Regulation, Neoplastic , Stomach Neoplasms , Humans , Cisplatin/metabolism , Cisplatin/therapeutic use , Prospective Studies , Protein Kinases/genetics , Repressor Proteins , Retrospective Studies , Stomach Neoplasms/genetics
5.
JTO Clin Res Rep ; 3(12): 100416, 2022 Dec.
Article in English | MEDLINE | ID: mdl-36426287

ABSTRACT

Introduction: Although immune checkpoint inhibitors (ICIs) have dramatically improved outcomes for nononcogene-addicted NSCLC, monotherapy with programmed cell death protein-1 (PD1) inhibition has been associated with low efficacy in the EGFR-mutant setting. Given the potential for synergism with combination checkpoint blockade, we designed a trial to test the activity of combination nivolumab (N)-ipilimumab (NI) in EGFR-mutant NSCLC. Methods: This is a randomized phase 2 study (NCT03091491) of N versus NI combination in EGFR tyrosine kinase inhibitor (TKI)-resistant NSCLC, with crossover permitted on disease progression. The primary end point was the objective response rate, and the secondary end points included progression-free survival, overall survival, and safety of ICI after EGFR TKI. Results: Recruitment ceased owing to futility after 31 of 184 planned patients were treated. A total of 15 patients received N and 16 received NI combination. There were 16 patients (51.6%) who had programmed death-ligand (PDL1) 1 greater than or equal to 1%, and 15 (45.2%) harbored EGFR T790M. Five patients derived clinical benefits from ICI with one objective response (objective response rate 3.2%), and median progression-free survival was 1.22 months (95% confidence interval: 1.15-1.35) for the overall cohort. None of the four patients who crossed over achieved salvage response by NI. PDL1 and tumor mutational burden (TMB) were not able to predict ICI response. Rates of all grade immune-related adverse events were similar (80% versus 75%), with only two grade 3 events. Conclusions: Immune checkpoint inhibition is ineffective in EGFR TKI-resistant NSCLC. Whereas a small subgroup of EGFR-mutant NSCLC may be immunogenic and responsive to ICI, better biomarkers are needed to select appropriate patients.

6.
Nat Genet ; 54(7): 963-975, 2022 07.
Article in English | MEDLINE | ID: mdl-35773407

ABSTRACT

The consensus molecular subtype (CMS) classification of colorectal cancer is based on bulk transcriptomics. The underlying epithelial cell diversity remains unclear. We analyzed 373,058 single-cell transcriptomes from 63 patients, focusing on 49,155 epithelial cells. We identified a pervasive genetic and transcriptomic dichotomy of malignant cells, based on distinct gene expression, DNA copy number and gene regulatory network. We recapitulated these subtypes in bulk transcriptomes from 3,614 patients. The two intrinsic subtypes, iCMS2 and iCMS3, refine CMS. iCMS3 comprises microsatellite unstable (MSI-H) cancers and one-third of microsatellite-stable (MSS) tumors. iCMS3 MSS cancers are transcriptomically more similar to MSI-H cancers than to other MSS cancers. CMS4 cancers had either iCMS2 or iCMS3 epithelium; the latter had the worst prognosis. We defined the intrinsic epithelial axis of colorectal cancer and propose a refined 'IMF' classification with five subtypes, combining intrinsic epithelial subtype (I), microsatellite instability status (M) and fibrosis (F).


Subject(s)
Colorectal Neoplasms , Neoplasms, Glandular and Epithelial , Colorectal Neoplasms/genetics , Colorectal Neoplasms/pathology , Epithelial Cells/pathology , Humans , Microsatellite Instability , Microsatellite Repeats/genetics , Neoplasms, Glandular and Epithelial/genetics , Transcriptome/genetics
7.
Int J Cancer ; 151(3): 435-449, 2022 08 01.
Article in English | MEDLINE | ID: mdl-35415893

ABSTRACT

Pancreatic ductal adenocarcinoma (PDAC) is a highly lethal and debilitating disease with limited therapeutic options. The aim of this clinical study was to evaluate the safety, efficacy and pharmacokinetics of a novel regimen comprised of metronomic oxaliplatin (O), chronomodulated capecitabine (X) and UGT1A1 genotype-guided dosing of irinotecan (IRI) [OXIRI] as well as its immunomodulatory effects. Thirty-six patients were enrolled into either dose-escalation or expansion cohorts. In the dose escalation phase, capecitabine doses (2000, 2650, 3500 and 4500 mg/day) were administered at midnight on days 1 to 14 while oxaliplatin and irinotecan were intravenously infused at fixed doses of 50 and 75 mg/m2 respectively on days 1, 8 in a 21-day cycle. The maximum tolerated dose of capecitabine was 2650 mg/day and the most common grade 3 adverse events were neutropenia (30.6%) and diarrhea (13.9%). No grade 4 toxicity was observed. UGT1A1-genotype directed dosing resulted in similar exposure levels of irinotecan, SN-38 and SN-38G in all patients. Objective response rate was 22.2%. Median overall survival and progression-free survival were 8.1 and 5.2 months, respectively. Exploratory immunoprofiling by flow cytometry and quantitative spatial localization analysis of infiltrated immune cells performed on biopsy and plasma samples revealed significant declines in CCL22, CCL2 and TNFα levels at end of first cycle and an active host immune response. Our study showed that OXIRI was well-tolerated and exhibited good efficacy, with immunomodulatory effects. It may be considered as an alternative to FOLFIRINOX in patients intolerant to the latter.


Subject(s)
Adenocarcinoma , Pancreatic Neoplasms , Adenocarcinoma/pathology , Antineoplastic Combined Chemotherapy Protocols/adverse effects , Camptothecin , Capecitabine/therapeutic use , Fluorouracil/therapeutic use , Humans , Immunity , Irinotecan , Oxaliplatin , Pancreatic Neoplasms/pathology , Treatment Outcome , Pancreatic Neoplasms
8.
Gut ; 71(7): 1277-1288, 2022 07.
Article in English | MEDLINE | ID: mdl-34433583

ABSTRACT

OBJECTIVES: Epigenomic alterations in cancer interact with the immune microenvironment to dictate tumour evolution and therapeutic response. We aimed to study the regulation of the tumour immune microenvironment through epigenetic alternate promoter use in gastric cancer and to expand our findings to other gastrointestinal tumours. DESIGN: Alternate promoter burden (APB) was quantified using a novel bioinformatic algorithm (proActiv) to infer promoter activity from short-read RNA sequencing and samples categorised into APBhigh, APBint and APBlow. Single-cell RNA sequencing was performed to analyse the intratumour immune microenvironment. A humanised mouse cancer in vivo model was used to explore dynamic temporal interactions between tumour kinetics, alternate promoter usage and the human immune system. Multiple cohorts of gastrointestinal tumours treated with immunotherapy were assessed for correlation between APB and treatment outcomes. RESULTS: APBhigh gastric cancer tumours expressed decreased levels of T-cell cytolytic activity and exhibited signatures of immune depletion. Single-cell RNAsequencing analysis confirmed distinct immunological populations and lower T-cell proportions in APBhigh tumours. Functional in vivo studies using 'humanised mice' harbouring an active human immune system revealed distinct temporal relationships between APB and tumour growth, with APBhigh tumours having almost no human T-cell infiltration. Analysis of immunotherapy-treated patients with GI cancer confirmed resistance of APBhigh tumours to immune checkpoint inhibition. APBhigh gastric cancer exhibited significantly poorer progression-free survival compared with APBlow (median 55 days vs 121 days, HR 0.40, 95% CI 0.18 to 0.93, p=0.032). CONCLUSION: These findings demonstrate an association between alternate promoter use and the tumour microenvironment, leading to immune evasion and immunotherapy resistance.


Subject(s)
Gastrointestinal Neoplasms , Stomach Neoplasms , Animals , Epigenesis, Genetic , Epigenomics , Gastrointestinal Neoplasms/genetics , Gastrointestinal Neoplasms/therapy , Humans , Immune Checkpoint Inhibitors/pharmacology , Immune Checkpoint Inhibitors/therapeutic use , Immunotherapy , Mice , Stomach Neoplasms/drug therapy , Stomach Neoplasms/therapy , Tumor Microenvironment
9.
Genome Med ; 13(1): 158, 2021 10 11.
Article in English | MEDLINE | ID: mdl-34635154

ABSTRACT

BACKGROUND: Enhancers are distal cis-regulatory elements required for cell-specific gene expression and cell fate determination. In cancer, enhancer variation has been proposed as a major cause of inter-patient heterogeneity-however, most predicted enhancer regions remain to be functionally tested. METHODS: We analyzed 132 epigenomic histone modification profiles of 18 primary gastric cancer (GC) samples, 18 normal gastric tissues, and 28 GC cell lines using Nano-ChIP-seq technology. We applied Capture-based Self-Transcribing Active Regulatory Region sequencing (CapSTARR-seq) to assess functional enhancer activity. An Activity-by-contact (ABC) model was employed to explore the effects of histone acetylation and CapSTARR-seq levels on enhancer-promoter interactions. RESULTS: We report a comprehensive catalog of 75,730 recurrent predicted enhancers, the majority of which are GC-associated in vivo (> 50,000) and associated with lower somatic mutation rates inferred by whole-genome sequencing. Applying CapSTARR-seq to the enhancer catalog, we observed significant correlations between CapSTARR-seq functional activity and H3K27ac/H3K4me1 levels. Super-enhancer regions exhibited increased CapSTARR-seq signals compared to regular enhancers, even when decoupled from native chromatin contexture. We show that combining histone modification and CapSTARR-seq functional enhancer data improves the prediction of enhancer-promoter interactions and pinpointing of germline single nucleotide polymorphisms (SNPs), somatic copy number alterations (SCNAs), and trans-acting TFs involved in GC expression. We identified cancer-relevant genes (ING1, ARL4C) whose expression between patients is influenced by enhancer differences in genomic copy number and germline SNPs, and HNF4α as a master trans-acting factor associated with GC enhancer heterogeneity. CONCLUSIONS: Our results indicate that combining histone modification and functional assay data may provide a more accurate metric to assess enhancer activity than either platform individually, providing insights into the relative contribution of genetic (cis) and regulatory (trans) mechanisms to GC enhancer functional heterogeneity.


Subject(s)
Enhancer Elements, Genetic , Epigenomics , Stomach Neoplasms/genetics , ADP-Ribosylation Factors/genetics , ADP-Ribosylation Factors/metabolism , Acetylation , Cell Line, Tumor , Cell Proliferation , Chromatin , Gene Expression Regulation, Neoplastic , Genomics , Histones/metabolism , Humans , Inhibitor of Growth Protein 1/genetics , Inhibitor of Growth Protein 1/metabolism , Oncogenes , Promoter Regions, Genetic , RNA-Seq , Transcriptome , Whole Genome Sequencing
10.
Cancer Res ; 81(7): 1802-1812, 2021 04 01.
Article in English | MEDLINE | ID: mdl-33547160

ABSTRACT

Signaling between cancer and nonmalignant (stromal) cells in the tumor microenvironment (TME) is a key to tumor progression. Here, we deconvoluted bulk tumor transcriptomes to infer cross-talk between ligands and receptors on cancer and stromal cells in the TME of 20 solid tumor types. This approach recovered known transcriptional hallmarks of cancer and stromal cells and was concordant with single-cell, in situ hybridization and IHC data. Inferred autocrine cancer cell interactions varied between tissues but often converged on Ephrin, BMP, and FGFR-signaling pathways. Analysis of immune checkpoints nominated interactions with high levels of cancer-to-immune cross-talk across distinct tumor types. Strikingly, PD-L1 was found to be highly expressed in stromal rather than cancer cells. Overall, our study presents a new resource for hypothesis generation and exploration of cross-talk in the TME. SIGNIFICANCE: This study provides deconvoluted bulk tumor transcriptomes across multiple cancer types to infer cross-talk in the tumor microenvironment.


Subject(s)
Neoplasms , Receptor Cross-Talk/physiology , Tumor Microenvironment , Autocrine Communication/physiology , Cell Communication/genetics , Computational Biology , Datasets as Topic , Female , Genomics/methods , Humans , Ligands , Male , Neoplasms/genetics , Neoplasms/metabolism , Neoplasms/pathology , Receptors, Cytoplasmic and Nuclear/physiology , Tumor Microenvironment/genetics , Exome Sequencing
11.
NPJ Genom Med ; 5: 26, 2020.
Article in English | MEDLINE | ID: mdl-32550006

ABSTRACT

Recurrence and clustering of somatic mutations (hotspots) in cancer genomes may indicate positive selection and involvement in tumorigenesis. MutSpot performs genome-wide inference of mutation hotspots in non-coding and regulatory DNA of cancer genomes. MutSpot performs feature selection across hundreds of epigenetic and sequence features followed by estimation of position- and patient-specific background somatic mutation probabilities. MutSpot is user-friendly, works on a standard workstation, and scales to thousands of cancer genomes.

12.
J Clin Invest ; 130(6): 3005-3020, 2020 06 01.
Article in English | MEDLINE | ID: mdl-32364535

ABSTRACT

Transcriptional reactivation of telomerase catalytic subunit (TERT) is a frequent hallmark of cancer, occurring in 90% of human malignancies. However, specific mechanisms driving TERT reactivation remain obscure for many tumor types and in particular gastric cancer (GC), a leading cause of global cancer mortality. Here, through comprehensive genomic and epigenomic analysis of primary GCs and GC cell lines, we identified the transcription factor early B cell factor 1 (EBF1) as a TERT transcriptional repressor and inactivation of EBF1 function as a major cause of TERT upregulation. Abolishment of EBF1 function occurs through 3 distinct (epi)genomic mechanisms. First, EBF1 is epigenetically silenced via DNA methyltransferase, polycomb-repressive complex 2 (PRC2), and histone deacetylase activity in GCs. Second, recurrent, somatic, and heterozygous EBF1 DNA-binding domain mutations result in the production of dominant-negative EBF1 isoforms. Third, more rarely, genomic deletions and rearrangements proximal to the TERT promoter remobilize or abolish EBF1-binding sites, derepressing TERT and leading to high TERT expression. EBF1 is also functionally required for various malignant phenotypes in vitro and in vivo, highlighting its importance for GC development. These results indicate that multimodal genomic and epigenomic alterations underpin TERT reactivation in GC, converging on transcriptional repressors such as EBF1.


Subject(s)
Epigenomics , Gene Expression Regulation, Enzymologic , Gene Expression Regulation, Neoplastic , Neoplasm Proteins/metabolism , Stomach Neoplasms/metabolism , Telomerase/biosynthesis , Trans-Activators/metabolism , Cell Line, Tumor , Humans , Mutation , Neoplasm Proteins/genetics , Response Elements , Stomach Neoplasms/genetics , Telomerase/genetics , Trans-Activators/genetics
13.
Methods Mol Biol ; 2120: 37-46, 2020.
Article in English | MEDLINE | ID: mdl-32124310

ABSTRACT

Identification of somatic mutations in tumor tissue is challenged by both technical artifacts, diverse somatic mutational processes, and genetic heterogeneity in the tumors. Indeed, recent independent benchmark studies have revealed low concordance between different somatic mutation callers. Here, we describe Somatic Mutation calling method using a Random Forest (SMuRF), a portable ensemble method that combines the predictions and auxiliary features from individual mutation callers using supervised machine learning. SMuRF has improved prediction accuracy for both somatic point mutations (single nucleotide variants; SNVs) and small insertions/deletions (indels) in cancer genomes and exomes. Here, we describe the method and provide a tutorial on the installation and application of SMuRF.


Subject(s)
Genomics/methods , Mutation , Neoplasms/genetics , Software , Supervised Machine Learning , Genome, Human , Humans , INDEL Mutation , Point Mutation , Polymorphism, Single Nucleotide
14.
Nat Commun ; 10(1): 1405, 2019 03 29.
Article in English | MEDLINE | ID: mdl-30926812

ABSTRACT

Lung adenocarcinoma (LUAD)-derived Wnts increase cancer cell proliferative/stemness potential, but whether they impact the immune microenvironment is unknown. Here we show that LUAD cells use paracrine Wnt1 signaling to induce immune resistance. In TCGA, Wnt1 correlates strongly with tolerogenic genes. In another LUAD cohort, Wnt1 inversely associates with T cell abundance. Altering Wnt1 expression profoundly affects growth of murine lung adenocarcinomas and this is dependent on conventional dendritic cells (cDCs) and T cells. Mechanistically, Wnt1 leads to transcriptional silencing of CC/CXC chemokines in cDCs, T cell exclusion and cross-tolerance. Wnt-target genes are up-regulated in human intratumoral cDCs and decrease upon silencing Wnt1, accompanied by enhanced T cell cytotoxicity. siWnt1-nanoparticles given as single therapy or part of combinatorial immunotherapies act at both arms of the cancer-immune ecosystem to halt tumor growth. Collectively, our studies show that Wnt1 induces immunologically cold tumors through cDCs and highlight its immunotherapeutic targeting.


Subject(s)
Adaptive Immunity , Adenocarcinoma of Lung/genetics , Adenocarcinoma of Lung/immunology , Chemokines/genetics , Dendritic Cells/metabolism , Gene Silencing , Wnt1 Protein/metabolism , Adoptive Transfer , Animals , Cell Line, Tumor , Cell Proliferation , Chemokines/metabolism , Humans , Immune Evasion , Mice, Inbred C57BL , RNA Interference , Signal Transduction , T-Lymphocytes/metabolism , Up-Regulation
15.
Bioinformatics ; 35(17): 3157-3159, 2019 09 01.
Article in English | MEDLINE | ID: mdl-30649191

ABSTRACT

SUMMARY: Somatic Mutation calling method using a Random Forest (SMuRF) integrates predictions and auxiliary features from multiple somatic mutation callers using a supervised machine learning approach. SMuRF is trained on community-curated matched tumor and normal whole genome sequencing data. SMuRF predicts both SNVs and indels with high accuracy in genome or exome-level sequencing data. Furthermore, the method is robust across multiple tested cancer types and predicts low allele frequency variants with high accuracy. In contrast to existing ensemble-based somatic mutation calling approaches, SMuRF works out-of-the-box and is orders of magnitudes faster. AVAILABILITY AND IMPLEMENTATION: The method is implemented in R and available at https://github.com/skandlab/SMuRF. SMuRF operates as an add-on to the community-developed bcbio-nextgen somatic variant calling pipeline. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
High-Throughput Nucleotide Sequencing , Exome , Gene Frequency , Mutation , Supervised Machine Learning
16.
Nat Commun ; 9(1): 1520, 2018 04 18.
Article in English | MEDLINE | ID: mdl-29670109

ABSTRACT

Tissue-specific driver mutations in non-coding genomic regions remain undefined for most cancer types. Here, we unbiasedly analyze 212 gastric cancer (GC) whole genomes to identify recurrently mutated non-coding regions in GC. Applying comprehensive statistical approaches to accurately model background mutational processes, we observe significant enrichment of non-coding indels (insertions/deletions) in three gastric lineage-specific genes. We further identify 34 mutation hotspots, of which 11 overlap CTCF binding sites (CBSs). These CBS hotspots remain significant even after controlling for a genome-wide elevated mutation rate at CBSs. In 3 out of 4 tested CBS hotspots, mutations are nominally associated with expression change of neighboring genes. CBS hotspot mutations are enriched in tumors showing chromosomal instability, co-occur with neighboring chromosomal aberrations, and are common in gastric (25%) and colorectal (19%) tumors but rare in other cancer types. Mutational disruption of specific CBSs may thus represent a tissue-specific mechanism of tumorigenesis conserved across gastrointestinal cancers.


Subject(s)
CCCTC-Binding Factor/genetics , Chromosomal Instability , DNA Mutational Analysis , Gastrointestinal Neoplasms/genetics , INDEL Mutation , Mutation , Binding Sites , Cell Line, Tumor , Chromosome Aberrations , Conserved Sequence , Databases, Genetic , Epigenesis, Genetic , False Positive Reactions , Gene Expression Profiling , Genome, Human , Genomics , Humans , Models, Statistical , Mutation Rate
SELECTION OF CITATIONS
SEARCH DETAIL
...