ABSTRACT
In oncology, biomarkers are widely used to predict subgroups of patients that respond to a given drug. Although clinical decisions often rely on single gene biomarkers, machine learning approaches tend to generate complex multi-gene biomarkers that are hard to interpret. Models predicting drug response based on multiple altered genes often assume that the effects of single alterations are independent. We asked whether the association of cancer driver mutations with drug response is modulated by other driver mutations or the tissue of origin. We developed an analytic framework based on linear regression to study interactions in pharmacogenomic data from two large cancer cell line panels. Starting from a model with only covariates, we included additional variables only if they significantly improved simpler models. This allows to systematically assess interactions in small, easily interpretable models. Our results show that including mutation-mutation interactions in drug response prediction models tends to improve model performance and robustness. For example, we found that TP53 mutations decrease sensitivity to BRAF inhibitors in BRAF-mutated cell lines and patient tumors, suggesting a therapeutic benefit of combining inhibition of oncogenic BRAF with reactivation of the tumor suppressor TP53. Moreover, we identified tissue-specific mutation-drug associations and synthetic lethal triplets where the simultaneous mutation of two genes sensitizes cells to a drug. In summary, our interaction-based approach contributes to a holistic view on the determining factors of drug response.
Subject(s)
Antineoplastic Agents/pharmacology , Biomarkers, Tumor/genetics , Gene Expression Regulation, Neoplastic/drug effects , Mutation , Neoplasms/pathology , Pharmacogenetics , Protein Kinase Inhibitors/pharmacology , Databases, Factual , Humans , Neoplasms/drug therapy , Neoplasms/genetics , Organ SpecificityABSTRACT
Copy number alterations (CNAs) in cancer patients show a large variability in their number, length and position, but the sources of this variability are not known. CNA number and length are linked to patient survival, suggesting clinical relevance. We have identified genes that tend to be mutated in samples that have few or many CNAs, which we term CONIM genes (COpy Number Instability Modulators). CONIM proteins cluster into a densely connected subnetwork of physical interactions and many of them are epigenetic modifiers. Therefore, we investigated how the epigenome of the tissue-of-origin influences the position of CNA breakpoints and the properties of the resulting CNAs. We found that the presence of heterochromatin in the tissue-of-origin contributes to the recurrence and length of CNAs in the respective cancer type.