Search | VHL Regional Portal

1.

Predicting patient-specific enhancer-promoter interactions.

Baur, Brittany; Roy, Sushmita.

Cell Rep Methods ; 3(9): 100594, 2023 09 25.

Article in English | MEDLINE | ID: mdl-37751694

ABSTRACT

Computational methods that can predict hard-to-measure modalities from those that are easier to measure, in a patient-specific manner, play a critical role in personalized medicine. In this issue of Cell Reports Methods, Khurana et al. present differential gene targets of accessible chromatin (DGTAC), an approach which predicts patient-specific enhancer-promoter interactions.

Subject(s)

Chromatin , Regulatory Sequences, Nucleic Acid , Humans , Promoter Regions, Genetic/genetics , Chromatin/genetics , Patients , Precision Medicine

2.

Leveraging epigenomes and three-dimensional genome organization for interpreting regulatory variation.

Baur, Brittany; Shin, Junha; Schreiber, Jacob; Zhang, Shilu; Zhang, Yi; Manjunath, Mohith; Song, Jun S; Stafford Noble, William; Roy, Sushmita.

PLoS Comput Biol ; 19(7): e1011286, 2023 07.

Article in English | MEDLINE | ID: mdl-37428809

ABSTRACT

Understanding the impact of regulatory variants on complex phenotypes is a significant challenge because the genes and pathways that are targeted by such variants and the cell type context in which regulatory variants operate are typically unknown. Cell-type-specific long-range regulatory interactions that occur between a distal regulatory sequence and a gene offer a powerful framework for examining the impact of regulatory variants on complex phenotypes. However, high-resolution maps of such long-range interactions are available only for a handful of cell types. Furthermore, identifying specific gene subnetworks or pathways that are targeted by a set of variants is a significant challenge. We have developed L-HiC-Reg, a Random Forests regression method to predict high-resolution contact counts in new cell types, and a network-based framework to identify candidate cell-type-specific gene networks targeted by a set of variants from a genome-wide association study (GWAS). We applied our approach to predict interactions in 55 Roadmap Epigenomics Mapping Consortium cell types, which we used to interpret regulatory single nucleotide polymorphisms (SNPs) in the NHGRI-EBI GWAS catalogue. Using our approach, we performed an in-depth characterization of fifteen different phenotypes including schizophrenia, coronary artery disease (CAD) and Crohn's disease. We found differentially wired subnetworks consisting of known as well as novel gene targets of regulatory SNPs. Taken together, our compendium of interactions and the associated network-based analysis pipeline leverages long-range regulatory interactions to examine the context-specific impact of regulatory variation in complex phenotypes.

Subject(s)

Epigenome , Genome-Wide Association Study , Humans , Genome-Wide Association Study/methods , Gene Regulatory Networks/genetics , Genome , Epigenomics , Polymorphism, Single Nucleotide/genetics , Genetic Predisposition to Disease

3.

Publisher Correction: Rewiring of the 3D genome during acquisition of carboplatin resistance in a triple-negative breast cancer patient-derived xenograft.

Dozmorov, Mikhail G; Marshall, Maggie A; Rashid, Narmeen S; Grible, Jacqueline M; Valentine, Aaron; Olex, Amy L; Murthy, Kavita; Chakraborty, Abhijit; Reyna, Joaquin; Figueroa, Daniela Salgado; Hinojosa-Gonzalez, Laura; Da-Inn Lee, Erika; Baur, Brittany A; Roy, Sushmita; Ay, Ferhat; Harrell, J Chuck.

Sci Rep ; 13(1): 7522, 2023 May 09.

Article in English | MEDLINE | ID: mdl-37160896

4.

Rewiring of the 3D genome during acquisition of carboplatin resistance in a triple-negative breast cancer patient-derived xenograft.

Dozmorov, Mikhail G; Marshall, Maggie A; Rashid, Narmeen S; Grible, Jacqueline M; Valentine, Aaron; Olex, Amy L; Murthy, Kavita; Chakraborty, Abhijit; Reyna, Joaquin; Figueroa, Daniela Salgado; Hinojosa-Gonzalez, Laura; Da-Inn Lee, Erika; Baur, Brittany A; Roy, Sushmita; Ay, Ferhat; Harrell, J Chuck.

Sci Rep ; 13(1): 5420, 2023 04 03.

Article in English | MEDLINE | ID: mdl-37012431

ABSTRACT

Changes in the three-dimensional (3D) structure of the genome are an emerging hallmark of cancer. Cancer-associated copy number variants and single nucleotide polymorphisms promote rewiring of chromatin loops, disruption of topologically associating domains (TADs), active/inactive chromatin state switching, leading to oncogene expression and silencing of tumor suppressors. However, little is known about 3D changes during cancer progression to a chemotherapy-resistant state. We integrated chromatin conformation capture (Hi-C), RNA-seq, and whole-genome sequencing obtained from triple-negative breast cancer patient-derived xenograft primary tumors (UCD52) and carboplatin-resistant samples and found increased short-range (< 2 Mb) interactions, chromatin looping, formation of TAD, chromatin state switching into a more active state, and amplification of ATP-binding cassette transporters. Transcriptome changes suggested the role of long-noncoding RNAs in carboplatin resistance. Rewiring of the 3D genome was associated with TP53, TP63, BATF, FOS-JUN family of transcription factors and led to activation of aggressiveness-, metastasis- and other cancer-related pathways. Integrative analysis highlighted increased ribosome biogenesis and oxidative phosphorylation, suggesting the role of mitochondrial energy metabolism. Our results suggest that 3D genome remodeling may be a key mechanism underlying carboplatin resistance.

Subject(s)

Triple Negative Breast Neoplasms , Humans , Carboplatin/pharmacology , Carboplatin/therapeutic use , Triple Negative Breast Neoplasms/drug therapy , Triple Negative Breast Neoplasms/genetics , Heterografts , Genome , Chromatin

5.

Deciphering the Role of 3D Genome Organization in Breast Cancer Susceptibility.

Baur, Brittany; Lee, Da-Inn; Haag, Jill; Chasman, Deborah; Gould, Michael; Roy, Sushmita.

Front Genet ; 12: 788318, 2021.

Article in English | MEDLINE | ID: mdl-35087569

ABSTRACT

Cancer risk by environmental exposure is modulated by an individual's genetics and age at exposure. This age-specific period of susceptibility is referred to as the "Window of Susceptibility" (WOS). Rats have a similar WOS for developing breast cancer. A previous study in rat identified an age-specific long-range regulatory interaction for the cancer gene, Pappa, that is associated with breast cancer susceptibility. However, the global role of three-dimensional genome organization and downstream gene expression programs in the WOS is not known. Therefore, we generated Hi-C and RNA-seq data in rat mammary epithelial cells within and outside the WOS. To systematically identify higher-order changes in 3D genome organization, we developed NE-MVNMF that combines network enhancement followed by multitask non-negative matrix factorization. We examined three-dimensional genome organization dynamics at the level of individual loops as well as higher-order domains. Differential chromatin interactions tend to be associated with differentially up-regulated genes with the WOS and recapitulate several human SNP-gene interactions associated with breast cancer susceptibility. Our approach identified genomic blocks of regions with greater overall differences in contact count between the two time points when the cluster assignments change and identified genes and pathways implicated in early carcinogenesis and cancer treatment. Our results suggest that WOS-specific changes in 3D genome organization are linked to transcriptional changes that may influence susceptibility to breast cancer.

6.

Data integration for inferring context-specific gene regulatory networks.

Baur, Brittany; Shin, Junha; Zhang, Shilu; Roy, Sushmita.

Curr Opin Syst Biol ; 23: 38-46, 2020 Oct.

Article in English | MEDLINE | ID: mdl-33225112

ABSTRACT

Transcriptional regulatory networks control context-specific gene expression patterns and play important roles in normal and disease processes. Advances in genomics are rapidly increasing our ability to measure different components of the regulation machinery at the single-cell and bulk population level. An important challenge is to combine different types of regulatory genomic measurements to construct a more complete picture of gene regulatory networks across different disease, environmental, and developmental contexts. In this review, we focus on recent computational methods that integrate regulatory genomic data sets to infer context specificity and dynamics in regulatory networks.

7.

The Cancer-Associated Genetic Variant Rs3903072 Modulates Immune Cells in the Tumor Microenvironment.

Zhang, Yi; Manjunath, Mohith; Yan, Jialu; Baur, Brittany A; Zhang, Shilu; Roy, Sushmita; Song, Jun S.

Front Genet ; 10: 754, 2019.

Article in English | MEDLINE | ID: mdl-31507631

ABSTRACT

Genome-wide association studies (GWAS) have hitherto identified several germline variants associated with cancer susceptibility, but the molecular functions of these risk modulators remain largely uncharacterized. Recent studies have begun to uncover the regulatory potential of noncoding GWAS SNPs using epigenetic information in corresponding cancer cell types and matched normal tissues. However, this approach does not explore the potential effect of risk germline variants on other important cell types that constitute the microenvironment of tumor or its precursor. This paper presents evidence that the breast-cancer-associated variant rs3903072 may regulate the expression of CTSW in tumor-infiltrating lymphocytes. CTSW is a candidate tumor-suppressor gene, with expression highly specific to immune cells and also positively correlated with breast cancer patient survival. Integrative analyses suggest a putative causative variant in a GWAS-linked enhancer in lymphocytes that loops to the 3' end of CTSW through three-dimensional chromatin interaction. Our work thus poses the possibility that a cancer-associated genetic variant could regulate a gene not only in the cell of cancer origin but also in immune cells in the microenvironment, thereby modulating the immune surveillance by T lymphocytes and natural killer cells and affecting the clearing of early cancer initiating cells.

8.

Genetic Fine-Mapping and Identification of Candidate Genes and Variants for Adiposity Traits in Outbred Rats.

Keele, Gregory R; Prokop, Jeremy W; He, Hong; Holl, Katie; Littrell, John; Deal, Aaron; Francic, Sanja; Cui, Leilei; Gatti, Daniel M; Broman, Karl W; Tschannen, Michael; Tsaih, Shirng-Wern; Zagloul, Maie; Kim, Yunjung; Baur, Brittany; Fox, Joseph; Robinson, Melanie; Levy, Shawn; Flister, Michael J; Mott, Richard; Valdar, William; Solberg Woods, Leah C.

Obesity (Silver Spring) ; 26(1): 213-222, 2018 01.

Article in English | MEDLINE | ID: mdl-29193816

ABSTRACT

OBJECTIVE: Obesity is a major risk factor for multiple diseases and is in part heritable, yet the majority of causative genetic variants that drive excessive adiposity remain unknown. Here, outbred heterogeneous stock (HS) rats were used in controlled environmental conditions to fine-map novel genetic modifiers of adiposity. METHODS: Body weight and visceral fat pad weights were measured in male HS rats that were also genotyped genome-wide. Quantitative trait loci (QTL) were identified by genome-wide association of imputed single-nucleotide polymorphism (SNP) genotypes using a linear mixed effect model that accounts for unequal relatedness between the HS rats. Candidate genes were assessed by protein modeling and mediation analysis of expression for coding and noncoding variants, respectively. RESULTS: HS rats exhibited large variation in adiposity traits, which were highly heritable and correlated with metabolic health. Fine-mapping of fat pad weight and body weight revealed three QTL and prioritized five candidate genes. Fat pad weight was associated with missense SNPs in Adcy3 and Prlhr and altered expression of Krtcap3 and Slc30a3, whereas Grid2 was identified as a candidate within the body weight locus. CONCLUSIONS: These data demonstrate the power of HS rats for identification of known and novel heritable mediators of obesity traits.

Subject(s)

Adiposity/genetics , Body Weight/genetics , Chromosome Mapping/methods , Genetic Variation/genetics , Genome-Wide Association Study/methods , Obesity/genetics , Animals , Genotype , Male , Phenotype , Polymorphism, Single Nucleotide , Rats

9.

ProcessDriver: A computational pipeline to identify copy number drivers and associated disrupted biological processes in cancer.

Baur, Brittany; Bozdag, Serdar.

Genomics ; 109(3-4): 233-240, 2017 07.

Article in English | MEDLINE | ID: mdl-28438487

ABSTRACT

Copy number amplifications and deletions that are recurrent in cancer samples harbor genes that confer a fitness advantage to cancer tumor proliferation and survival. One important challenge in computational biology is to separate the causal (i.e., driver) genes from passenger genes in large, aberrated regions. Many previous studies focus on the genes within the aberration (i.e., cis genes), but do not utilize the genes that are outside of the aberrated region and dysregulated as a result of the aberration (i.e., trans genes). We propose a computational pipeline, called ProcessDriver, that prioritizes candidate drivers by relating cis genes to dysregulated trans genes and biological processes. ProcessDriver is based on the assumption that a driver cis gene should be closely associated with the dysregulated trans genes and biological processes, as opposed to previous studies that assume a driver cis gene should be the most correlated gene to the copy number of an aberrated region. We applied our method on breast, bladder and ovarian cancer data from the Cancer Genome Atlas database. Our results included previously known driver genes and cancer genes, as well as potentially novel driver genes. Additionally, many genes in the final set of drivers were linked to new tumor events after initial treatment using survival analysis. Our results highlight the importance of selecting driver genes based on their widespread downstream effects in trans.

Subject(s)

Breast Neoplasms/genetics , Gene Dosage , Genomics/methods , Oncogenes , Ovarian Neoplasms/genetics , Urinary Bladder Neoplasms/genetics , Algorithms , Breast Neoplasms/pathology , DNA Copy Number Variations , Disease Progression , Female , Humans , Ovarian Neoplasms/pathology , Urinary Bladder Neoplasms/pathology

10.

A Feature Selection Algorithm to Compute Gene Centric Methylation from Probe Level Methylation Data.

Baur, Brittany; Bozdag, Serdar.

PLoS One ; 11(2): e0148977, 2016.

Article in English | MEDLINE | ID: mdl-26872146

ABSTRACT

DNA methylation is an important epigenetic event that effects gene expression during development and various diseases such as cancer. Understanding the mechanism of action of DNA methylation is important for downstream analysis. In the Illumina Infinium HumanMethylation 450K array, there are tens of probes associated with each gene. Given methylation intensities of all these probes, it is necessary to compute which of these probes are most representative of the gene centric methylation level. In this study, we developed a feature selection algorithm based on sequential forward selection that utilized different classification methods to compute gene centric DNA methylation using probe level DNA methylation data. We compared our algorithm to other feature selection algorithms such as support vector machines with recursive feature elimination, genetic algorithms and ReliefF. We evaluated all methods based on the predictive power of selected probes on their mRNA expression levels and found that a K-Nearest Neighbors classification using the sequential forward selection algorithm performed better than other algorithms based on all metrics. We also observed that transcriptional activities of certain genes were more sensitive to DNA methylation changes than transcriptional activities of other genes. Our algorithm was able to predict the expression of those genes with high accuracy using only DNA methylation data. Our results also showed that those DNA methylation-sensitive genes were enriched in Gene Ontology terms related to the regulation of various biological processes.

Subject(s)

Breast Neoplasms/genetics , DNA Methylation , Algorithms , Breast Neoplasms/metabolism , Cell Line, Tumor , DNA Probes , Epigenesis, Genetic , Female , Gene Expression , Gene Expression Regulation, Neoplastic , Gene Ontology , Humans , Multigene Family

11.

A canonical correlation analysis-based dynamic bayesian network prior to infer gene regulatory networks from multiple types of biological data.

Baur, Brittany; Bozdag, Serdar.

J Comput Biol ; 22(4): 289-99, 2015 Apr.

Article in English | MEDLINE | ID: mdl-25844668

ABSTRACT

One of the challenging and important computational problems in systems biology is to infer gene regulatory networks (GRNs) of biological systems. Several methods that exploit gene expression data have been developed to tackle this problem. In this study, we propose the use of copy number and DNA methylation data to infer GRNs. We developed an algorithm that scores regulatory interactions between genes based on canonical correlation analysis. In this algorithm, copy number or DNA methylation variables are treated as potential regulator variables, and expression variables are treated as potential target variables. We first validated that the canonical correlation analysis method is able to infer true interactions in high accuracy. We showed that the use of DNA methylation or copy number datasets leads to improved inference over steady-state expression. Our results also showed that epigenetic and structural information could be used to infer directionality of regulatory interactions. Additional improvements in GRN inference can be gleaned from incorporating the result in an informative prior in a dynamic Bayesian algorithm. This is the first study that incorporates copy number and DNA methylation into an informative prior in dynamic Bayesian framework. By closely examining top-scoring interactions with different sources of epigenetic or structural information, we also identified potential novel regulatory interactions.

Subject(s)

Gene Expression , Gene Regulatory Networks , Algorithms , Area Under Curve , Bayes Theorem , Breast Neoplasms/genetics , Breast Neoplasms/metabolism , DNA Methylation , Epigenesis, Genetic , Female , Gene Expression Regulation, Neoplastic , Humans , Models, Genetic , ROC Curve

12.

Biological network motif detection: principles and practice.

Wong, Elisabeth; Baur, Brittany; Quader, Saad; Huang, Chun-Hsi.

Brief Bioinform ; 13(2): 202-15, 2012 Mar.

Article in English | MEDLINE | ID: mdl-22396487

ABSTRACT

Network motifs are statistically overrepresented sub-structures (sub-graphs) in a network, and have been recognized as 'the simple building blocks of complex networks'. Study of biological network motifs may reveal answers to many important biological questions. The main difficulty in detecting larger network motifs in biological networks lies in the facts that the number of possible sub-graphs increases exponentially with the network or motif size (node counts, in general), and that no known polynomial-time algorithm exists in deciding if two graphs are topologically equivalent. This article discusses the biological significance of network motifs, the motivation behind solving the motif-finding problem, and strategies to solve the various aspects of this problem. A simple classification scheme is designed to analyze the strengths and weaknesses of several existing algorithms. Experimental results derived from a few comparative studies in the literature are discussed, with conclusions that lead to future research directions.

Subject(s)

Computational Biology/methods , Gene Regulatory Networks , Algorithms , Models, Biological , Protein Interaction Mapping/methods

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL