Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 143
Filter
1.
Integr Comp Biol ; 2024 May 30.
Article in English | MEDLINE | ID: mdl-38816211

ABSTRACT

Comparative genomics provides ample ways to study genome evolution and its relationship to phenotypic traits. By developing and testing alternate models of evolution throughout a phylogeny, one can estimate rates of molecular evolution along different lineages in a phylogeny and link these rates with observations in extant species, such as convergent phenotypes. Pipelines for such work can help identify when and where genomic changes may be associated with, or possibly influence, phenotypic traits. We recently developed a set of models called PhyloAcc, using a Bayesian framework to estimate rates of nucleotide substitution on different branches a phylogenetic tree and evaluate their association with pre-defined or estimated phenotypic traits PhyloAcc-ST and PhyloAcc-GT both allow users to define a priori a set of target lineages and then compare different models to identify loci accelerating in one or more target lineages. Whereas ST considers only one species tree across all input loci, GT considers alternate topologies for every locus. PhyloAcc-C simultaneously models molecular rates and rates of continuous trait evolution,allowing the user to ask whether the two are associated. Here we describe these models and provide tips and workflows on how to prepare the input data and run PhyloAcc.

2.
PLoS Comput Biol ; 20(4): e1011995, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38656999

ABSTRACT

Genomes contain conserved non-coding sequences that perform important biological functions, such as gene regulation. We present a phylogenetic method, PhyloAcc-C, that associates nucleotide substitution rates with changes in a continuous trait of interest. The method takes as input a multiple sequence alignment of conserved elements, continuous trait data observed in extant species, and a background phylogeny and substitution process. Gibbs sampling is used to assign rate categories (background, conserved, accelerated) to lineages and explore whether the assigned rate categories are associated with increases or decreases in the rate of trait evolution. We test our method using simulations and then illustrate its application using mammalian body size and lifespan data previously analyzed with respect to protein coding genes. Like other studies, we find processes such as tumor suppression, telomere maintenance, and p53 regulation to be related to changes in longevity and body size. In addition, we also find that skeletal genes, and developmental processes, such as sprouting angiogenesis, are relevant.


Subject(s)
Evolution, Molecular , Models, Genetic , Phylogeny , Animals , Longevity/genetics , Humans , Computational Biology/methods , Computer Simulation , Body Size/genetics , Nucleotides/genetics , Sequence Alignment/methods
3.
Mol Biol Evol ; 40(9)2023 09 01.
Article in English | MEDLINE | ID: mdl-37665177

ABSTRACT

An important goal of evolutionary genomics is to identify genomic regions whose substitution rates differ among lineages. For example, genomic regions experiencing accelerated molecular evolution in some lineages may provide insight into links between genotype and phenotype. Several comparative genomics methods have been developed to identify genomic accelerations between species, including a Bayesian method called PhyloAcc, which models shifts in substitution rate in multiple target lineages on a phylogeny. However, few methods consider the possibility of discordance between the trees of individual loci and the species tree due to incomplete lineage sorting, which might cause false positives. Here, we present PhyloAcc-GT, which extends PhyloAcc by modeling gene tree heterogeneity. Given a species tree, we adopt the multispecies coalescent model as the prior distribution of gene trees, use Markov chain Monte Carlo (MCMC) for inference, and design novel MCMC moves to sample gene trees efficiently. Through extensive simulations, we show that PhyloAcc-GT outperforms PhyloAcc and other methods in identifying target lineage-specific accelerations and detecting complex patterns of rate shifts, and is robust to specification of population size parameters. PhyloAcc-GT is usually more conservative than PhyloAcc in calling convergent rate shifts because it identifies more accelerations on ancestral than on terminal branches. We apply PhyloAcc-GT to two examples of convergent evolution: flightlessness in ratites and marine mammal adaptations, and show that PhyloAcc-GT is a robust tool to identify shifts in substitution rate associated with specific target lineages while accounting for incomplete lineage sorting.


Subject(s)
Biological Evolution , Models, Genetic , Animals , Bayes Theorem , Phylogeny , Genomics , Mammals
4.
Cancer Discov ; 13(3): 672-701, 2023 03 01.
Article in English | MEDLINE | ID: mdl-36745048

ABSTRACT

Drugs that kill tumors through multiple mechanisms have the potential for broad clinical benefits. Here, we first developed an in silico multiomics approach (BipotentR) to find cancer cell-specific regulators that simultaneously modulate tumor immunity and another oncogenic pathway and then used it to identify 38 candidate immune-metabolic regulators. We show the tumor activities of these regulators stratify patients with melanoma by their response to anti-PD-1 using machine learning and deep neural approaches, which improve the predictive power of current biomarkers. The topmost identified regulator, ESRRA, is activated in immunotherapy-resistant tumors. Its inhibition killed tumors by suppressing energy metabolism and activating two immune mechanisms: (i) cytokine induction, causing proinflammatory macrophage polarization, and (ii) antigen-presentation stimulation, recruiting CD8+ T cells into tumors. We also demonstrate a wide utility of BipotentR by applying it to angiogenesis and growth suppressor evasion pathways. BipotentR (http://bipotentr.dfci.harvard.edu/) provides a resource for evaluating patient response and discovering drug targets that act simultaneously through multiple mechanisms. SIGNIFICANCE: BipotentR presents resources for evaluating patient response and identifying targets for drugs that can kill tumors through multiple mechanisms concurrently. Inhibition of the topmost candidate target killed tumors by suppressing energy metabolism and effects on two immune mechanisms. This article is highlighted in the In This Issue feature, p. 517.


Subject(s)
Antineoplastic Agents , Melanoma , Humans , Antineoplastic Agents/pharmacology , Receptors, Estrogen , Immunotherapy , Melanoma/pathology , CD8-Positive T-Lymphocytes , Tumor Microenvironment , Cell Line, Tumor , ERRalpha Estrogen-Related Receptor
5.
Genes (Basel) ; 13(7)2022 07 08.
Article in English | MEDLINE | ID: mdl-35886003

ABSTRACT

Openness-weighted association study (OWAS) is a method that leverages the in silico prediction of chromatin accessibility to prioritize genome-wide association studies (GWAS) signals, and can provide novel insights into the roles of non-coding variants in complex diseases. A prerequisite to apply OWAS is to choose a trait-related cell type beforehand. However, for most complex traits, the trait-relevant cell types remain elusive. In addition, many complex traits involve multiple related cell types. To address these issues, we develop OWAS-joint, an efficient framework that aggregates predicted chromatin accessibility across multiple cell types, to prioritize disease-associated genomic segments. In simulation studies, we demonstrate that OWAS-joint achieves a greater statistical power compared to OWAS. Moreover, the heritability explained by OWAS-joint segments is higher than or comparable to OWAS segments. OWAS-joint segments also have high replication rates in independent replication cohorts. Applying the method to six complex human traits, we demonstrate the advantages of OWAS-joint over a single-cell-type OWAS approach. We highlight that OWAS-joint enhances the biological interpretation of disease mechanisms, especially for non-coding regions.


Subject(s)
Genome-Wide Association Study , Polymorphism, Single Nucleotide , Chromatin , Genome-Wide Association Study/methods , Genomics , Humans , Phenotype
7.
Bioinformatics ; 38(7): 1938-1946, 2022 03 28.
Article in English | MEDLINE | ID: mdl-35020805

ABSTRACT

MOTIVATION: Polygenic risk score (PRS) has been widely exploited for genetic risk prediction due to its accuracy and conceptual simplicity. We introduce a unified Bayesian regression framework, NeuPred, for PRS construction, which accommodates varying genetic architectures and improves overall prediction accuracy for complex diseases by allowing for a wide class of prior choices. To take full advantage of the framework, we propose a summary-statistics-based cross-validation strategy to automatically select suitable chromosome-level priors, which demonstrates a striking variability of the prior preference of each chromosome, for the same complex disease, and further significantly improves the prediction accuracy. RESULTS: Simulation studies and real data applications with seven disease datasets from the Wellcome Trust Case Control Consortium cohort and eight groups of large-scale genome-wide association studies demonstrate that NeuPred achieves substantial and consistent improvements in terms of predictive r2 over existing methods. In addition, NeuPred has similar or advantageous computational efficiency compared with the state-of-the-art Bayesian methods. AVAILABILITY AND IMPLEMENTATION: The R package implementing NeuPred is available at https://github.com/shuangsong0110/NeuPred. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genome-Wide Association Study , Multifactorial Inheritance , Humans , Bayes Theorem , Genome-Wide Association Study/methods , Computer Simulation , Case-Control Studies
8.
R Soc Open Sci ; 8(8): 210653, 2021 Aug.
Article in English | MEDLINE | ID: mdl-34457345

ABSTRACT

Cooperation is one of the key collective behaviours of human society. Despite discoveries of several social mechanisms underpinning cooperation, relatively little is known about how our neural functions affect cooperative behaviours. Here, we study the effect of a main neural function, working-memory capacity, on cooperation in repeated Prisoner's Dilemma experiments. Our experimental paradigm overcomes the obstacles in measuring and changing subjects' working-memory capacity. We find that the optimal cooperation level occurs when subjects remember two previous rounds of information, and cooperation increases abruptly from no memory capacity to minimal memory capacity. The results can be explained by memory-based conditional cooperation of subjects. We propose evolutionary models based on replicator dynamics and Markov processes, respectively, which are in good agreement with experimental results of different memory capacities. Our experimental findings differ from previous hypotheses and predictions of existent models and theories, and suggest a neural basis and evolutionary roots of cooperation beyond cultural influences.

9.
Bioinformatics ; 37(24): 4737-4743, 2021 12 11.
Article in English | MEDLINE | ID: mdl-34260700

ABSTRACT

MOTIVATION: Identification and interpretation of non-coding variations that affect disease risk remain a paramount challenge in genome-wide association studies (GWAS) of complex diseases. Experimental efforts have provided comprehensive annotations of functional elements in the human genome. On the other hand, advances in computational biology, especially machine learning approaches, have facilitated accurate predictions of cell-type-specific functional annotations. Integrating functional annotations with GWAS signals has advanced the understanding of disease mechanisms. In previous studies, functional annotations were treated as static of a genomic region, ignoring potential functional differences imposed by different genotypes across individuals. RESULTS: We develop a computational approach, Openness Weighted Association Studies (OWAS), to leverage and aggregate predictions of chromosome accessibility in personal genomes for prioritizing GWAS signals. The approach relies on an analytical expression we derived for identifying disease associated genomic segments whose effects in the etiology of complex diseases are evaluated. In extensive simulations and real data analysis, OWAS identifies genes/segments that explain more heritability than existing methods, and has a better replication rate in independent cohorts than GWAS. Moreover, the identified genes/segments show tissue-specific patterns and are enriched in disease relevant pathways. We use rheumatic arthritis and asthma as examples to demonstrate how OWAS can be exploited to provide novel insights on complex diseases. AVAILABILITY AND IMPLEMENTATION: The R package OWAS that implements our method is available at https://github.com/shuangsong0110/OWAS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genome-Wide Association Study , Software , Humans , Genome-Wide Association Study/methods , Genotype , Genomics , Computational Biology
10.
Adv Ther ; 38(8): 4321-4332, 2021 08.
Article in English | MEDLINE | ID: mdl-34236672

ABSTRACT

INTRODUCTION: The phase 3 trial PALISADE, comparing peanut (Arachis hypogaea) allergen powder-dnfp (PTAH) oral immunotherapy versus placebo in peanut-allergic children, reported that a significantly higher percentage of PTAH-treated participants tolerated higher doses of peanut protein after 1 year of treatment. This study used PALISADE data to estimate the reduction in the risk of systemic allergic reaction (SAR) after accidental exposure following 1 year of PTAH treatment. METHODS: Participants (aged 4-17 years) enrolled in PALISADE were included. Parametric interval-censoring survival analysis with the maximum likelihood estimation was used to construct a real-world distribution of peanut protein exposure using lifetime SAR history and highest tolerated dose (HTD) from a double-blind, placebo-controlled food challenge conducted at baseline. The SAR risk reduction was extrapolated using the exposure distribution and the HTD were collected at baseline and trial exit for PTAH- and placebo-treated participants. RESULTS: Assuming a maximum peanut protein intake of 1500 mg, participants were estimated to have < 1% probability of ingesting > 0.01 mg during daily life. The mean annual SAR risk at trial entry was 9.25-9.98%. At trial exit, the relative SAR risk reduction following accidental exposure was 94.9% for PTAH versus 6.4% for placebo. For PTAH-treated participants with exit HTD of 600 or 1000 mg without dose-limiting symptoms, the SAR risk reduction increased to 97.2%. The result was consistent in the sensitivity analysis across different parametric distributions. CONCLUSION: Oral immunotherapy with PTAH is expected to result in a substantially greater reduction in risk of SAR following accidental exposure compared to placebo among children with peanut allergy.


Subject(s)
Arachis , Peanut Hypersensitivity , Administration, Oral , Allergens , Child , Desensitization, Immunologic , Humans , Peanut Hypersensitivity/therapy , Risk Reduction Behavior
11.
Brief Bioinform ; 22(3)2021 05 20.
Article in English | MEDLINE | ID: mdl-34020537

ABSTRACT

Deciphering microRNA (miRNA) targets is important for understanding the function of miRNAs as well as miRNA-based diagnostics and therapeutics. Given the highly cell-specific nature of miRNA regulation, recent computational approaches typically exploit expression data to identify the most physiologically relevant target messenger RNAs (mRNAs). Although effective, those methods usually require a large sample size to infer miRNA-mRNA interactions, thus limiting their applications in personalized medicine. In this study, we developed a novel miRNA target prediction algorithm called miRACLe (miRNA Analysis by a Contact modeL). It integrates sequence characteristics and RNA expression profiles into a random contact model, and determines the target preferences by relative probability of effective contacts in an individual-specific manner. Evaluation by a variety of measures shows that fitting TargetScan, a frequently used prediction tool, into the framework of miRACLe can improve its predictive power with a significant margin and consistently outperform other state-of-the-art methods in prediction accuracy, regulatory potential and biological relevance. Notably, the superiority of miRACLe is robust to various biological contexts, types of expression data and validation datasets, and the computation process is fast and efficient. Additionally, we show that the model can be readily applied to other sequence-based algorithms to improve their predictive power, such as DIANA-microT-CDS, miRanda-mirSVR and MirTarget4. MiRACLe is publicly available at https://github.com/PANWANG2014/miRACLe.


Subject(s)
Databases, Nucleic Acid , Gene Expression Regulation , MicroRNAs , Models, Genetic , Transcriptome , HeLa Cells , Humans , MicroRNAs/biosynthesis , MicroRNAs/genetics
12.
Proc Natl Acad Sci U S A ; 118(13)2021 03 30.
Article in English | MEDLINE | ID: mdl-33766915

ABSTRACT

Microglial-derived inflammation has been linked to a broad range of neurodegenerative and neuropsychiatric conditions, including amyotrophic lateral sclerosis (ALS). Using single-cell RNA sequencing, a class of Disease-Associated Microglia (DAMs) have been characterized in neurodegeneration. However, the DAM phenotype alone is insufficient to explain the functional complexity of microglia, particularly with regard to regulating inflammation that is a hallmark of many neurodegenerative diseases. Here, we identify a subclass of microglia in mouse models of ALS which we term RIPK1-Regulated Inflammatory Microglia (RRIMs). RRIMs show significant up-regulation of classical proinflammatory pathways, including increased levels of Tnf and Il1b RNA and protein. We find that RRIMs are highly regulated by TNFα signaling and that the prevalence of these microglia can be suppressed by inhibiting receptor-interacting protein kinase 1 (RIPK1) activity downstream of the TNF receptor 1. These findings help to elucidate a mechanism by which RIPK1 kinase inhibition has been shown to provide therapeutic benefit in mouse models of ALS and may provide an additional biomarker for analysis in ongoing phase 2 clinical trials of RIPK1 inhibitors in ALS.


Subject(s)
Amyotrophic Lateral Sclerosis/enzymology , Inflammation/enzymology , Microglia/enzymology , Receptor-Interacting Protein Serine-Threonine Kinases/metabolism , Amyotrophic Lateral Sclerosis/genetics , Amyotrophic Lateral Sclerosis/pathology , Animals , Cell Cycle Proteins/genetics , Disease Models, Animal , Interleukin-1beta/metabolism , Membrane Transport Proteins/genetics , Mice , Mice, Mutant Strains , Microglia/pathology , Receptor-Interacting Protein Serine-Threonine Kinases/antagonists & inhibitors , Receptor-Interacting Protein Serine-Threonine Kinases/genetics , Single-Cell Analysis , Superoxide Dismutase-1/genetics , Transcriptome , Tumor Necrosis Factor-alpha/metabolism
13.
Entropy (Basel) ; 22(3)2020 Mar 02.
Article in English | MEDLINE | ID: mdl-33286064

ABSTRACT

Traditional hypothesis-margin researches focus on obtaining large margins and feature selection. In this work, we show that the robustness of margins is also critical and can be measured using entropy. In addition, our approach provides clear mathematical formulations and explanations to uncover feature interactions, which is often lack in large hypothesis-margin based approaches. We design an algorithm, termed IMMIGRATE (Iterative max-min entropy margin-maximization with interaction terms), for training the weights associated with the interaction terms. IMMIGRATE simultaneously utilizes both local and global information and can be used as a base learner in Boosting. We evaluate IMMIGRATE in a wide range of tasks, in which it demonstrates exceptional robustness and achieves the state-of-the-art results with high interpretability.

14.
Cell Rep ; 33(10): 108447, 2020 12 08.
Article in English | MEDLINE | ID: mdl-33296651

ABSTRACT

The contribution and mechanism of cerebrovascular pathology in Alzheimer's disease (AD) pathogenesis are still unclear. Here, we show that venular and capillary cerebral endothelial cells (ECs) are selectively vulnerable to necroptosis in AD. We identify reduced cerebromicrovascular expression of murine N-acetyltransferase 1 (mNat1) in two AD mouse models and hNat2, the human ortholog of mNat1 and a genetic risk factor for type-2 diabetes and insulin resistance, in human AD. mNat1 deficiency in Nat1-/- mice and two AD mouse models promotes blood-brain barrier (BBB) damage and endothelial necroptosis. Decreased mNat1 expression induces lysosomal degradation of A20, an important regulator of necroptosis, and LRP1ß, a key component of LRP1 complex that exports Aß in cerebral ECs. Selective restoration of cerebral EC expression of mNAT1 delivered by adeno-associated virus (AAV) rescues cerebromicrovascular levels of A20 and LRP1ß, inhibits endothelial necroptosis and activation, ameliorates mitochondrial fragmentation, reduces Aß deposits, and improves cognitive function in the AD mouse model.


Subject(s)
Alzheimer Disease/metabolism , Arylamine N-Acetyltransferase/metabolism , Isoenzymes/metabolism , Aged , Aged, 80 and over , Alzheimer Disease/physiopathology , Amyloid beta-Peptides/metabolism , Animals , Arylamine N-Acetyltransferase/genetics , Biological Transport/physiology , Blood-Brain Barrier/metabolism , Brain/metabolism , Cell Cycle Proteins/metabolism , Cerebrum/metabolism , Disease Models, Animal , Endothelial Cells/metabolism , Female , Humans , Isoenzymes/genetics , Low Density Lipoprotein Receptor-Related Protein-1/metabolism , Male , Mice , Mice, Inbred C57BL , Mice, Knockout , Necroptosis/physiology , Peptide Fragments/metabolism , Transcription Factors/metabolism
15.
NAR Genom Bioinform ; 2(4): lqaa077, 2020 Dec.
Article in English | MEDLINE | ID: mdl-33029585

ABSTRACT

A main challenge in analyzing single-cell RNA sequencing (scRNA-seq) data is to reduce technical variations yet retain cell heterogeneity. Due to low mRNAs content per cell and molecule losses during the experiment (called 'dropout'), the gene expression matrix has a substantial amount of zero read counts. Existing imputation methods treat either each cell or each gene as independently and identically distributed, which oversimplifies the gene correlation and cell type structure. We propose a statistical model-based approach, called SIMPLEs (SIngle-cell RNA-seq iMPutation and celL clustErings), which iteratively identifies correlated gene modules and cell clusters and imputes dropouts customized for individual gene module and cell type. Simultaneously, it quantifies the uncertainty of imputation and cell clustering via multiple imputations. In simulations, SIMPLEs performed significantly better than prevailing scRNA-seq imputation methods according to various metrics. By applying SIMPLEs to several real datasets, we discovered gene modules that can further classify subtypes of cells. Our imputations successfully recovered the expression trends of marker genes in stem cell differentiation and can discover putative pathways regulating biological processes.

16.
Nat Commun ; 11(1): 2472, 2020 05 18.
Article in English | MEDLINE | ID: mdl-32424124

ABSTRACT

Characterization of the genomic distances over which transcription factor (TF) binding influences gene expression is important for inferring target genes from TF chromatin immunoprecipitation followed by sequencing (ChIP-seq) data. Here we systematically examine the relationship between thousands of TF and histone modification ChIP-seq data sets with thousands of gene expression profiles. We develop a model for integrating these data, which reveals two classes of TFs with distinct ranges of regulatory influence, chromatin-binding preferences, and auto-regulatory properties. We find that the regulatory range of the same TF bound within different topologically associating domains (TADs) depend on intrinsic TAD properties such as local gene density and G/C content, but also on the TAD chromatin states. Our results suggest that considering TF type, binding distance to gene locus, as well as chromatin context is important in identifying implicated TFs from GWAS SNPs.


Subject(s)
Gene Expression Regulation , Transcription Factors/metabolism , Acetylation , Animals , Cell Line , Chromatin/metabolism , Genome-Wide Association Study , Histones/metabolism , Lysine/metabolism , Mice , Models, Genetic , Polymorphism, Single Nucleotide/genetics , Protein Binding/genetics , Quantitative Trait Loci/genetics , Transcription Initiation Site
17.
Phys Rev E ; 101(3-1): 033301, 2020 Mar.
Article in English | MEDLINE | ID: mdl-32289991

ABSTRACT

We show that the Wang-Landau algorithm can be formulated as a stochastic gradient descent algorithm minimizing a smooth and convex objective function, of which the gradient is estimated using Markov chain Monte Carlo iterations. The optimization formulation provides us another way to establish the convergence rate of the Wang-Landau algorithm, by exploiting the fact that almost surely the density estimates (on the logarithmic scale) remain in a compact set, upon which the objective function is strongly convex. The optimization viewpoint motivates us to improve the efficiency of the Wang-Landau algorithm using popular tools including the momentum method and the adaptive learning rate method. We demonstrate the accelerated Wang-Landau algorithm on a two-dimensional Ising model and a two-dimensional ten-state Potts model.

18.
Methods Mol Biol ; 2120: 249-262, 2020.
Article in English | MEDLINE | ID: mdl-32124325

ABSTRACT

Tumor-infiltrating immune cells play critical roles in immune-mediated tumor rejection and/or progression, and are key targets of immunotherapies. Estimation of different immune subsets becomes increasingly important with the decreased cost of high-throughput molecular profiling and the rapidly growing amount of cancer genomics data. Here, we present Tumor IMmune Estimation Resource (TIMER), an in silico deconvolution method for inference of tumor-infiltrating immune components. TIMER takes bulk tissue gene expression profiles measured with RNA-seq or microarray to evaluate the abundance of six immune cell types in the tumor microenvironment: B cell, CD4+ T cell, CD8+ T cell, neutrophil, macrophage, and dendritic cell. We further introduce its associated webserver for convenient, user-friendly analysis of tumor immune infiltrates across multiple cancer types.


Subject(s)
Dendritic Cells/immunology , Gene Expression Profiling/methods , Lymphocytes, Tumor-Infiltrating/immunology , Neoplasms/genetics , Neutrophils/immunology , Tumor-Associated Macrophages/immunology , B-Lymphocytes/immunology , B-Lymphocytes/metabolism , CD4-Positive T-Lymphocytes/immunology , CD4-Positive T-Lymphocytes/metabolism , CD8-Positive T-Lymphocytes/immunology , CD8-Positive T-Lymphocytes/metabolism , Computer Simulation , Dendritic Cells/metabolism , Genomics/methods , Humans , Lymphocytes, Tumor-Infiltrating/metabolism , Neoplasms/immunology , Neutrophils/metabolism , Software , Tumor Microenvironment , Tumor-Associated Macrophages/metabolism
19.
Article in English | MEDLINE | ID: mdl-30113897

ABSTRACT

The recently developed Hi-C technology enables a genome-wide view of chromosome spatial organizations, and has shed deep insights into genome structure and genome function. However, multiple sources of uncertainties make downstream data analysis and interpretation challenging. Specifically, statistical models for inferring three-dimensional (3D) chromosomal structure from Hi-C data are far from their maturity. Most existing methods are highly over-parameterized, lacking clear interpretations, and sensitive to outliers. In this study, we propose a parsimonious, easy to interpret, and robust piecewise helical model for the inference of 3D chromosomal structure of individual topologically associated domain from Hi-C data. When applied to a real Hi-C dataset, the piecewise helical model not only achieves much better model fitting than existing models, but also reveals that geometric properties of chromatin spatial organization are closely related to genome function.


Subject(s)
Chromosomes , Genomics/methods , Animals , Bayes Theorem , Chromosomes/chemistry , Chromosomes/genetics , Chromosomes/ultrastructure , Computer Simulation , Genome/genetics , Mice
SELECTION OF CITATIONS
SEARCH DETAIL
...