Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 44
Filter
1.
J Integr Bioinform ; 2024 Apr 15.
Article in English | MEDLINE | ID: mdl-38613325

ABSTRACT

Modern biological research is increasingly informed by computational simulation experiments, which necessitate the development of methods for annotating, archiving, sharing, and reproducing the conducted experiments. These simulations increasingly require extensive collaboration among modelers, experimentalists, and engineers. The Minimum Information About a Simulation Experiment (MIASE) guidelines outline the information needed to share simulation experiments. SED-ML is a computer-readable format for the information outlined by MIASE, created as a community project and supported by many investigators and software tools. Level 1 Version 5 of SED-ML expands the ability of modelers to define simulations in SED-ML using the Kinetic Simulation Algorithm Onotoloy (KiSAO). While it was possible in Version 4 to define a simulation entirely using KiSAO, Version 5 now allows users to define tasks, model changes, ranges, and outputs using the ontology as well. SED-ML is supported by a growing ecosystem of investigators, model languages, and software tools, including various languages for constraint-based, kinetic, qualitative, rule-based, and spatial models, and many simulation tools, visual editors, model repositories, and validators. Additional information about SED-ML is available at https://sed-ml.org/.

2.
Mol Cell ; 84(2): 234-243.e4, 2024 Jan 18.
Article in English | MEDLINE | ID: mdl-38159566

ABSTRACT

Transcription coactivators are proteins or protein complexes that mediate transcription factor (TF) function. However, they lack DNA-binding capacity, prompting the question of how they engage target loci. Three non-exclusive hypotheses have been posited: coactivators are recruited by complexing with TFs, by binding histones through epigenetic reader domains, or by partitioning into condensates through their extensive intrinsically disordered regions. Using p300 as a prototypical coactivator, we systematically mutated its annotated domains and show by single-molecule tracking in live U2OS cells that coactivator-chromatin binding depends entirely on combinatorial binding of multiple TF-interaction domains. Furthermore, we demonstrate that acetyltransferase activity opposes p300-chromatin association and that the N-terminal TF-interaction domains regulate that activity. Single TF-interaction domains are insufficient for chromatin binding and regulation of catalytic activity, implying a principle that we speculate could broadly apply to eukaryotic gene regulation: a TF must act in coordination with other TFs to recruit coactivator activity.


Subject(s)
Transcription Factors , Transcription, Genetic , Transcription Factors/genetics , Transcription Factors/metabolism , Gene Expression Regulation , Histones/metabolism , Chromatin/genetics
3.
bioRxiv ; 2023 May 18.
Article in English | MEDLINE | ID: mdl-37292840

ABSTRACT

Transcription coactivators are proteins or protein complexes that mediate transcription factor (TF) function. However, they lack DNA binding capacity, prompting the question of how they engage target loci. Three non-exclusive hypotheses have been posited: coactivators are recruited by complexing with TFs, by binding histones through epigenetic reader domains, or by partitioning into phase-separated compartments through their extensive intrinsically disordered regions (IDRs). Using p300 as a prototypical coactivator, we systematically mutated its annotated domains and show by single-molecule tracking in live cells that coactivator-chromatin binding depends entirely on combinatorial binding of multiple TF-interaction domains. Furthermore, we demonstrate that acetyltransferase activity negatively impacts p300-chromatin association and that the N-terminal TF-interaction domains regulate that activity. Single TF-interaction domains are insufficient for both chromatin binding and regulation of catalytic activity, implying a principle that could broadly inform eukaryotic gene regulation: a TF must act in coordination with other TFs to recruit coactivator activity.

4.
Bioinform Adv ; 3(1): vbad056, 2023.
Article in English | MEDLINE | ID: mdl-37179703

ABSTRACT

Motivation: Efficient resource allocation can contribute to an organism's fitness and can improve evolutionary success. Resource Balance Analysis (RBA) is a computational framework that models an organism's growth-optimal proteome configurations in various environments. RBA software enables the construction of RBA models on genome scale and the calculation of medium-specific, growth-optimal cell states including metabolic fluxes and the abundance of macromolecular machines. However, existing software lacks a simple programming interface for non-expert users, easy to use and interoperable with other software. Results: The python package RBAtools provides convenient access to RBA models. As a flexible programming interface, it enables the implementation of custom workflows and the modification of existing genome-scale RBA models. Its high-level functions comprise simulation, model fitting, parameter screens, sensitivity analysis, variability analysis and the construction of Pareto fronts. Models and data are represented as structured tables and can be exported to common data formats for fluxomics and proteomics visualization. Availability and implementation: RBAtools documentation, installation instructions and tutorials are available at https://sysbioinra.github.io/rbatools/. General information about RBA and related software can be found at rba.inrae.fr.

5.
Mol Cell ; 82(21): 3970-3984, 2022 11 03.
Article in English | MEDLINE | ID: mdl-36265487

ABSTRACT

Many principles of bacterial gene regulation have been foundational to understanding mechanisms of eukaryotic transcription. However, stark structural and functional differences exist between eukaryotic and bacterial transcription factors that complicate inferring properties of the eukaryotic system from that of bacteria. Here, we review those differences, focusing on the impact of intrinsically disordered regions on the thermodynamic and kinetic parameters governing eukaryotic transcription factor interactions-both with other proteins and with chromatin. The prevalence of unstructured domains in eukaryotic transcription factors as well as their known impact on function call for more sophisticated knowledge of what mechanisms they support. Using the evidence available to date, we posit that intrinsically disordered regions are necessary for the complex and integrative functions of eukaryotic transcription factors and that only by understanding their rich biochemistry can we develop a deep molecular understanding of their regulatory mechanisms.


Subject(s)
Intrinsically Disordered Proteins , Transcription Factors , Transcription Factors/genetics , Transcription Factors/metabolism , Eukaryota/genetics , Eukaryotic Cells/metabolism , Gene Expression Regulation , Intrinsically Disordered Proteins/metabolism
6.
Genes Dev ; 36(1-2): 7-16, 2022 01 01.
Article in English | MEDLINE | ID: mdl-34969825

ABSTRACT

How distal cis-regulatory elements (e.g., enhancers) communicate with promoters remains an unresolved question of fundamental importance. Although transcription factors and cofactors are known to mediate this communication, the mechanism by which diffusible molecules relay regulatory information from one position to another along the chromosome is a biophysical puzzle-one that needs to be revisited in light of recent data that cannot easily fit into previous solutions. Here we propose a new model that diverges from the textbook enhancer-promoter looping paradigm and offer a synthesis of the literature to make a case for its plausibility, focusing on the coactivator p300.


Subject(s)
Enhancer Elements, Genetic , Transcription, Genetic , Enhancer Elements, Genetic/genetics , Gene Expression Regulation , Promoter Regions, Genetic/genetics , Transcription Factors/genetics , Transcription Factors/metabolism
7.
J Mol Biol ; 434(2): 167351, 2022 01 30.
Article in English | MEDLINE | ID: mdl-34774566

ABSTRACT

Building structural models of entire cells has been a long-standing cross-discipline challenge for the research community, as it requires an unprecedented level of integration between multiple sources of biological data and enhanced methods for computational modeling and visualization. Here, we present the first 3D structural models of an entire Mycoplasma genitalium (MG) cell, built using the CellPACK suite of computational modeling tools. Our model recapitulates the data described in recent whole-cell system biology simulations and provides a structural representation for all MG proteins, DNA and RNA molecules, obtained by combining experimental and homology-modeled structures and lattice-based models of the genome. We establish a framework for gathering, curating and evaluating these structures, exposing current weaknesses of modeling methods and the boundaries of MG structural knowledge, and visualization methods to explore functional characteristics of the genome and proteome. We compare two approaches for data gathering, a manually-curated workflow and an automated workflow that uses homologous structures, both of which are appropriate for the analysis of mesoscale properties such as crowding and volume occupancy. Analysis of model quality provides estimates of the regularization that will be required when these models are used as starting points for atomic molecular dynamics simulations.


Subject(s)
Models, Structural , Mycoplasma/chemistry , Bacteria , Computational Biology , Genome, Bacterial , Molecular Dynamics Simulation , Mycoplasma/genetics , Mycoplasma genitalium , Proteome/genetics , Transcriptome
8.
Article in English | MEDLINE | ID: mdl-36909847

ABSTRACT

During the COVID-19 pandemic, mathematical modeling of disease transmission has become a cornerstone of key state decisions. To advance the state-of-the-art host viral modeling to handle future pandemics, many scientists working on related issues assembled to discuss the topics. These discussions exposed the reproducibility crisis that leads to inability to reuse and integrate models. This document summarizes these discussions, presents difficulties, and mentions existing efforts towards future solutions that will allow future model utility and integration. We argue that without addressing these challenges, scientists will have diminished ability to build, disseminate, and implement high-impact multi-scale modeling that is needed to understand the health crises we face.

9.
Biophys J ; 120(23): 5231-5242, 2021 12 07.
Article in English | MEDLINE | ID: mdl-34757076

ABSTRACT

Stochasticity from gene expression in single cells is known to drive metabolic heterogeneity at the level of cellular populations, which is understood to have important consequences for issues such as microbial drug tolerance and treatment of human diseases like cancer. Despite considerable advancements in profiling the genomes, transcriptomes, and proteomes of single cells, it remains difficult to experimentally characterize their metabolism at the genome scale. Computational methods could bridge this gap toward a systems understanding of single-cell biology. To address this challenge, we developed stochastic simulation algorithm with flux-balance analysis embedded (SSA-FBA), a computational framework for simulating the stochastic dynamics of the metabolism of individual cells using genome-scale metabolic models with experimental estimates of gene expression and enzymatic reaction rate parameters. SSA-FBA extends the constraint-based modeling formalism of metabolic network modeling to the single-cell regime, enabling simulation when experimentation is intractable. We also developed an efficient implementation of SSA-FBA that leverages the topology of embedded flux-balance analysis models to significantly reduce the computational cost of simulation. As a preliminary case study, we built a reduced single-cell model of Mycoplasma pneumoniae and used SSA-FBA to illustrate the role of stochasticity on the dynamics of metabolism at the single-cell level.


Subject(s)
Metabolic Networks and Pathways , Models, Biological , Algorithms , Computer Simulation , Humans
10.
Nature ; 600(7887): 138-142, 2021 12.
Article in English | MEDLINE | ID: mdl-34759314

ABSTRACT

Pathogens use virulence factors to inhibit the immune system1. The guard hypothesis2,3 postulates that hosts monitor (or 'guard') critical innate immune pathways such that their disruption by virulence factors provokes a secondary immune response1. Here we describe a 'self-guarded' immune pathway in human monocytes, in which guarding and guarded functions are combined in one protein. We find that this pathway is triggered by ICP0, a key virulence factor of herpes simplex virus type 1, resulting in robust induction of anti-viral type I interferon (IFN). Notably, induction of IFN by ICP0 is independent of canonical immune pathways and the IRF3 and IRF7 transcription factors. A CRISPR screen identified the ICP0 target MORC34 as an essential negative regulator of IFN. Loss of MORC3 recapitulates the IRF3- and IRF7-independent IFN response induced by ICP0. Mechanistically, ICP0 degrades MORC3, which leads to de-repression of a MORC3-regulated DNA element (MRE) adjacent to the IFNB1 locus. The MRE is required in cis for IFNB1 induction by the MORC3 pathway, but is not required for canonical IFN-inducing pathways. As well as repressing the MRE to regulate IFNB1, MORC3 is also a direct restriction factor of HSV-15. Our results thus suggest a model in which the primary anti-viral function of MORC3 is self-guarded by its secondary IFN-repressing function-thus, a virus that degrades MORC3 to avoid its primary anti-viral function will unleash the secondary anti-viral IFN response.


Subject(s)
Adenosine Triphosphatases/immunology , DNA-Binding Proteins/immunology , Models, Immunological , Virulence Factors/immunology , Adenosine Triphosphatases/deficiency , Adenosine Triphosphatases/metabolism , CRISPR-Cas Systems , Cell Line , DNA-Binding Proteins/deficiency , DNA-Binding Proteins/metabolism , Gene Editing , Herpesvirus 1, Human/immunology , Herpesvirus 1, Human/pathogenicity , Humans , Immediate-Early Proteins/immunology , Immunity, Innate , Interferon Regulatory Factor-3/metabolism , Interferon Regulatory Factor-7/metabolism , Interferon Type I/antagonists & inhibitors , Interferon Type I/genetics , Interferon Type I/immunology , Monocytes/immunology , Receptor, Interferon alpha-beta , Repressor Proteins/deficiency , Repressor Proteins/immunology , Repressor Proteins/metabolism , Response Elements/genetics , Ubiquitin-Protein Ligases/immunology
12.
Bioinformatics ; 37(21): 3702-3706, 2021 11 05.
Article in English | MEDLINE | ID: mdl-34179955

ABSTRACT

Computational models of biological systems can exploit a broad range of rapidly developing approaches, including novel experimental approaches, bioinformatics data analysis, emerging modelling paradigms, data standards and algorithms. A discussion about the most recent advances among experts from various domains is crucial to foster data-driven computational modelling and its growing use in assessing and predicting the behaviour of biological systems. Intending to encourage the development of tools, approaches and predictive models, and to deepen our understanding of biological systems, the Community of Special Interest (COSI) was launched in Computational Modelling of Biological Systems (SysMod) in 2016. SysMod's main activity is an annual meeting at the Intelligent Systems for Molecular Biology (ISMB) conference, which brings together computer scientists, biologists, mathematicians, engineers, computational and systems biologists. In the five years since its inception, SysMod has evolved into a dynamic and expanding community, as the increasing number of contributions and participants illustrate. SysMod maintains several online resources to facilitate interaction among the community members, including an online forum, a calendar of relevant meetings and a YouTube channel with talks and lectures of interest for the modelling community. For more than half a decade, the growing interest in computational systems modelling and multi-scale data integration has inspired and supported the SysMod community. Its members get progressively more involved and actively contribute to the annual COSI meeting and several related community workshops and meetings, focusing on specific topics, including particular techniques for computational modelling or standardisation efforts.


Subject(s)
Computational Biology , Systems Biology , Humans , Computer Simulation , Algorithms , Data Analysis
13.
Nucleic Acids Res ; 49(W1): W597-W602, 2021 07 02.
Article in English | MEDLINE | ID: mdl-34019658

ABSTRACT

Comprehensive, predictive computational models have significant potential for science, bioengineering, and medicine. One promising way to achieve more predictive models is to combine submodels of multiple subsystems. To capture the multiple scales of biology, these submodels will likely require multiple modeling frameworks and simulation algorithms. Several community resources are already available for working with many of these frameworks and algorithms. However, the variety and sheer number of these resources make it challenging to find and use appropriate tools for each model, especially for novice modelers and experimentalists. To make these resources easier to use, we developed RunBioSimulations (https://run.biosimulations.org), a single web application for executing a broad range of models. RunBioSimulations leverages community resources, including BioSimulators, a new open registry of simulation tools. These resources currently enable RunBioSimulations to execute nine frameworks and 44 algorithms, and they make RunBioSimulations extensible to additional frameworks and algorithms. RunBioSimulations also provides features for sharing simulations and interactively visualizing their results. We anticipate that RunBioSimulations will foster reproducibility, stimulate collaboration, and ultimately facilitate the creation of more predictive models.


Subject(s)
Computer Simulation , Models, Biological , Software , Algorithms , Computational Biology , Internet
14.
Brief Bioinform ; 22(5)2021 09 02.
Article in English | MEDLINE | ID: mdl-33834185

ABSTRACT

Detailed maps of the molecular basis of the disease are powerful tools for interpreting data and building predictive models. Modularity and composability are considered necessary network features for large-scale collaborative efforts to build comprehensive molecular descriptions of disease mechanisms. An effective way to create and manage large systems is to compose multiple subsystems. Composable network components could effectively harness the contributions of many individuals and enable teams to seamlessly assemble many individual components into comprehensive maps. We examine manually built versions of the RAS-RAF-MEK-ERK cascade from the Atlas of Cancer Signalling Network, PANTHER and Reactome databases and review them in terms of their reusability and composability for assembling new disease models. We identify design principles for managing complex systems that could make it easier for investigators to share and reuse network components. We demonstrate the main challenges including incompatible levels of detail and ambiguous representation of complexes and highlight the need to address these challenges.


Subject(s)
Computational Biology/methods , Databases, Factual , MAP Kinase Signaling System , Neoplasms/metabolism , raf Kinases/metabolism , ras Proteins/metabolism , Data Mining/methods , Humans , Internet , Models, Biological , Phosphorylation , Reproducibility of Results
15.
J Integr Bioinform ; 18(3): 20210021, 2021 Oct 05.
Article in English | MEDLINE | ID: mdl-35330701

ABSTRACT

Computational simulation experiments increasingly inform modern biological research, and bring with them the need to provide ways to annotate, archive, share and reproduce the experiments performed. These simulations increasingly require extensive collaboration among modelers, experimentalists, and engineers. The Minimum Information About a Simulation Experiment (MIASE) guidelines outline the information needed to share simulation experiments. SED-ML is a computer-readable format for the information outlined by MIASE, created as a community project and supported by many investigators and software tools. The first versions of SED-ML focused on deterministic and stochastic simulations of models. Level 1 Version 4 of SED-ML substantially expands these capabilities to cover additional types of models, model languages, parameter estimations, simulations and analyses of models, and analyses and visualizations of simulation results. To facilitate consistent practices across the community, Level 1 Version 4 also more clearly describes the use of SED-ML constructs, and includes numerous concrete validation rules. SED-ML is supported by a growing ecosystem of investigators, model languages, and software tools, including eight languages for constraint-based, kinetic, qualitative, rule-based, and spatial models, over 20 simulation tools, visual editors, model repositories, and validators. Additional information about SED-ML is available at https://sed-ml.org/.


Subject(s)
Language , Programming Languages , Ecosystem , Models, Biological , Systems Biology
16.
Nucleic Acids Res ; 49(D1): D516-D522, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33174603

ABSTRACT

Integrative research about multiple biochemical subsystems has significant potential to help advance biology, bioengineering and medicine. However, it is difficult to obtain the diverse data needed for integrative research. To facilitate biochemical research, we developed Datanator (https://datanator.info), an integrated database and set of tools for finding clouds of multiple types of molecular data about specific molecules and reactions in specific organisms and environments, as well as data about chemically-similar molecules and reactions in phylogenetically-similar organisms in similar environments. Currently, Datanator includes metabolite concentrations, RNA modifications and half-lives, protein abundances and modifications, and reaction rate constants about a broad range of organisms. Going forward, we aim to launch a community initiative to curate additional data. Datanator also provides tools for filtering, visualizing and exporting these data clouds. We believe that Datanator can facilitate a wide range of research from integrative mechanistic models, such as whole-cell models, to comparative data-driven analyses of multiple organisms.


Subject(s)
Cells/metabolism , Databases, Genetic , Models, Biological , Data Analysis
17.
Cell Syst ; 11(2): 109-120, 2020 08 26.
Article in English | MEDLINE | ID: mdl-32853539

ABSTRACT

Like many scientific disciplines, dynamical biochemical modeling is hindered by irreproducible results. This limits the utility of biochemical models by making them difficult to understand, trust, or reuse. We comprehensively list the best practices that biochemical modelers should follow to build reproducible biochemical model artifacts-all data, model descriptions, and custom software used by the model-that can be understood and reused. The best practices provide advice for all steps of a typical biochemical modeling workflow in which a modeler collects data; constructs, trains, simulates, and validates the model; uses the predictions of a model to advance knowledge; and publicly shares the model artifacts. The best practices emphasize the benefits obtained by using standard tools and formats and provides guidance to modelers who do not or cannot use standards in some stages of their modeling workflow. Adoption of these best practices will enhance the ability of researchers to reproduce, understand, and reuse biochemical models.


Subject(s)
Computer Simulation/standards , Systems Biology/methods , Humans
18.
Article in English | MEDLINE | ID: mdl-32850764

ABSTRACT

Computer-aided design (CAD) for synthetic biology promises to accelerate the rational and robust engineering of biological systems. It requires both detailed and quantitative mathematical and experimental models of the processes to (re)design biology, and software and tools for genetic engineering and DNA assembly. Ultimately, the increased precision in the design phase will have a dramatic impact on the production of designer cells and organisms with bespoke functions and increased modularity. CAD strategies require quantitative models of cells that can capture multiscale processes and link genotypes to phenotypes. Here, we present a perspective on how whole-cell, multiscale models could transform design-build-test-learn cycles in synthetic biology. We show how these models could significantly aid in the design and learn phases while reducing experimental testing by presenting case studies spanning from genome minimization to cell-free systems. We also discuss several challenges for the realization of our vision. The possibility to describe and build whole-cells in silico offers an opportunity to develop increasingly automatized, precise and accessible CAD tools and strategies.

19.
Genome Biol ; 21(1): 117, 2020 05 18.
Article in English | MEDLINE | ID: mdl-32423472

ABSTRACT

Non-canonical residues, caps, crosslinks, and nicks are important to many functions of DNAs, RNAs, proteins, and complexes. However, we do not fully understand how networks of such non-canonical macromolecules generate behavior. One barrier is our limited formats for describing macromolecules. To overcome this barrier, we develop BpForms and BcForms, a toolkit for representing the primary structure of macromolecules as combinations of residues, caps, crosslinks, and nicks. The toolkit can help omics researchers perform quality control and exchange information about macromolecules, help systems biologists assemble global models of cells that encompass processes such as post-translational modification, and help bioengineers design cells.


Subject(s)
Macromolecular Substances/standards , Molecular Structure , Software , Macromolecular Substances/chemistry , Proteomics , Synthetic Biology , Systems Biology
20.
Nat Commun ; 11(1): 689, 2020 02 04.
Article in English | MEDLINE | ID: mdl-32019919

ABSTRACT

Genome-scale engineering holds great potential to impact science, industry, medicine, and society, and recent improvements in DNA synthesis have enabled the manipulation of megabase genomes. However, coordinating and integrating the workflows and large teams necessary for gigabase genome engineering remains a considerable challenge. We examine this issue and recommend a path forward by: 1) adopting and extending existing representations for designs, assembly plans, samples, data, and workflows; 2) developing new technologies for data curation and quality control; 3) conducting fundamental research on genome-scale modeling and design; and 4) developing new legal and contractual infrastructure to facilitate collaboration.


Subject(s)
Genetic Engineering , Genome , Animals , DNA/genetics , DNA Replication , Databases, Genetic , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...