Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 12 de 12
Filter
Add more filters










Publication year range
1.
J Phys Chem B ; 127(40): 8537-8550, 2023 Oct 12.
Article in English | MEDLINE | ID: mdl-37791670

ABSTRACT

The "bottom-up" approach to coarse-graining, for building accurate and efficient computational models to simulate large-scale and complex phenomena and processes, is an important approach in computational chemistry, biophysics, and materials science. As one example, the Multiscale Coarse-Graining (MS-CG) approach to developing CG models can be rigorously derived using statistical mechanics applied to fine-grained, i.e., all-atom simulation data for a given system. Under a number of circumstances, a systematic procedure, such as MS-CG modeling, is particularly valuable. Here, we present the development of the OpenMSCG software, a modularized open-source software that provides a collection of successful and widely applied bottom-up CG methods, including Boltzmann Inversion (BI), Force-Matching (FM), Ultra-Coarse-Graining (UCG), Relative Entropy Minimization (REM), Essential Dynamics Coarse-Graining (EDCG), and Heterogeneous Elastic Network Modeling (HeteroENM). OpenMSCG is a high-performance and comprehensive toolset that can be used to derive CG models from large-scale fine-grained simulation data in file formats from common molecular dynamics (MD) software packages, such as GROMACS, LAMMPS, and NAMD. OpenMSCG is modularized in the Python programming framework, which allows users to create and customize modeling "recipes" for reproducible results, thus greatly improving the reliability, reproducibility, and sharing of bottom-up CG models and their applications.

2.
J Chem Phys ; 158(23)2023 Jun 21.
Article in English | MEDLINE | ID: mdl-37318166

ABSTRACT

Bottom-up coarse-grained (CG) molecular dynamics models are parameterized using complex effective Hamiltonians. These models are typically optimized to approximate high dimensional data from atomistic simulations. However, human validation of these models is often limited to low dimensional statistics that do not necessarily differentiate between the CG model and said atomistic simulations. We propose that classification can be used to variationally estimate high dimensional error and that explainable machine learning can help convey this information to scientists. This approach is demonstrated using Shapley additive explanations and two CG protein models. This framework may also be valuable for ascertaining whether allosteric effects at the atomistic level are accurately propagated to a CG model.

3.
J Phys Chem Lett ; 14(17): 3970-3979, 2023 May 04.
Article in English | MEDLINE | ID: mdl-37079800

ABSTRACT

Machine-learned coarse-grained (CG) models have the potential for simulating large molecular complexes beyond what is possible with atomistic molecular dynamics. However, training accurate CG models remains a challenge. A widely used methodology for learning bottom-up CG force fields maps forces from all-atom molecular dynamics to the CG representation and matches them with a CG force field on average. We show that there is flexibility in how to map all-atom forces to the CG representation and that the most commonly used mapping methods are statistically inefficient and potentially even incorrect in the presence of constraints in the all-atom simulation. We define an optimization statement for force mappings and demonstrate that substantially improved CG force fields can be learned from the same simulation data when using optimized force maps. The method is demonstrated on the miniproteins chignolin and tryptophan cage and published as open-source code.

4.
Curr Opin Struct Biol ; 79: 102533, 2023 04.
Article in English | MEDLINE | ID: mdl-36731338

ABSTRACT

The successful recent application of machine learning methods to scientific problems includes the learning of flexible and accurate atomic-level force-fields for materials and biomolecules from quantum chemical data. In parallel, the machine learning of force-fields at coarser resolutions is rapidly gaining relevance as an efficient way to represent the higher-body interactions needed in coarse-grained force-fields to compensate for the omitted degrees of freedom. Coarse-grained models are important for the study of systems at time and length scales exceeding those of atomistic simulations. However, the development of transferable coarse-grained models via machine learning still presents significant challenges. Here, we discuss recent developments in this field and current efforts to address the remaining challenges.


Subject(s)
Machine Learning , Thermodynamics
5.
J Chem Theory Comput ; 19(14): 4402-4413, 2023 Jul 25.
Article in English | MEDLINE | ID: mdl-36802592

ABSTRACT

Coarse-grained (CG) models parametrized using atomistic reference data, i.e., "bottom up" CG models, have proven useful in the study of biomolecules and other soft matter. However, the construction of highly accurate, low resolution CG models of biomolecules remains challenging. We demonstrate in this work how virtual particles, CG sites with no atomistic correspondence, can be incorporated into CG models within the context of relative entropy minimization (REM) as latent variables. The methodology presented, variational derivative relative entropy minimization (VD-REM), enables optimization of virtual particle interactions through a gradient descent algorithm aided by machine learning. We apply this methodology to the challenging case of a solvent-free CG model of a 1,2-dioleoyl-sn-glycero-3-phosphocholine (DOPC) lipid bilayer and demonstrate that introduction of virtual particles captures solvent-mediated behavior and higher-order correlations which REM alone cannot capture in a more standard CG model based only on the mapping of collections of atoms to the CG sites.

6.
J Chem Theory Comput ; 18(10): 5759-5791, 2022 Oct 11.
Article in English | MEDLINE | ID: mdl-36070494

ABSTRACT

Large-scale computational molecular models provide scientists a means to investigate the effect of microscopic details on emergent mesoscopic behavior. Elucidating the relationship between variations on the molecular scale and macroscopic observable properties facilitates an understanding of the molecular interactions driving the properties of real world materials and complex systems (e.g., those found in biology, chemistry, and materials science). As a result, discovering an explicit, systematic connection between microscopic nature and emergent mesoscopic behavior is a fundamental goal for this type of investigation. The molecular forces critical to driving the behavior of complex heterogeneous systems are often unclear. More problematically, simulations of representative model systems are often prohibitively expensive from both spatial and temporal perspectives, impeding straightforward investigations over possible hypotheses characterizing molecular behavior. While the reduction in resolution of a study, such as moving from an atomistic simulation to that of the resolution of large coarse-grained (CG) groups of atoms, can partially ameliorate the cost of individual simulations, the relationship between the proposed microscopic details and this intermediate resolution is nontrivial and presents new obstacles to study. Small portions of these complex systems can be realistically simulated. Alone, these smaller simulations likely do not provide insight into collectively emergent behavior. However, by proposing that the driving forces in both smaller and larger systems (containing many related copies of the smaller system) have an explicit connection, systematic bottom-up CG techniques can be used to transfer CG hypotheses discovered using a smaller scale system to a larger system of primary interest. The proposed connection between different CG systems is prescribed by (i) the CG representation (mapping) and (ii) the functional form and parameters used to represent the CG energetics, which approximate potentials of mean force (PMFs). As a result, the design of CG methods that facilitate a variety of physically relevant representations, approximations, and force fields is critical to moving the frontier of systematic CG forward. Crucially, the proposed connection between the system used for parametrization and the system of interest is orthogonal to the optimization used to approximate the potential of mean force present in all systematic CG methods. The empirical efficacy of machine learning techniques on a variety of tasks provides strong motivation to consider these approaches for approximating the PMF and analyzing these approximations.


Subject(s)
Molecular Dynamics Simulation
7.
J Chem Phys ; 155(4): 045101, 2021 Jul 28.
Article in English | MEDLINE | ID: mdl-34340389

ABSTRACT

Understanding the permeation of biomolecules through cellular membranes is critical for many biotechnological applications, including targeted drug delivery, pathogen detection, and the development of new antibiotics. To this end, computer simulations are routinely used to probe the underlying mechanisms of membrane permeation. Despite great progress and continued development, permeation simulations of realistic systems (e.g., more complex drug molecules or biologics through heterogeneous membranes) remain extremely challenging if not intractable. In this work, we combine molecular dynamics simulations with transition-tempered metadynamics and techniques from the variational approach to conformational dynamics to study the permeation mechanism of a drug molecule, trimethoprim, through a multicomponent membrane. We show that collective variables (CVs) obtained from an unsupervised machine learning algorithm called time-structure based Independent Component Analysis (tICA) improve performance and substantially accelerate convergence of permeation potential of mean force (PMF) calculations. The addition of cholesterol to the lipid bilayer is shown to increase both the width and height of the free energy barrier due to a condensing effect (lower area per lipid) and increase bilayer thickness. Additionally, the tICA CVs reveal a subtle effect of cholesterol increasing the resistance to permeation in the lipid head group region, which is not observed when canonical CVs are used. We conclude that the use of tICA CVs can enable more efficient PMF calculations with additional insight into the permeation mechanism.


Subject(s)
Pharmacokinetics , Unsupervised Machine Learning , Algorithms , Cholesterol/chemistry , Lipid Bilayers/chemistry , Molecular Dynamics Simulation , Phosphatidylcholines/chemistry , Thermodynamics , Trimethoprim/chemistry
8.
J Chem Phys ; 151(13): 134115, 2019 Oct 07.
Article in English | MEDLINE | ID: mdl-31594316

ABSTRACT

Coarse-grained (CG) observable expressions, such as pressure or potential energy, are generally different than their fine-grained (FG, e.g., atomistic) counterparts. Recently, we analyzed this so-called "representability problem" in Wagner et al. [J. Chem. Phys. 145, 044108 (2016)]. While the issue of representability was clearly and mathematically stated in that work, it was not made clear how to actually determine CG observable expressions from the underlying FG systems that can only be simulated numerically. In this work, we propose minimization targets for the CG observables of such systems. These CG observables are compatible with each other and with structural observables. Also, these CG observables are systematically improvable since they are variationally minimized. Our methods are local and data efficient because we decompose the observable contributions. Hence, our approaches are called the multiscale compatible observable decomposition (MS-CODE) and the relative entropy compatible observable decomposition (RE-CODE), which reflect two main approaches to the "bottom-up" coarse-graining of real FG systems. The parameterization of these CG observable expressions requires the introduction of new, symmetric basis sets and one-body terms. We apply MS-CODE and RE-CODE to 1-site and 2-site CG models of methanol for the case of pressure, as well as to 1-site methanol and acetonitrile models for potential energy.

9.
J Chem Phys ; 151(12): 124110, 2019 Sep 28.
Article in English | MEDLINE | ID: mdl-31575201

ABSTRACT

We utilize connections between molecular coarse-graining (CG) approaches and implicit generative models in machine learning to describe a new framework for systematic molecular CG. Focus is placed on the formalism encompassing generative adversarial networks. The resulting method enables a variety of model parameterization strategies, some of which show similarity to previous CG methods. We demonstrate that the resulting framework can rigorously parameterize CG models containing CG sites with no prescribed connection to the reference atomistic system (termed virtual sites); however, this advantage is offset by the lack of a closed-form expression for the CG Hamiltonian at the resolution obtained after integration over the virtual CG sites. Computational examples are provided for cases in which these methods ideally return identical parameters as relative entropy minimization CG but where traditional relative entropy minimization CG optimization equations are not applicable.

10.
Biophys J ; 115(8): 1589-1602, 2018 10 16.
Article in English | MEDLINE | ID: mdl-30249402

ABSTRACT

Actin filaments continually assemble and disassemble within a cell. Assembled filaments "age" as a bound nucleotide ATP within each actin subunit quickly hydrolyzes followed by a slower release of the phosphate Pi, leaving behind a bound ADP. This subtle change in nucleotide state of actin subunits affects filament rigidity as well as its interactions with binding partners. We present here a systematic multiscale ultra-coarse-graining approach that provides a computationally efficient way to simulate a long actin filament undergoing ATP hydrolysis and phosphate-release reactions while systematically taking into account available atomistic details. The slower conformational changes and their dependence on the chemical reactions are simulated with the ultra-coarse-graining model by assigning internal states to the coarse-grained sites. Each state is represented by a unique potential surface of a local heterogeneous elastic network. Internal states undergo stochastic transitions that are coupled to conformations of the underlying molecular system. The model reproduces mechanical properties of the filament and allows us to study whether conformational fluctuations in actin subunits produce cooperative filament aging. We find that the nucleotide states of neighboring subunits modulate the reaction kinetics, implying cooperativity in ATP hydrolysis and Pi release. We further systematically coarse grain the system into a Markov state model that incorporates assembly and disassembly, facilitating a direct comparison with previously published models. We find that cooperativity in ATP hydrolysis and Pi release significantly affects the filament growth dynamics only near the critical G-actin concentration, whereas far from it, both cooperative and random mechanisms show similar growth dynamics. In contrast, filament composition in terms of the bound nucleotide distribution varies significantly at all monomer concentrations studied. These results provide new insights, to our knowledge, into the cooperative nature of ATP hydrolysis and Pi release and the implications it has for actin filament properties, providing novel predictions for future experimental studies.


Subject(s)
Actin Cytoskeleton/physiology , Actins/metabolism , Adenosine Diphosphate/metabolism , Adenosine Triphosphate/metabolism , Phosphates/metabolism , Humans , Hydrolysis , Kinetics
11.
Proc Natl Acad Sci U S A ; 114(47): E10056-E10065, 2017 11 21.
Article in English | MEDLINE | ID: mdl-29114055

ABSTRACT

The packaging and budding of Gag polyprotein and viral RNA is a critical step in the HIV-1 life cycle. High-resolution structures of the Gag polyprotein have revealed that the capsid (CA) and spacer peptide 1 (SP1) domains contain important interfaces for Gag self-assembly. However, the molecular details of the multimerization process, especially in the presence of RNA and the cell membrane, have remained unclear. In this work, we investigate the mechanisms that work in concert between the polyproteins, RNA, and membrane to promote immature lattice growth. We develop a coarse-grained (CG) computational model that is derived from subnanometer resolution structural data. Our simulations recapitulate contiguous and hexameric lattice assembly driven only by weak anisotropic attractions at the helical CA-SP1 junction. Importantly, analysis from CG and single-particle tracking photoactivated localization (spt-PALM) trajectories indicates that viral RNA and the membrane are critical constituents that actively promote Gag multimerization through scaffolding, while overexpression of short competitor RNA can suppress assembly. We also find that the CA amino-terminal domain imparts intrinsic curvature to the Gag lattice. As a consequence, immature lattice growth appears to be coupled to the dynamics of spontaneous membrane deformation. Our findings elucidate a simple network of interactions that regulate the early stages of HIV-1 assembly and budding.


Subject(s)
Cell Membrane/chemistry , Gene Products, gag/chemistry , HIV-1/physiology , RNA, Viral/chemistry , Virus Assembly/physiology , Virus Release/physiology , Binding Sites , Cell Membrane/metabolism , Gene Expression , Gene Products, gag/genetics , Gene Products, gag/metabolism , HEK293 Cells , Host-Pathogen Interactions , Humans , Kinetics , Molecular Dynamics Simulation , Protein Binding , Protein Interaction Domains and Motifs , Protein Multimerization , Protein Structure, Secondary , RNA, Viral/genetics , RNA, Viral/metabolism , Recombinant Proteins/chemistry , Recombinant Proteins/genetics , Recombinant Proteins/metabolism , Thermodynamics
12.
J Chem Phys ; 145(4): 044108, 2016 Jul 28.
Article in English | MEDLINE | ID: mdl-27475349

ABSTRACT

In coarse-grained (CG) models where certain fine-grained (FG, i.e., atomistic resolution) observables are not directly represented, one can nonetheless identify indirect the CG observables that capture the FG observable's dependence on CG coordinates. Often, in these cases it appears that a CG observable can be defined by analogy to an all-atom or FG observable, but the similarity is misleading and significantly undermines the interpretation of both bottom-up and top-down CG models. Such problems emerge especially clearly in the framework of the systematic bottom-up CG modeling, where a direct and transparent correspondence between FG and CG variables establishes precise conditions for consistency between CG observables and underlying FG models. Here we present and investigate these representability challenges and illustrate them via the bottom-up conceptual framework for several simple analytically tractable polymer models. The examples provide special focus on the observables of configurational internal energy, entropy, and pressure, which have been at the root of controversy in the CG literature, as well as discuss observables that would seem to be entirely missing in the CG representation but can nonetheless be correlated with CG behavior. Though we investigate these problems in the framework of systematic coarse-graining, the lessons apply to top-down CG modeling also, with crucial implications for simulation at constant pressure and surface tension and for the interpretations of structural and thermodynamic correlations for comparison to experiment.

SELECTION OF CITATIONS
SEARCH DETAIL
...