Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 10 de 10
Filter
Add more filters










Publication year range
1.
J Chem Phys ; 156(18): 184103, 2022 May 14.
Article in English | MEDLINE | ID: mdl-35568532

ABSTRACT

Finding a low dimensional representation of data from long-timescale trajectories of biomolecular processes, such as protein folding or ligand-receptor binding, is of fundamental importance, and kinetic models, such as Markov modeling, have proven useful in describing the kinetics of these systems. Recently, an unsupervised machine learning technique called VAMPNet was introduced to learn the low dimensional representation and the linear dynamical model in an end-to-end manner. VAMPNet is based on the variational approach for Markov processes and relies on neural networks to learn the coarse-grained dynamics. In this paper, we combine VAMPNet and graph neural networks to generate an end-to-end framework to efficiently learn high-level dynamics and metastable states from the long-timescale molecular dynamics trajectories. This method bears the advantages of graph representation learning and uses graph message passing operations to generate an embedding for each datapoint, which is used in the VAMPNet to generate a coarse-grained dynamical model. This type of molecular representation results in a higher resolution and a more interpretable Markov model than the standard VAMPNet, enabling a more detailed kinetic study of the biomolecular processes. Our GraphVAMPNet approach is also enhanced with an attention mechanism to find the important residues for classification into different metastable states.


Subject(s)
Neural Networks, Computer , Protein Folding , Kinetics , Markov Chains , Molecular Dynamics Simulation
2.
J Comput Aided Mol Des ; 36(4): 263-277, 2022 04.
Article in English | MEDLINE | ID: mdl-35597880

ABSTRACT

Accurately predicting free energy differences is essential in realizing the full potential of rational drug design. Unfortunately, high levels of accuracy often require computationally expensive QM/MM Hamiltonians. Fortuitously, the cost of employing QM/MM approaches in rigorous free energy simulation can be reduced through the use of the so-called "indirect" approach to QM/MM free energies, in which the need for QM/MM simulations is avoided via a QM/MM "correction" at the classical endpoints of interest. Herein, we focus on the computation of QM/MM binding free energies in the context of the SAMPL8 Drugs of Abuse host-guest challenge. Of the 5 QM/MM correction coupled with force-matching submissions, PM6-D3H4/MM ranked submission proved the best overall QM/MM entry, with an RMSE from experimental results of 2.43 kcal/mol (best in ranked submissions), a Pearson's correlation of 0.78 (second-best in ranked submissions), and a Kendall [Formula: see text] correlation of 0.52 (best in ranked submissions).


Subject(s)
Molecular Dynamics Simulation , Proteins , Ligands , Protein Binding , Quantum Theory , Thermodynamics
3.
J Chem Phys ; 155(19): 194108, 2021 Nov 21.
Article in English | MEDLINE | ID: mdl-34800961

ABSTRACT

Conformational sampling of biomolecules using molecular dynamics simulations often produces a large amount of high dimensional data that makes it difficult to interpret using conventional analysis techniques. Dimensionality reduction methods are thus required to extract useful and relevant information. Here, we devise a machine learning method, Gaussian mixture variational autoencoder (GMVAE), that can simultaneously perform dimensionality reduction and clustering of biomolecular conformations in an unsupervised way. We show that GMVAE can learn a reduced representation of the free energy landscape of protein folding with highly separated clusters that correspond to the metastable states during folding. Since GMVAE uses a mixture of Gaussians as its prior, it can directly acknowledge the multi-basin nature of the protein folding free energy landscape. To make the model end-to-end differentiable, we use a Gumbel-softmax distribution. We test the model on three long-timescale protein folding trajectories and show that GMVAE embedding resembles the folding funnel with folded states down the funnel and unfolded states outside the funnel path. Additionally, we show that the latent space of GMVAE can be used for kinetic analysis and Markov state models built on this embedding produce folding and unfolding timescales that are in close agreement with other rigorous dynamical embeddings such as time independent component analysis.


Subject(s)
Cluster Analysis , Molecular Dynamics Simulation , Protein Folding , Kinetics , Markov Chains , Thermodynamics
4.
J Comput Chem ; 42(19): 1373-1383, 2021 07 15.
Article in English | MEDLINE | ID: mdl-33977553

ABSTRACT

The Eighth-Shell method for parallelization of molecular dynamics simulations has previously been shown to be the most optimal for efficiency at large process counts. However, in its current formulation only the P1 space group is supported for periodic boundary conditions (PBC) and thus reflection and/or rotational crystal symmetries are not supported. In this work, we outline the development and implementation of the Extended Eighth-Shell (EES) method that allows rotational symmetry by using an extended import region compared to the ES method. It simulates only the asymmetric unit and communicates coordinates and forces with images that correspond to P21 PBC. The P21 PBC has application in lipid bilayer simulations as it can be used to allow lipids to switch leaftlets, thus rapidly balancing the chemical potential difference between the two layers. Our results show that the EES method scales efficiently over large number of processes and can be used for simulations with P21 symmetry in an orthorhombic crystal.


Subject(s)
Lipids/chemistry , Molecular Dynamics Simulation , Lipid Bilayers/chemistry , Rotation
5.
J Comput Aided Mol Des ; 34(5): 535-542, 2020 05.
Article in English | MEDLINE | ID: mdl-32002779

ABSTRACT

Water octanol partition coefficient serves as a measure for the lipophilicity of a molecule and is important in the field of drug discovery. A novel method for computational prediction of logarithm of partition coefficient (logP) has been developed using molecular fingerprints and a deep neural network. The machine learning model was trained on a dataset of 12,000 molecules and tested on 2000 molecules. In this article, we present our results for the blind prediction of logP for the SAMPL6 challenge. While the best submission achieved a RMSE of 0.41 logP units, our submission had a RMSE of 0.61 logP units. Overall, we ranked in the top quarter out of the 92 submissions that were made. Our results show that the deep learning model can be used as a fast, accurate and robust method for high throughput prediction of logP of small molecules.


Subject(s)
Deep Learning , Octanols/chemistry , Thermodynamics , Water/chemistry , Drug Discovery , Machine Learning , Models, Chemical , Molecular Structure , Solubility
6.
Article in English | MEDLINE | ID: mdl-34458687

ABSTRACT

Alchemical free energy calculations are a useful tool for predicting free energy differences associated with the transfer of molecules from one environment to another. The hallmark of these methods is the use of "bridging" potential energy functions representing alchemical intermediate states that cannot exist as real chemical species. The data collected from these bridging alchemical thermodynamic states allows the efficient computation of transfer free energies (or differences in transfer free energies) with orders of magnitude less simulation time than simulating the transfer process directly. While these methods are highly flexible, care must be taken in avoiding common pitfalls to ensure that computed free energy differences can be robust and reproducible for the chosen force field, and that appropriate corrections are included to permit direct comparison with experimental data. In this paper, we review current best practices for several popular application domains of alchemical free energy calculations performed with equilibrium simulations, in particular relative and absolute small molecule binding free energy calculations to biomolecular targets.

7.
Article in English | MEDLINE | ID: mdl-31788666

ABSTRACT

This document provides a starting point for approaching molecular simulations, guiding beginning practitioners to what issues they need to know about before and while starting their first simulations, and why those issues are so critical. This document makes no claims to provide an adequate introduction to the subject on its own. Instead, our goal is to help people know what issues are critical before beginning, and to provide references to good resources on those topics. We also provide a checklist of key issues to consider before and while setting up molecular simulations which may serve as a foundation for other best practices documents.

8.
J Comput Aided Mol Des ; 32(10): 1191-1201, 2018 10.
Article in English | MEDLINE | ID: mdl-30276503

ABSTRACT

In this work we have developed a hybrid QM and MM approach to predict pKa of small drug-like molecules in explicit solvent. The gas phase free energy of deprotonation is calculated using the M06-2X density functional theory level with Pople basis sets. The solvation free energy difference of the acid and its conjugate base is calculated at MD level using thermodynamic integration. We applied this method to the 24 drug-like molecules in the SAMPL6 blind pKa prediction challenge. We achieved an overall RMSE of 2.4 pKa units in our prediction. Our results show that further optimization of the protocol needs to be done before this method can be used as an alternative approach to the well established approaches of a full quantum level or empirical pKa prediction methods.


Subject(s)
Heterocyclic Compounds/chemistry , Models, Chemical , Solvents/chemistry , Density Functional Theory , Hydrogen-Ion Concentration , Molecular Structure , Thermodynamics , Water/chemistry
9.
J Comput Aided Mol Des ; 30(11): 989-1006, 2016 11.
Article in English | MEDLINE | ID: mdl-27577746

ABSTRACT

One of the central aspects of biomolecular recognition is the hydrophobic effect, which is experimentally evaluated by measuring the distribution coefficients of compounds between polar and apolar phases. We use our predictions of the distribution coefficients between water and cyclohexane from the SAMPL5 challenge to estimate the hydrophobicity of different explicit solvent simulation techniques. Based on molecular dynamics trajectories with the CHARMM General Force Field, we compare pure molecular mechanics (MM) with quantum-mechanical (QM) calculations based on QM/MM schemes that treat the solvent at the MM level. We perform QM/MM with both density functional theory (BLYP) and semi-empirical methods (OM1, OM2, OM3, PM3). The calculations also serve to test the sensitivity of partition coefficients to solute polarizability as well as the interplay of the quantum-mechanical region with the fixed-charge molecular mechanics environment. Our results indicate that QM/MM with both BLYP and OM2 outperforms pure MM. However, this observation is limited to a subset of cases where convergence of the free energy can be achieved.


Subject(s)
Computer Simulation , Cyclohexanes/chemistry , Pharmaceutical Preparations/chemistry , Solvents/chemistry , Water/chemistry , Models, Chemical , Molecular Structure , Quantum Theory , Solubility , Thermodynamics
10.
Nature ; 509(7502): 575-81, 2014 May 29.
Article in English | MEDLINE | ID: mdl-24870542

ABSTRACT

The availability of human genome sequence has transformed biomedical research over the past decade. However, an equivalent map for the human proteome with direct measurements of proteins and peptides does not exist yet. Here we present a draft map of the human proteome using high-resolution Fourier-transform mass spectrometry. In-depth proteomic profiling of 30 histologically normal human samples, including 17 adult tissues, 7 fetal tissues and 6 purified primary haematopoietic cells, resulted in identification of proteins encoded by 17,294 genes accounting for approximately 84% of the total annotated protein-coding genes in humans. A unique and comprehensive strategy for proteogenomic analysis enabled us to discover a number of novel protein-coding regions, which includes translated pseudogenes, non-coding RNAs and upstream open reading frames. This large human proteome catalogue (available as an interactive web-based resource at http://www.humanproteomemap.org) will complement available human genome and transcriptome data to accelerate biomedical research in health and disease.


Subject(s)
Proteome/metabolism , Proteomics , Adult , Cells, Cultured , Databases, Protein , Fetus/metabolism , Fourier Analysis , Gene Expression Profiling , Genome, Human/genetics , Hematopoietic Stem Cells/cytology , Hematopoietic Stem Cells/metabolism , Humans , Internet , Mass Spectrometry , Molecular Sequence Annotation , Open Reading Frames/genetics , Organ Specificity , Protein Biosynthesis , Protein Isoforms/analysis , Protein Isoforms/genetics , Protein Isoforms/metabolism , Protein Sorting Signals , Protein Transport , Proteome/analysis , Proteome/chemistry , Proteome/genetics , Pseudogenes/genetics , RNA, Untranslated/genetics , Reproducibility of Results , Untranslated Regions/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...