Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 16 de 16
Filter
Add more filters










Publication year range
1.
J Chem Inf Model ; 64(9): 3812-3825, 2024 May 13.
Article in English | MEDLINE | ID: mdl-38651738

ABSTRACT

In the realm of medicinal chemistry, the primary objective is to swiftly optimize a multitude of chemical properties of a set of compounds to yield a clinical candidate poised for clinical trials. In recent years, two computational techniques, machine learning (ML) and physics-based methods, have evolved substantially and are now frequently incorporated into the medicinal chemist's toolbox to enhance the efficiency of both hit optimization and candidate design. Both computational methods come with their own set of limitations, and they are often used independently of each other. ML's capability to screen extensive compound libraries expediently is tempered by its reliance on quality data, which can be scarce especially during early-stage optimization. Contrarily, physics-based approaches like free energy perturbation (FEP) are frequently constrained by low throughput and high cost by comparison; however, physics-based methods are capable of making highly accurate binding affinity predictions. In this study, we harnessed the strength of FEP to overcome data paucity in ML by generating virtual activity data sets which then inform the training of algorithms. Here, we show that ML algorithms trained with an FEP-augmented data set could achieve comparable predictive accuracy to data sets trained on experimental data from biological assays. Throughout the paper, we emphasize key mechanistic considerations that must be taken into account when aiming to augment data sets and lay the groundwork for successful implementation. Ultimately, the study advocates for the synergy of physics-based methods and ML to expedite the lead optimization process. We believe that the physics-based augmentation of ML will significantly benefit drug discovery, as these techniques continue to evolve.


Subject(s)
Machine Learning , Thermodynamics , Drug Discovery/methods , Algorithms , Humans
2.
Environ Health Perspect ; 128(2): 27002, 2020 02.
Article in English | MEDLINE | ID: mdl-32074470

ABSTRACT

BACKGROUND: Endocrine disrupting chemicals (EDCs) are xenobiotics that mimic the interaction of natural hormones and alter synthesis, transport, or metabolic pathways. The prospect of EDCs causing adverse health effects in humans and wildlife has led to the development of scientific and regulatory approaches for evaluating bioactivity. This need is being addressed using high-throughput screening (HTS) in vitro approaches and computational modeling. OBJECTIVES: In support of the Endocrine Disruptor Screening Program, the U.S. Environmental Protection Agency (EPA) led two worldwide consortiums to virtually screen chemicals for their potential estrogenic and androgenic activities. Here, we describe the Collaborative Modeling Project for Androgen Receptor Activity (CoMPARA) efforts, which follows the steps of the Collaborative Estrogen Receptor Activity Prediction Project (CERAPP). METHODS: The CoMPARA list of screened chemicals built on CERAPP's list of 32,464 chemicals to include additional chemicals of interest, as well as simulated ToxCast™ metabolites, totaling 55,450 chemical structures. Computational toxicology scientists from 25 international groups contributed 91 predictive models for binding, agonist, and antagonist activity predictions. Models were underpinned by a common training set of 1,746 chemicals compiled from a combined data set of 11 ToxCast™/Tox21 HTS in vitro assays. RESULTS: The resulting models were evaluated using curated literature data extracted from different sources. To overcome the limitations of single-model approaches, CoMPARA predictions were combined into consensus models that provided averaged predictive accuracy of approximately 80% for the evaluation set. DISCUSSION: The strengths and limitations of the consensus predictions were discussed with example chemicals; then, the models were implemented into the free and open-source OPERA application to enable screening of new chemicals with a defined applicability domain and accuracy assessment. This implementation was used to screen the entire EPA DSSTox database of ∼875,000 chemicals, and their predicted AR activities have been made available on the EPA CompTox Chemicals dashboard and National Toxicology Program's Integrated Chemical Environment. https://doi.org/10.1289/EHP5580.


Subject(s)
Computer Simulation , Endocrine Disruptors , Androgens , Databases, Factual , High-Throughput Screening Assays , Humans , Receptors, Androgen , United States , United States Environmental Protection Agency
3.
Anal Bioanal Chem ; 412(6): 1303-1315, 2020 Feb.
Article in English | MEDLINE | ID: mdl-31965249

ABSTRACT

High-resolution mass spectrometry (HRMS) enables rapid chemical annotation via accurate mass measurements and matching of experimentally derived spectra with reference spectra. Reference libraries are generated from chemical standards and are therefore limited in size relative to known chemical space. To address this limitation, in silico spectra (i.e., MS/MS or MS2 spectra), predicted via Competitive Fragmentation Modeling-ID (CFM-ID) algorithms, were generated for compounds within the U.S. Environmental Protection Agency's (EPA) Distributed Structure-Searchable Toxicity (DSSTox) database (totaling, at the time of analysis, ~ 765,000 substances). Experimental spectra from EPA's Non-Targeted Analysis Collaborative Trial (ENTACT) mixtures (n = 10) were then used to evaluate the performance of the in silico spectra. Overall, MS2 spectra were acquired for 377 unique compounds from the ENTACT mixtures. Approximately 53% of these compounds were correctly identified using a commercial reference library, whereas up to 50% were correctly identified as the top hit using the in silico library. Together, the reference and in silico libraries were able to correctly identify 73% of the 377 ENTACT substances. When using the in silico spectra for candidate filtering, an examination of binary classifiers showed a true positive rate (TPR) of 0.90 associated with false positive rates (FPRs) of 0.10 to 0.85, depending on the sample and method of candidate filtering. Taken together, these findings show the abilities of in silico spectra to correctly identify true positives in complex samples (at rates comparable to those observed with reference spectra), and efficiently filter large numbers of potential false positives from further consideration. Graphical abstract.

4.
Sci Data ; 6(1): 141, 2019 08 02.
Article in English | MEDLINE | ID: mdl-31375670

ABSTRACT

Confident identification of unknown chemicals in high resolution mass spectrometry (HRMS) screening studies requires cohesive workflows and complementary data, tools, and software. Chemistry databases, screening libraries, and chemical metadata have become fixtures in identification workflows. To increase confidence in compound identifications, the use of structural fragmentation data collected via tandem mass spectrometry (MS/MS or MS2) is vital. However, the availability of empirically collected MS/MS data for identification of unknowns is limited. Researchers have therefore turned to in silico generation of MS/MS data for use in HRMS-based screening studies. This paper describes the generation en masse of predicted MS/MS spectra for the entirety of the US EPA's DSSTox database using competitive fragmentation modelling and a freely available open source tool, CFM-ID. The generated dataset comprises predicted MS/MS spectra for ~700,000 structures, and mappings between predicted spectra, structures, associated substances, and chemical metadata. Together, these resources facilitate improved compound identifications in HRMS screening studies. These data are accessible via an SQL database, a comma-separated export file (.csv), and EPA's CompTox Chemicals Dashboard.

5.
J Cheminform ; 10(1): 47, 2018 Sep 18.
Article in English | MEDLINE | ID: mdl-30229396

ABSTRACT

BACKGROUND: Quantitative structure-activity relationship (QSAR) models are important tools used in discovering new drug candidates and identifying potentially harmful environmental chemicals. These models often face two fundamental challenges: limited amount of available biological activity data and noise or uncertainty in the activity data themselves. To address these challenges, we introduce and explore a QSAR model based on custom distance metrics in the structure-activity space. METHODS: The model is built on top of the k-nearest neighbor model, incorporating non-linearity not only in the chemical structure space, but also in the biological activity space. The model is tuned and evaluated using activity data for human estrogen receptor from the US EPA ToxCast and Tox21 databases. RESULTS: The model closely trails the CERAPP consensus model (built on top of 48 individual human estrogen receptor activity models) in agonist activity predictions and consistently outperforms the CERAPP consensus model in antagonist activity predictions. DISCUSSION: We suggest that incorporating non-linear distance metrics may significantly improve QSAR model performance when the available biological activity data are limited.

6.
Environ Health Perspect ; 124(7): 1023-33, 2016 07.
Article in English | MEDLINE | ID: mdl-26908244

ABSTRACT

BACKGROUND: Humans are exposed to thousands of man-made chemicals in the environment. Some chemicals mimic natural endocrine hormones and, thus, have the potential to be endocrine disruptors. Most of these chemicals have never been tested for their ability to interact with the estrogen receptor (ER). Risk assessors need tools to prioritize chemicals for evaluation in costly in vivo tests, for instance, within the U.S. EPA Endocrine Disruptor Screening Program. OBJECTIVES: We describe a large-scale modeling project called CERAPP (Collaborative Estrogen Receptor Activity Prediction Project) and demonstrate the efficacy of using predictive computational models trained on high-throughput screening data to evaluate thousands of chemicals for ER-related activity and prioritize them for further testing. METHODS: CERAPP combined multiple models developed in collaboration with 17 groups in the United States and Europe to predict ER activity of a common set of 32,464 chemical structures. Quantitative structure-activity relationship models and docking approaches were employed, mostly using a common training set of 1,677 chemical structures provided by the U.S. EPA, to build a total of 40 categorical and 8 continuous models for binding, agonist, and antagonist ER activity. All predictions were evaluated on a set of 7,522 chemicals curated from the literature. To overcome the limitations of single models, a consensus was built by weighting models on scores based on their evaluated accuracies. RESULTS: Individual model scores ranged from 0.69 to 0.85, showing high prediction reliabilities. Out of the 32,464 chemicals, the consensus model predicted 4,001 chemicals (12.3%) as high priority actives and 6,742 potential actives (20.8%) to be considered for further testing. CONCLUSION: This project demonstrated the possibility to screen large libraries of chemicals using a consensus of different in silico approaches. This concept will be applied in future projects related to other end points. CITATION: Mansouri K, Abdelaziz A, Rybacka A, Roncaglioni A, Tropsha A, Varnek A, Zakharov A, Worth A, Richard AM, Grulke CM, Trisciuzzi D, Fourches D, Horvath D, Benfenati E, Muratov E, Wedebye EB, Grisoni F, Mangiatordi GF, Incisivo GM, Hong H, Ng HW, Tetko IV, Balabin I, Kancherla J, Shen J, Burton J, Nicklaus M, Cassotti M, Nikolov NG, Nicolotti O, Andersson PL, Zang Q, Politi R, Beger RD, Todeschini R, Huang R, Farag S, Rosenberg SA, Slavov S, Hu X, Judson RS. 2016. CERAPP: Collaborative Estrogen Receptor Activity Prediction Project. Environ Health Perspect 124:1023-1033; http://dx.doi.org/10.1289/ehp.1510267.


Subject(s)
Endocrine Disruptors/toxicity , Receptors, Estrogen/metabolism , Toxicity Tests , Computer Simulation , Endocrine Disruptors/classification , Environmental Policy , Quantitative Structure-Activity Relationship , United States
7.
J Comput Chem ; 33(8): 906-10, 2012 Mar 30.
Article in English | MEDLINE | ID: mdl-22298319

ABSTRACT

We describe the new Pathways plugin for the molecular visualization program visual molecular dynamics. The plugin identifies and visualizes tunneling pathways and pathway families in biomolecules, and calculates relative electronic couplings. The plugin includes unique features to estimate the importance of individual atoms for mediating the coupling, to analyze the coupling sensitivity to thermal motion, and to visualize pathway fluctuations. The Pathways plugin is open source software distributed under the terms of the GNU's Not Unix (GNU) public license.


Subject(s)
Azurin/chemistry , Bacterial Proteins/chemistry , Pseudomonas aeruginosa/chemistry , Software , Computer Simulation , Electron Transport , Models, Molecular , Signal Transduction
8.
Acc Chem Res ; 42(10): 1669-78, 2009 Oct 20.
Article in English | MEDLINE | ID: mdl-19645446

ABSTRACT

Electron transfer (ET) reactions provide a nexus among chemistry, biochemistry, and physics. These reactions underpin the "power plants" and "power grids" of bioenergetics, and they challenge us to understand how evolution manipulates structure to control ET kinetics. Ball-and-stick models for the machinery of electron transfer, however, fail to capture the rich electronic and nuclear dynamics of ET molecules: these static representations disguise, for example, the range of thermally accessible molecular conformations. The influence of structural fluctuations on electron-transfer kinetics is amplified by the exponential decay of electron tunneling probabilities with distance, as well as the delicate interference among coupling pathways. Fluctuations in the surrounding medium can also switch transport between coherent and incoherent ET mechanisms--and may gate ET so that its kinetics is limited by conformational interconversion times, rather than by the intrinsic ET time scale. Moreover, preparation of a charge-polarized donor state or of a donor state with linear or angular momentum can have profound dynamical and kinetic consequences. In this Account, we establish a vocabulary to describe how the conformational ensemble and the prepared donor state influence ET kinetics in macromolecules. This framework is helping to unravel the richness of functional biological ET pathways, which have evolved within fluctuating macromolecular structures. The conceptual framework for describing nonadiabatic ET seems disarmingly simple: compute the ensemble-averaged (mean-squared) donor-acceptor (DA) tunneling interaction, , and the Franck-Condon weighted density of states, rho(FC), to describe the rate, (2pi/variant Planck's over 2pi)rho(FC). Modern descriptions of the thermally averaged electronic coupling and of the Franck-Condon factor establish a useful predictive framework in biology, chemistry, and nanoscience. Describing the influence of geometric and energetic fluctuations on ET allows us to address a rich array of mechanistic and kinetic puzzles. How strongly is a protein's fold imprinted on the ET kinetics, and might thermal fluctuations "wash out" signatures of structure? What is the influence of thermal fluctuations on ET kinetics beyond averaging of the tunneling barrier structure? Do electronic coupling mechanisms change as donor and acceptor reposition in a protein, and what are the consequences for the ET kinetics? Do fluctuations access minority species that dominate tunneling? Can energy exchanges between the electron and bridge vibrations generate vibronic signatures that label some of the D-to-A pathways traversed by the electron, thus eliminating unmarked pathways that would otherwise contribute to the DA coupling (as in other "which way" or double-slit experiments)? Might medium fluctuations drive tunneling-hopping mechanistic transitions? How does the donor-state preparation, in particular, its polarization toward the acceptor and its momentum characteristics (which may introduce complex rather than pure real relationships among donor orbital amplitudes), influence the electronic dynamics? In this Account, we describe our recent studies that address puzzling questions of how conformational distributions, excited-state polarization, and electronic and nuclear dynamical effects influence ET in macromolecules. Indeed, conformational and dynamical effects arise in all transport regimes, including the tunneling, resonant transport, and hopping regimes. Importantly, these effects can induce switching among ET mechanisms.


Subject(s)
Electron Transport , Kinetics , Macromolecular Substances/chemistry , Macromolecular Substances/metabolism , Nucleic Acids/chemistry , Nucleic Acids/metabolism , Temperature , Water/chemistry , Water/metabolism
9.
Proc Natl Acad Sci U S A ; 106(34): 14253-8, 2009 Aug 25.
Article in English | MEDLINE | ID: mdl-19706508

ABSTRACT

Allosteric regulation provides highly specific ligand recognition and signaling by transmembrane protein receptors. Unlike functions of protein molecular machines that rely on large-scale conformational transitions, signal transduction in receptors appears to be mediated by more subtle structural motions that are difficult to identify. We describe a theoretical model for allosteric regulation in receptors that addresses a fundamental riddle of signaling: What are the structural origins of the receptor agonism (specific signaling response to ligand binding)? The model suggests that different signaling pathways in bovine rhodopsin or human beta(2)-adrenergic receptor can be mediated by specific structural motions in the receptors. We discuss implications for understanding the receptor agonism, particularly the recently observed "biased agonism" (selected activation of specific signaling pathways), and for developing rational structure-based drug-design strategies.


Subject(s)
Models, Theoretical , Receptors, Adrenergic, beta-2/metabolism , Rhodopsin/metabolism , Adrenergic beta-2 Receptor Agonists , Adrenergic beta-Agonists/chemistry , Adrenergic beta-Agonists/metabolism , Adrenergic beta-Agonists/pharmacology , Algorithms , Allosteric Regulation , Animals , Binding Sites , Cattle , Humans , Ligands , Models, Molecular , Protein Conformation , Protein Structure, Secondary , Protein Structure, Tertiary , Receptors, Adrenergic, beta-2/chemistry , Rhodopsin/agonists , Rhodopsin/chemistry , Signal Transduction
10.
Phys Rev Lett ; 101(15): 158102, 2008 Oct 10.
Article in English | MEDLINE | ID: mdl-18999647

ABSTRACT

In the soft-wet environment of biomolecular electron transfer, it is possible that structural fluctuations could wash out medium-specific electronic effects on electron tunneling rates. We show that beyond a transition distance (2-3 A in water and 6-7 A in proteins), fluctuation contributions to the mean-squared donor-to-acceptor tunneling matrix element are likely to dominate over the average matrix element. Even though fluctuations dominate the tunneling mechanism at larger distances, we find that the protein fold is "remembered" by the electronic coupling, and structure remains a key determinant of electron transfer kinetics.


Subject(s)
Models, Biological , Models, Chemical , Proteins/chemistry , Azurin/chemistry , Azurin/metabolism , Cytochrome b Group/chemistry , Cytochrome b Group/metabolism , Escherichia coli Proteins/chemistry , Escherichia coli Proteins/metabolism , Models, Molecular , Myoglobin/chemistry , Myoglobin/metabolism , Protein Structure, Secondary , Proteins/metabolism , Thermodynamics
12.
Science ; 310(5752): 1311-3, 2005 Nov 25.
Article in English | MEDLINE | ID: mdl-16311331

ABSTRACT

Structured water molecules near redox cofactors were found recently to accelerate electron-transfer (ET) kinetics in several systems. Theoretical study of interprotein electron transfer across an aqueous interface reveals three distinctive electronic coupling mechanisms that we describe here: (i) a protein-mediated regime when the two proteins are in van der Waals contact; (ii) a structured water-mediated regime featuring anomalously weak distance decay at relatively close protein-protein contact distances; and (iii) a bulk water-mediated regime at large distances. Our analysis explains a range of otherwise puzzling biological ET kinetic data and provides a framework for including explicit water-mediated tunneling effects on ET kinetics.


Subject(s)
Cytochromes b5/metabolism , Electron Transport , Water/chemistry , Animals , Cattle , Chemical Phenomena , Chemistry, Physical , Cytochromes b5/chemistry , Kinetics , Models, Chemical , Porphyrins/chemistry , Protein Conformation , Thermodynamics
13.
Proc Natl Acad Sci U S A ; 102(10): 3552-7, 2005 Mar 08.
Article in English | MEDLINE | ID: mdl-15738409

ABSTRACT

We compute the autocorrelation function of the donor-acceptor tunneling matrix element for six Ru-azurin derivatives. Comparison of this decay time to the decay time of the time-dependent Franck-Condon factor {computed by Rossky and coworkers [Lockwood, D. M., Cheng, Y.-K. & Rossky, P. J. (2001) Chem. Phys. Lett. 345, 159-165]} reveals the extent to which non-Condon effects influence the electron-transfer rate. is studied as a function of donor-acceptor distance, tunneling pathway structure, tunneling energy, and temperature to explore the structural and dynamical origins of non-Condon effects. For azurin, the correlation function is remarkably insensitive to tunneling pathway structure. The decay time is only slightly shorter than it is for solvent-mediated electron transfer in small organic molecules and originates, largely, from fluctuations of valence angles rather than bond lengths.


Subject(s)
Azurin/chemistry , Electron Transport , Models, Molecular
14.
Biophys J ; 86(3): 1332-44, 2004 Mar.
Article in English | MEDLINE | ID: mdl-14990464

ABSTRACT

F(1)F(o)-ATP synthase is a ubiquitous membrane protein complex that efficiently converts a cell's transmembrane proton gradient into chemical energy stored as ATP. The protein is made of two molecular motors, F(o) and F(1), which are coupled by a central stalk. The membrane unit, F(o), converts the transmembrane electrochemical potential into mechanical rotation of a rotor in F(o) and the physically connected central stalk. Based on available data of individual components, we have built an all-atom model of F(o) and investigated through molecular dynamics simulations and mathematical modeling the mechanism of torque generation in F(o). The mechanism that emerged generates the torque at the interface of the a- and c-subunits of F(o) through side groups aSer-206, aArg-210, and aAsn-214 of the a-subunit and side groups cAsp-61 of the c-subunits. The mechanism couples protonation/deprotonation of two cAsp-61 side groups, juxtaposed to the a-subunit at any moment in time, to rotations of individual c-subunit helices as well as rotation of the entire c-subunit. The aArg-210 side group orients the cAsp-61 side groups and, thereby, establishes proton transfer via aSer-206 and aAsn-214 to proton half-channels, while preventing direct proton transfer between the half-channels. A mathematical model proves the feasibility of torque generation by the stated mechanism against loads typical during ATP synthesis; the essential model characteristics, e.g., helix and subunit rotation and associated friction constants, have been tested and furnished by steered molecular dynamics simulations.


Subject(s)
Cell Membrane/chemistry , Models, Chemical , Models, Molecular , Molecular Motor Proteins/chemistry , Proton-Translocating ATPases/chemistry , Binding Sites , Computer Simulation , Dimerization , Models, Statistical , Protein Binding , Protein Conformation , Protein Subunits , Rotation , Stochastic Processes
15.
Biophys J ; 86(3): 1813-9, 2004 Mar.
Article in English | MEDLINE | ID: mdl-14990507

ABSTRACT

Cytochrome c oxidase mediates the final step of electron transfer reactions in the respiratory chain, catalyzing the transfer between cytochrome c and the molecular oxygen and concomitantly pumping protons across the inner mitochondrial membrane. We investigate the electron transfer reactions in cytochrome c oxidase, particularly the control of the effective electronic coupling by the nuclear thermal motion. The effective coupling is calculated using the Green's function technique with an extended Huckel level electronic Hamiltonian, combined with all-atom molecular dynamics of the protein in a native (membrane and solvent) environment. The effective coupling between Cu(A) and heme a is found to be dominated by the pathway that starts from His(B204). The coupling between heme a and heme a(3) is dominated by a through-space jump between the two heme rings rather than by covalent pathways. In the both steps, the effective electronic coupling is robust to the thermal nuclear vibrations, thereby providing fast and efficient electron transfer.


Subject(s)
Copper/chemistry , Electron Transport Complex IV/chemistry , Energy Transfer , Heme/chemistry , Models, Chemical , Models, Molecular , Computer Simulation , Electron Transport , Enzyme Activation , Kinetics , Oxidation-Reduction
SELECTION OF CITATIONS
SEARCH DETAIL
...