Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 30
Filter
1.
SAR QSAR Environ Res ; 32(3): 207-219, 2021 Mar.
Article in English | MEDLINE | ID: mdl-33601989

ABSTRACT

In this article, we consider cross-validation of the quantitative structure-property relationship models for reactions and show that the conventional k-fold cross-validation (CV) procedure gives an 'optimistically' biased assessment of prediction performance. To address this issue, we suggest two strategies of model cross-validation, 'transformation-out' CV, and 'solvent-out' CV. Unlike the conventional k-fold cross-validation approach that does not consider the nature of objects, the proposed procedures provide an unbiased estimation of the predictive performance of the models for novel types of structural transformations in chemical reactions and reactions going under new conditions. Both the suggested strategies have been applied to predict the rate constants of bimolecular elimination and nucleophilic substitution reactions, and Diels-Alder cycloaddition. All suggested cross-validation methodologies and tutorial are implemented in the open-source software package CIMtools (https://github.com/cimm-kzn/CIMtools).


Subject(s)
Models, Chemical , Quantitative Structure-Activity Relationship , Software , Validation Studies as Topic
2.
SAR QSAR Environ Res ; 31(8): 597-613, 2020 Aug.
Article in English | MEDLINE | ID: mdl-32646236

ABSTRACT

Here we report a new predictive model for autoignition temperature (AIT), an important physical parameter widely used to assess potential safety hazards of combustible materials. Available structure-AIT data extracted from different sources were critically analysed. Support vector regression (SVR) models on different data subsets were built in order to identify a reliable compound set on which a realistic model could be built. This led to a selection of the dataset containing 875 compounds annotated with AIT values. The thereupon-based SVR model performs reasonably well in cross-validation with the determination coefficient r 2 = 0.77 and mean absolute error MAE = 37.8°C. External validation on 20 industrial compounds missing in the training set confirmed its good predictive power (MAE = 28.7°C).


Subject(s)
Fires , Quantitative Structure-Activity Relationship , Temperature , Chemical Phenomena , Data Analysis , Models, Chemical
3.
J Comput Aided Mol Des ; 32(3): 401-414, 2018 03.
Article in English | MEDLINE | ID: mdl-29380104

ABSTRACT

We report the first direct QSPR modeling of equilibrium constants of tautomeric transformations (logK T ) in different solvents and at different temperatures, which do not require intermediate assessment of acidity (basicity) constants for all tautomeric forms. The key step of the modeling consisted in the merging of two tautomers in one sole molecular graph ("condensed reaction graph") which enables to compute molecular descriptors characterizing entire equilibrium. The support vector regression method was used to build the models. The training set consisted of 785 transformations belonging to 11 types of tautomeric reactions with equilibrium constants measured in different solvents and at different temperatures. The models obtained perform well both in cross-validation (Q2 = 0.81 RMSE = 0.7 logK T units) and on two external test sets. Benchmarking studies demonstrate that our models outperform results obtained with DFT B3LYP/6-311 ++ G(d,p) and ChemAxon Tautomerizer applicable only in water at room temperature.


Subject(s)
Computer Simulation , Solvents/chemistry , Temperature , Isomerism , Molecular Structure , Quantitative Structure-Activity Relationship , Thermodynamics , Water/chemistry
4.
Mol Inform ; 34(6-7): 348-56, 2015 06.
Article in English | MEDLINE | ID: mdl-27490381

ABSTRACT

In this paper we demonstrate that Generative Topographic Mapping (GTM), a machine learning method traditionally used for data visualisation, can be efficiently applied to QSAR modelling using probability distribution functions (PDF) computed in the latent 2-dimensional space. Several different scenarios of the activity assessment were considered: (i) the "activity landscape" approach based on direct use of PDF, (ii) QSAR models involving GTM-generated on descriptors derived from PDF, and, (iii) the k-Nearest Neighbours approach in 2D latent space. Benchmarking calculations were performed on five different datasets: stability constants of metal cations Ca(2+) , Gd(3+) and Lu(3+) complexes with organic ligands in water, aqueous solubility and activity of thrombin inhibitors. It has been shown that the performance of GTM-based regression models is similar to that obtained with some popular machine-learning methods (random forest, k-NN, M5P regression tree and PLS) and ISIDA fragment descriptors. By comparing GTM activity landscapes built both on predicted and experimental activities, we may visually assess the model's performance and identify the areas in the chemical space corresponding to reliable predictions. The applicability domain used in this work is based on data likelihood. Its application has significantly improved the model performances for 4 out of 5 datasets.


Subject(s)
Calcium/chemistry , Gadolinium/chemistry , Lutetium/chemistry , Machine Learning , Models, Chemical , Thrombin/chemistry , Databases, Chemical , Humans
5.
SAR QSAR Environ Res ; 23(7-8): 627-47, 2012 Oct.
Article in English | MEDLINE | ID: mdl-22587543

ABSTRACT

Oxime reactivation of serine esterases (EOHs) inhibited by organophosphorus (OP) compounds can produce O-phosphorylated oximes (POXs). Such oxime derivatives are of interest, because some of them can have greater anti-EOH potencies than the OP inhibitors from which they were derived. Accordingly, inhibitor properties of 58 POXs against four EOHs, along with pair-wise selectivities between them, have been analysed using different QSAR approaches. EOHs (with their abbreviations and consequences of inhibition in parentheses) comprised acetylcholinesterase (AChE: acute neurotoxicity; cognition enhancement), butyrylcholinesterase (BChE: inhibition of drug metabolism or stoichiometric scavenging of EOH inhibitors; cognition enhancement), carboxylesterase (CaE: inhibition of drug metabolism or stoichiometric scavenging of EOH inhibitors), and neuropathy target esterase (NTE: delayed neurotoxicity). QSAR techniques encompassed linear regression and backpropagation neural networks in conjunction with fragmental descriptors containing labelled atoms, Molecular Field Topology Analysis (MFTA), Comparative Molecular Similarity Index Analysis (CoMSIA), and molecular modelling. All methods provided mostly consistent and complementary information, and they revealed structural features controlling the 'esterase profiles', i.e. patterns of anti-EOH activities and selectivities of the compounds of interest. In addition, MFTA models were used to design a library of compounds having a cognition-enhancement esterase profile suitable for potential application to the treatment of Alzheimer's disease.


Subject(s)
Alzheimer Disease/enzymology , Alzheimer Disease/physiopathology , Enzyme Inhibitors/chemistry , Enzyme Inhibitors/pharmacology , Esterases/antagonists & inhibitors , Oximes/chemistry , Oximes/pharmacology , Enzyme Inhibitors/metabolism , Humans , Models, Molecular , Oximes/metabolism , Phosphorylation , Quantitative Structure-Activity Relationship
6.
Mol Inform ; 31(3-4): 301-12, 2012 Apr.
Article in English | MEDLINE | ID: mdl-27477099

ABSTRACT

Here, the utility of Generative Topographic Maps (GTM) for data visualization, structure-activity modeling and database comparison is evaluated, on hand of subsets of the Database of Useful Decoys (DUD). Unlike other popular dimensionality reduction approaches like Principal Component Analysis, Sammon Mapping or Self-Organizing Maps, the great advantage of GTMs is providing data probability distribution functions (PDF), both in the high-dimensional space defined by molecular descriptors and in 2D latent space. PDFs for the molecules of different activity classes were successfully used to build classification models in the framework of the Bayesian approach. Because PDFs are represented by a mixture of Gaussian functions, the Bhattacharyya kernel has been proposed as a measure of the overlap of datasets, which leads to an elegant method of global comparison of chemical libraries.

7.
Chem Biol Interact ; 187(1-3): 177-84, 2010 Sep 06.
Article in English | MEDLINE | ID: mdl-20035729

ABSTRACT

This paper reviews previously published data and presents new results to address the hypothesis that fluorinated aminophosphonates (FAPs), (RO)(2)P(O)C(CF(3))(2)NHS(O)(2)C(6)H(5), R=alkyl, inhibit serine esterases by scission of the P-C bond. Kinetics studies demonstrated that FAPs are progressive irreversible inhibitors of acetylcholinesterase (AChE, EC 3.1.1.7.), butyrylcholinesterase (BChE, EC 3.1.1.8.), carboxylesterase (CaE, EC 3.1.1.1.), and neuropathy target esterase (NTE, EC 3.1.1.5.), consistent with P-C bond breakage. Chemical reactivity experiments showed that diMe-FAP and diEt-FAP react with water to yield the corresponding dialkylphosphates and (CF(3))(2)CHNHS(O)(2)C(6)H(5), indicating lability of the P-C bond. X-ray crystallography of diEt-FAP revealed an elongated (and therefore weaker) P-C bond (1.8797 (13)A) compared to P-C bonds in dialkylphosphonates lacking alpha-CF(3) groups (1.805-1.822A). Semi-empirical and non-empirical molecular modeling of diEt-FAP and (EtO)(2)P(O)C(CH(3))(2)NHS(O)(2)C(6)H(5) (diEt-AP), which lacks CF(3) groups, indicated lengthening and destabilization of the P-C bond in diEt-FAP compared to diEt-AP. Active site peptide adducts formed by reacting diEt-FAP with BChE and diBu-FAP with NTE catalytic domain (NEST) were identified using peptide mass mapping with mass spectrometry (MS). Mass shifts (mean+/-SE, average mass) for peaks corresponding to active site peptides with diethylphosphoryl and monoethylphosphoryl adducts on BChE were 136.1+/-0.1 and 108.0+/-0.1Da, respectively. Corresponding mass shifts for dibutylphosphoryl and monobutylphosphoryl adducts on NEST were 191.8+/-0.2 and 135.5+/-0.1Da, respectively. Each of these values was statistically identical to the theoretical mass shift for each dialkylphosphoryl and monoalkylphosphoryl species. The MS results demonstrate that inhibition of BChE and NEST by FAPs yields dialkylphosphoryl and monoalkylphosphoryl adducts, consistent with phosphorylation via P-C bond cleavage and aging by net dealkylation. Taken together, predictions from enzyme kinetics, chemical reactivity, X-ray crystallography, and molecular modeling were confirmed by MS and support the hypothesis that FAPs inhibit serine esterases via scission of the P-C bond.


Subject(s)
Enzyme Inhibitors/chemistry , Enzyme Inhibitors/pharmacology , Esterases/antagonists & inhibitors , Halogenation , Organophosphonates/chemistry , Organophosphonates/pharmacology , Animals , Crystallography, X-Ray , Esterases/chemistry , Esterases/metabolism , Humans , Kinetics , Mass Spectrometry , Models, Molecular , Molecular Conformation , Peptide Mapping
9.
J Mol Graph Model ; 26(7): 1179-87, 2008 Apr.
Article in English | MEDLINE | ID: mdl-18296089

ABSTRACT

Signaling pathways of Wnt-proteins and Fzd-receptors play important role in processes of growth and development of stem cells and in many types of cancers. The binding of the Wnt-proteins and Fzd-receptors is a complicated process, in which 19 Wnt-proteins and 10 Fzd-receptors are involved. Such a large number of combinations of Wnt-Fzd pairs leads to many different influences of Fzd-Wnt-complexes on the development and differentiation of stem cells. The molecular models of xWnt8, hWnt8, mFzd8, hFzd8-proteins and their complexes were constructed and studied in the present work. The amino acids of the binding sites of proteins which participate in these complexes formation and the protein-protein interactions were studied. The pharmacophoric model of the binding site on the xWnt8 and hWnt8-proteins was constructed. In this work we suggested the peptidomimetic ligands, which can be used for the inhibition of the xWnt8-mFzd8 and hWnt8-hFzd8 proteins formation. The de novo design method of Allegrow software was used for the predictions of most prospective functional groups of the peptidomimetic ligands. These ligands can be used as inhibitors of xWnt8-mFzd8 and hWnt8-hFzd8 complex formation and also can be used for drug design by other methods.


Subject(s)
Models, Molecular , Peptides/chemistry , Wnt Proteins/chemistry , Xenopus Proteins/chemistry , Algorithms , Amino Acid Sequence , Animals , Binding Sites , Computer-Aided Design , Drug Design , Frizzled Receptors/chemistry , Frizzled Receptors/metabolism , Humans , Ligands , Mice , Molecular Sequence Data , Peptides/metabolism , Peptides/pharmacology , Protein Binding , Protein Conformation , Receptors, G-Protein-Coupled/chemistry , Receptors, G-Protein-Coupled/metabolism , Software , Wnt Proteins/antagonists & inhibitors , Wnt Proteins/metabolism , Xenopus , Xenopus Proteins/antagonists & inhibitors , Xenopus Proteins/metabolism
11.
Biochemistry (Mosc) ; 70(10): 1167-72, 2005 Oct.
Article in English | MEDLINE | ID: mdl-16271036

ABSTRACT

Modeling the structure of the C-domain of bovine angiotensin-converting enzyme revealed two putative chloride-binding sites. The kinetic parameters, K(m) and k(cat), of hydrolysis of the substrate Cbz-Phe-His-Leu catalyzed by the testicular (C-domain) enzyme were determined over a wide range of chloride concentrations. Chloride anions were found to be enzyme activators at relatively low concentrations, but they inhibit enzymatic activity at high concentrations. A general scheme for the effect of chloride anions on activity of the C-domain of bovine angiotensin-converting enzyme accounting for binding the "activating" and "inhibiting" anions is suggested.


Subject(s)
Chlorides/metabolism , Peptidyl-Dipeptidase A/metabolism , Testis/enzymology , Amino Acid Sequence , Angiotensin-Converting Enzyme Inhibitors/pharmacology , Animals , Aspartic Acid/chemistry , Binding Sites , Cattle , Hydrolysis , Kinetics , Lysine/chemistry , Male , Models, Chemical , Molecular Sequence Data , Oligopeptides/chemistry , Peptidyl-Dipeptidase A/chemistry , Peptidyl-Dipeptidase A/physiology , Substrate Specificity
16.
Bioorg Khim ; 29(5): 470-8, 2003.
Article in Russian | MEDLINE | ID: mdl-14601401

ABSTRACT

Somatic angiotensin-converting enzyme (ACE) consists of two homologous domains, each of them containing an active site. Differences in substrate specificities and affinity to inhibitors of the active sites of the two domains of bovine ACE are described. The ACE domains demonstrate different thermostability, and the reasons for this difference are analyzed. A structural model of the ACE domains is suggested, which allows us to reveal the structural subdomain important for the protein stability and localize the hydrophobic and the carbohydrate-binding sites.


Subject(s)
Peptidyl-Dipeptidase A/metabolism , Amino Acid Sequence , Kinetics , Models, Molecular , Molecular Sequence Data , Peptidyl-Dipeptidase A/chemistry , Protein Conformation , Sequence Homology, Amino Acid , Structure-Activity Relationship
SELECTION OF CITATIONS
SEARCH DETAIL
...