Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 10 de 10
Filter
Add more filters










Publication year range
1.
J Org Chem ; 88(16): 11954-11967, 2023 Aug 18.
Article in English | MEDLINE | ID: mdl-37540578

ABSTRACT

The kinetic data indicate that the addition of tertiary phosphines to α-methylene lactones in acetic acid is strongly accelerated in comparison to the reactions of related open-chain esters. Six-membered α-methylene-δ-valerolactone exhibited a more pronounced rate increase than five-membered α-methylene-γ-butyrolactone. The use of α-methylene-γ-butyrolactam as a nitrogen analogue of α-methylene-γ-butyrolactone resulted in a total loss of the reaction acceleration. The observed reactivities were rationalized by DFT calculations at the RwB97XD/6-31+G(d,p) level of theory, showing that the intramolecular interaction between phosphonium and enolate oxygen centers provided by the locked s-cis-geometry of the heterocycles plays an important role in the stabilization of intermediate zwitterions. The reactivity is also controlled by the conformational flexibility of the heterocycle. The geometries of five-membered and, especially, six-membered lactone cycles are slightly changed upon the nucleophilic attack of phosphine, leading to the stabilizing stereoelectronic effect by the Ρ···Ο interaction. The addition of phosphine to α-methylene-γ-butyrolactam significantly distorts the initial geometry of the heterocycle, making the nucleophilic attack unfavorable. The application of the stereoelectronic effect to enhance the efficiency of the phosphine-catalyzed Michael and Pudovik reactions of α-methylene lactones was demonstrated.

2.
Mol Inform ; 41(4): e2100138, 2022 04.
Article in English | MEDLINE | ID: mdl-34726834

ABSTRACT

In this paper, we compare the most popular Atom-to-Atom Mapping (AAM) tools: ChemAxon,[1] Indigo,[2] RDTool,[3] NameRXN (NextMove),[4] and RXNMapper[5] which implement different AAM algorithms. An open-source RDTool program was optimized, and its modified version ("new RDTool") was considered together with several consensus mapping strategies. The Condensed Graph of Reaction approach was used to calculate chemical distances and develop the "AAM fixer" algorithm for an automatized correction of erroneous mapping. The benchmarking calculations were performed on a Golden dataset containing 1851 manually mapped and curated reactions. The best performing RXNMapper program together with the AMM Fixer was applied to map the USPTO database. The Golden dataset, mapped USPTO and optimized RDTool are available in the GitHub repository https://github.com/Laboratoire-de-Chemoinformatique.


Subject(s)
Benchmarking , Biochemical Phenomena , Algorithms , Databases, Factual
3.
J Chem Inf Model ; 61(10): 4913-4923, 2021 10 25.
Article in English | MEDLINE | ID: mdl-34554736

ABSTRACT

Modern QSAR approaches have wide practical applications in drug discovery for designing potentially bioactive molecules. If such models are based on the use of 2D descriptors, important information contained in the spatial structures of molecules is lost. The major problem in constructing models using 3D descriptors is the choice of a putative bioactive conformation, which affects the predictive performance. The multi-instance (MI) learning approach considering multiple conformations in model training could be a reasonable solution to the above problem. In this study, we implemented several multi-instance algorithms, both conventional and based on deep learning, and investigated their performance. We compared the performance of MI-QSAR models with those based on the classical single-instance QSAR (SI-QSAR) approach in which each molecule is encoded by either 2D descriptors computed for the corresponding molecular graph or 3D descriptors issued for a single lowest energy conformation. The calculations were carried out on 175 data sets extracted from the ChEMBL23 database. It is demonstrated that (i) MI-QSAR outperforms SI-QSAR in numerous cases and (ii) MI algorithms can automatically identify plausible bioactive conformations.


Subject(s)
Algorithms , Quantitative Structure-Activity Relationship , Databases, Factual , Drug Discovery , Molecular Conformation
4.
Mol Inform ; 40(12): e2100119, 2021 12.
Article in English | MEDLINE | ID: mdl-34427989

ABSTRACT

The quality of experimental data for chemical reactions is a critical consideration for any reaction-driven study. However, the curation of reaction data has not been extensively discussed in the literature so far. Here, we suggest a 4 steps protocol that includes the curation of individual structures (reactants and products), chemical transformations, reaction conditions and endpoints. Its implementation in Python3 using CGRTools toolkit has been used to clean three popular reaction databases Reaxys, USPTO and Pistachio. The curated USPTO database is available in the GitHub repository (Laboratoire-de-Chemoinformatique/Reaction_Data_Cleaning).


Subject(s)
Data Curation , Databases, Factual , Reference Standards
5.
Int J Mol Sci ; 23(1)2021 Dec 27.
Article in English | MEDLINE | ID: mdl-35008674

ABSTRACT

The selection of experimental conditions leading to a reasonable yield is an important and essential element for the automated development of a synthesis plan and the subsequent synthesis of the target compound. The classical QSPR approach, requiring one-to-one correspondence between chemical structure and a target property, can be used for optimal reaction conditions prediction only on a limited scale when only one condition component (e.g., catalyst or solvent) is considered. However, a particular reaction can proceed under several different conditions. In this paper, we describe the Likelihood Ranking Model representing an artificial neural network that outputs a list of different conditions ranked according to their suitability to a given chemical transformation. Benchmarking calculations demonstrated that our model outperformed some popular approaches to the theoretical assessment of reaction conditions, such as k Nearest Neighbors, and a recurrent artificial neural network performance prediction of condition components (reagents, solvents, catalysts, and temperature). The ability of the Likelihood Ranking model trained on a hydrogenation reactions dataset, (~42,000 reactions) from Reaxys® database, to propose conditions that led to the desired product was validated experimentally on a set of three reactions with rich selectivity issues.


Subject(s)
Models, Chemical , Hydrogenation , Likelihood Functions , Stereoisomerism
6.
Int J Mol Sci ; 21(15)2020 Aug 03.
Article in English | MEDLINE | ID: mdl-32756326

ABSTRACT

Nowadays, the problem of the model's applicability domain (AD) definition is an active research topic in chemoinformatics. Although many various AD definitions for the models predicting properties of molecules (Quantitative Structure-Activity/Property Relationship (QSAR/QSPR) models) were described in the literature, no one for chemical reactions (Quantitative Reaction-Property Relationships (QRPR)) has been reported to date. The point is that a chemical reaction is a much more complex object than an individual molecule, and its yield, thermodynamic and kinetic characteristics depend not only on the structures of reactants and products but also on experimental conditions. The QRPR models' performance largely depends on the way that chemical transformation is encoded. In this study, various AD definition methods extensively used in QSAR/QSPR studies of individual molecules, as well as several novel approaches suggested in this work for reactions, were benchmarked on several reaction datasets. The ability to exclude wrong reaction types, increase coverage, improve the model performance and detect Y-outliers were tested. As a result, several "best" AD definitions for the QRPR models predicting reaction characteristics have been revealed and tested on a previously published external dataset with a clear AD definition problem.


Subject(s)
Cheminformatics/trends , Protein Domains , Quantitative Structure-Activity Relationship , Thermodynamics , Chemical Phenomena , Kinetics , Models, Molecular
7.
J Chem Inf Model ; 59(11): 4569-4576, 2019 11 25.
Article in English | MEDLINE | ID: mdl-31638794

ABSTRACT

Here, we describe a concept of conjugated models for several properties (activities) linked by a strict mathematical relationship. This relationship can be directly integrated analytically into the ridge regression (RR) algorithm or accounted for in a special case of "twin" neural networks (NN). Developed approaches were applied to the modeling of the logarithm of the prototropic tautomeric constant (logKT) which can be expressed as the difference between the acidity constants (pKa) of two related tautomers. Both conjugated and individual RR and NN models for logKT and pKa were developed. The modeling set included 639 tautomeric constants and 2371 acidity constants of organic molecules in various solvents. A descriptor vector for each reaction resulted from the concatenation of structural descriptors and some parameters for reaction conditions. For the former, atom-centered substructural fragments describing acid sites in tautomer molecules were used. The latter were automatically identified using the condensed graph of reaction approach. Conjugated models performed similarly to the best individual models for logKT and pKa. At the same time, the physically grounded relationship between logKT and pKa was respected only for conjugated but not individual models.


Subject(s)
Organic Chemicals/chemistry , Pharmaceutical Preparations/chemistry , Acids/chemistry , Algorithms , Drug Discovery , Models, Chemical , Molecular Structure , Neural Networks, Computer , Quantitative Structure-Activity Relationship , Solvents/chemistry , Stereoisomerism
8.
J Chem Inf Model ; 59(6): 2516-2521, 2019 06 24.
Article in English | MEDLINE | ID: mdl-31063394

ABSTRACT

CGRtools is an open-source Python library aimed to handle molecular and reaction information. It is the sole library developed so far which can process condensed graph of reaction (CGR) handling. CGR provides the possibility for advanced operations with reaction information and could be used for reaction descriptor calculation, structure-reactivity modeling, atom-to-atom mapping comparison and correction, reaction center extraction, reaction balancing, and some other related tasks. Unlike other popular libraries, CGRtools is fully written in Python with minor dependencies on other libraries and cross-platform. Reaction, molecule, and CGR objects in CGRtools support native Python methods and are comparable with the help of operations "equal to", "less than", and "bigger than". CGRtools supports common structural formats. CGRtools is distributed via an L-GPL license and available on https://github.com/cimm-kzn/CGRtools .


Subject(s)
Cheminformatics/methods , Small Molecule Libraries/chemistry , Software , Chemical Phenomena , Models, Chemical
9.
Beilstein J Org Chem ; 14: 1980-1993, 2018.
Article in English | MEDLINE | ID: mdl-30202452

ABSTRACT

The synthesis of new calix[4]arenes adopting a cone stereoisomeric form bearing two or four azide fragments on the upper rim and water-soluble triazolyl amphiphilic receptors with two or four polyammonium headgroups via copper-catalyzed azide-alkyne cycloaddition reaction has been performed for the first time. It was found that the synthesized macrocycles form stable aggregates with hydrodynamic diameters between 150-200 nm and electrokinetic potentials about +40 to +60 mV in water solutions. Critical aggregation concentration (CAC) values were measured using a micelle method with pyrene and eosin Y as dye probes. The CAC values of tetraalkyl-substituted macrocycles 12a,b (5 µM for both) are significantly lower than those for dialkyl-substituted macrocycles 10a,b (790 and 160 µM, respectively). Premicellar aggregates of macrocycles 10a,b and 12a,b with the dye eosin Y were used for nucleotides sensing through a dye replacement procedure. It is unusual that disubstituted macrocycles 10a,b bind more effectively a less charged adenosine 5'-diphosphate (ADP) than adenosine 5'-triphosphate (ATP). A simple colorimetric method based on polydiacetylene vesicles decorated with 10b was elaborated for the naked-eye detection of ADP with a detection limit of 0.5 mM.

10.
J Chem Inf Model ; 56(11): 2140-2148, 2016 11 28.
Article in English | MEDLINE | ID: mdl-27783508

ABSTRACT

We report a new method to assess protective groups (PGs) reactivity as a function of reaction conditions (catalyst, solvent) using raw reaction data. It is based on an intuitive similarity principle for chemical reactions: similar reactions proceed under similar conditions. Technically, reaction similarity can be assessed using the Condensed Graph of Reaction (CGR) approach representing an ensemble of reactants and products as a single molecular graph, i.e., as a pseudomolecule for which molecular descriptors or fingerprints can be calculated. CGR-based in-house tools were used to process data for 142,111 catalytic hydrogenation reactions extracted from the Reaxys database. Our results reveal some contradictions with famous Greene's Reactivity Charts based on manual expert analysis. Models developed in this study show high accuracy (ca. 90%) for predicting optimal experimental conditions of protective group deprotection.


Subject(s)
Informatics/methods , Automation , Catalysis , Databases, Factual , Hydroxides/chemistry , Models, Chemical , Phenols/chemistry , Solvents/chemistry
SELECTION OF CITATIONS
SEARCH DETAIL
...