Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 25
Filter
Add more filters










Publication year range
1.
J Chem Inf Model ; 64(6): 1765-1771, 2024 Mar 25.
Article in English | MEDLINE | ID: mdl-38480486

ABSTRACT

Computational tools are revolutionizing our understanding and prediction of chemical reactivity by combining traditional data analysis techniques with new predictive models. These tools extract additional value from the reaction data corpus, but to effectively convert this value into actionable knowledge, domain specialists need to interact easily with the computer-generated output. In this application note, we demonstrate the capabilities of the open-source Python toolkit LinChemIn, which simplifies the manipulation of reaction networks and provides advanced functionality for working with synthetic routes. LinChemIn ensures chemical consistency when merging, editing, mining, and analyzing reaction networks. Its flexible input interface can process routes from various sources, including predictive models and expert input. The toolkit also efficiently extracts individual routes from the combined synthetic tree, identifying alternative paths and reaction combinations. By reducing the operational barrier to accessing and analyzing synthetic routes from multiple sources, LinChemIn facilitates a constructive interplay between artificial intelligence and human expertise.


Subject(s)
Artificial Intelligence , Software , Humans
2.
Chem Mater ; 35(21): 8806-8815, 2023 Nov 14.
Article in English | MEDLINE | ID: mdl-38027545

ABSTRACT

The world is on the verge of a new industrial revolution, and language models are poised to play a pivotal role in this transformative era. Their ability to offer intelligent insights and forecasts has made them a valuable asset for businesses seeking a competitive advantage. The chemical industry, in particular, can benefit significantly from harnessing their power. Since 2016 already, language models have been applied to tasks such as predicting reaction outcomes or retrosynthetic routes. While such models have demonstrated impressive abilities, the lack of publicly available data sets with universal coverage is often the limiting factor for achieving even higher accuracies. This makes it imperative for organizations to incorporate proprietary data sets into their model training processes to improve their performance. So far, however, these data sets frequently remain untapped as there are no established criteria for model customization. In this work, we report a successful methodology for retraining language models on reaction outcome prediction and single-step retrosynthesis tasks, using proprietary, nonpublic data sets. We report a considerable boost in accuracy by combining patent and proprietary data in a multidomain learning formulation. This exercise, inspired by a real-world use case, enables us to formulate guidelines that can be adopted in different corporate settings to customize chemical language models easily.

3.
J Cheminform ; 15(1): 41, 2023 Apr 01.
Article in English | MEDLINE | ID: mdl-37005691

ABSTRACT

BACKGROUND: The increasing amount of chemical reaction data makes traditional ways to navigate its corpus less effective, while the demand for novel approaches and instruments is rising. Recent data science and machine learning techniques support the development of new ways to extract value from the available reaction data. On the one side, Computer-Aided Synthesis Planning tools can predict synthetic routes in a model-driven approach; on the other side, experimental routes can be extracted from the Network of Organic Chemistry, in which reaction data are linked in a network. In this context, the need to combine, compare and analyze synthetic routes generated by different sources arises naturally. RESULTS: Here we present LinChemIn, a python toolkit that allows chemoinformatics operations on synthetic routes and reaction networks. Wrapping some third-party packages for handling graph arithmetic and chemoinformatics and implementing new data models and functionalities, LinChemIn allows the interconversion between data formats and data models and enables route-level analysis and operations, including route comparison and descriptors calculation. Object-Oriented Design principles inspire the software architecture, and the modules are structured to maximize code reusability and support code testing and refactoring. The code structure should facilitate external contributions, thus encouraging open and collaborative software development. CONCLUSIONS: The current version of LinChemIn allows users to combine synthetic routes generated from various tools and analyze them, and constitutes an open and extensible framework capable of incorporating contributions from the community and fostering scientific discussion. Our roadmap envisages the development of sophisticated metrics for routes evaluation, a multi-parameter scoring system, and the implementation of an entire "ecosystem" of functionalities operating on synthetic routes. LinChemIn is freely available at https://github.com/syngenta/linchemin.

4.
Chimia (Aarau) ; 75(3): 211-212, 2021 03 31.
Article in English | MEDLINE | ID: mdl-33766206
5.
J Chem Biol ; 7(4): 125-8, 2014 Oct.
Article in English | MEDLINE | ID: mdl-25320645

ABSTRACT

Neonicotinoid insecticides selectively target the invertebrate nicotinic acetylcholine receptor and disrupt excitatory cholinergic neurotransmission. First launched over 20 years ago, their broad pest spectrum, variety of application methods and relatively low risk to nontarget organisms have resulted in this class dominating the insecticide market with global annual sales in excess of $3.5 bn. This remarkable commercial success brings with it conditions in the field that favour selection of resistant phenotypes. A number of important pest species have been identified with mutations at the nicotinic acetylcholine receptor associated with insensitivity to neonicotinoids. The detailed characterization of these mutations has facilitated a greater understanding of the invertebrate nicotinic acetylcholine receptor.

6.
J Phys Chem B ; 118(27): 7457-7466, 2014 Jul 10.
Article in English | MEDLINE | ID: mdl-24914486

ABSTRACT

The stereospecific oxidative decomposition of urate into allantoin is the core of purine catabolism in many organisms. The spontaneous decomposition of upstream intermediates and the nonenzymatic racemization of allantoin lead to an accumulation of (R)-allantoin, because the enzymes converting allantoin into allantoate are specific for the (S) isomer. The enzyme allantoin racemase catalyzes the reversible conversion between the two allantoin enantiomers, thus ensuring the overall efficiency of the catabolic pathway and preventing allantoin accumulation. On the basis of recent crystallographic and biochemical evidence, allantoin racemase has been assigned to the family of cofactor-independent racemases, together with other amino acid racemases. A detailed computational investigation of allantoin racemase has been carried out to complement the available experimental data and to provide atomistic insight into the enzymatic action. Allantoin, the natural substrate of the enzyme, has been investigated at the quantum mechanical level, in order to rationalize its conformational and tautomeric equilibria, playing a key role in protein-ligand recognition and in the following catalytic steps. The reaction mechanism of the enzyme has been elucidated through quantum mechanics/molecular mechanics (QM/MM) calculations. The potential energy surface investigation, carried out at the QM/MM level, revealed a stepwise reaction mechanism. A pair of cysteine residues promotes the stereoinversion of a carbon atom of the ligand without the assistance of cofactors. Electrostatic fingerprint calculations are used to discuss the role of the active site residues in lowering the pKa of the substrate. The planar unprotonated intermediate is compared with the enolic allantoin tautomer observed in the active site of the crystallized enzyme. Finally, the enzymatic catalysis featured by allantoin racemase (AllR) is compared with that of other enzymes belonging to the same family.

7.
J Bacteriol ; 196(10): 1889-900, 2014 May.
Article in English | MEDLINE | ID: mdl-24633871

ABSTRACT

The nucleoid-associated protein EspR, a chromosome organizer, has pleiotropic effects on expression of genes associated with cell wall function and pathogenesis in Mycobacterium tuberculosis. In particular, EspR binds to several sites upstream of the espACD locus to promote its expression, thereby ensuring full function of the ESX-1 secretion system, a major virulence determinant. The N terminus of EspR contains the helix-turn-helix DNA-binding domain, whereas the C-terminal dimerization domain harbors residues involved in intersubunit interactions. While direct binding to DNA appears to be mediated by an EspR dimer-of-dimers, where two helix-turn-helix motifs remain free for long-range interactions, the mechanism of EspR higher-order organization and its impact on chromosome structure and gene expression are not understood. To investigate these processes, we identified seven amino acid residues using molecular dynamics and replaced them with Ala in order to probe interactions at either the dimer or the dimer-of-dimers interfaces. Arg70, Lys72, and Arg101 were important for protein stability and optimal DNA-binding activity. Moreover, the Arg70 mutant showed decreased dimerization in a mycobacterial two-hybrid system. To correlate these defects with higher-order organization and transcriptional activity, we used atomic force microscopy to observe different EspR mutant proteins in complex with the espACD promoter region. In addition, complementation of an M. tuberculosis espR knockout mutant was performed to measure their impact on EspA expression. Our results pinpoint key residues required for EspR function at the dimer (Arg70) and the dimer-of-dimers (Lys72) interface and demonstrate that EspR dimerization and higher-order oligomerization modulate espACD transcriptional activity and hence pathogenesis.


Subject(s)
Bacterial Proteins/metabolism , Gene Expression Regulation, Bacterial/physiology , Mycobacterium tuberculosis/metabolism , Mycobacterium tuberculosis/pathogenicity , Amino Acid Sequence , Bacterial Proteins/genetics , Binding Sites , DNA, Bacterial/genetics , Models, Molecular , Molecular Sequence Data , Mutagenesis, Site-Directed , Mycobacterium tuberculosis/genetics , Promoter Regions, Genetic , Protein Binding , Protein Conformation , Protein Subunits , Transcription, Genetic , Virulence
8.
PLoS One ; 8(11): e80261, 2013.
Article in English | MEDLINE | ID: mdl-24282528

ABSTRACT

Transcription Activator-Like (TAL) effectors are DNA-binding proteins secreted by phytopathogenic bacteria that interfere with native cellular functions by binding to plant DNA promoters. The key element of their architecture is a domain of tandem-repeats with almost identical sequences. Most of the polymorphism is located at two consecutive amino acids termed Repeat Variable Diresidue (RVD). The discovery of a direct link between the RVD composition and the targeted nucleotide allowed the design of TAL-derived DNA-binding tools with programmable specificities that revolutionized the field of genome engineering. Despite structural data, the molecular origins of this specificity as well as the recognition mechanism have remained unclear. Molecular simulations of the recent crystal structures suggest that most of the protein-DNA binding energy originates from non-specific interactions between the DNA backbone and non-variable residues, while RVDs contributions are negligible. Based on dynamical and energetic considerations we postulate that, while the first RVD residue promotes helix breaks--allowing folding of TAL as a DNA-wrapping super-helix--the second provides specificity through a negative discrimination of matches. Furthermore, we propose a simple pharmacophore-like model for the rationalization of RVD-DNA interactions and the interpretation of experimental findings concerning shared affinities and binding efficiencies. The explanatory paradigm presented herein provides a better comprehension of this elegant architecture and we hope will allow for improved designs of TAL-derived biotechnological tools.


Subject(s)
Bacterial Proteins/chemistry , DNA-Binding Proteins/chemistry , Amino Acid Sequence , Bacterial Proteins/genetics , Bacterial Proteins/physiology , DNA Methylation , DNA, Plant/chemistry , DNA-Binding Proteins/genetics , DNA-Binding Proteins/physiology , Molecular Dynamics Simulation , Molecular Sequence Data , Plant Diseases/microbiology , Promoter Regions, Genetic , Tandem Repeat Sequences
9.
Elife ; 2: e00792, 2013 Jul 30.
Article in English | MEDLINE | ID: mdl-23908767

ABSTRACT

Injectisomes are multi-protein transmembrane machines allowing pathogenic bacteria to inject effector proteins into eukaryotic host cells, a process called type III secretion. Here we present the first three-dimensional structure of Yersinia enterocolitica and Shigella flexneri injectisomes in situ and the first structural analysis of the Yersinia injectisome. Unexpectedly, basal bodies of injectisomes inside the bacterial cells showed length variations of 20%. The in situ structures of the Y. enterocolitica and S. flexneri injectisomes had similar dimensions and were significantly longer than the isolated structures of related injectisomes. The crystal structure of the inner membrane injectisome component YscD appeared elongated compared to a homologous protein, and molecular dynamics simulations documented its elongation elasticity. The ring-shaped secretin YscC at the outer membrane was stretched by 30-40% in situ, compared to its isolated liposome-embedded conformation. We suggest that elasticity is critical for some two-membrane spanning protein complexes to cope with variations in the intermembrane distance. DOI:http://dx.doi.org/10.7554/eLife.00792.001.


Subject(s)
Membrane Proteins/metabolism , Yersinia enterocolitica/metabolism , Cryoelectron Microscopy , Membrane Proteins/chemistry , Osmotic Pressure , Protein Conformation
10.
Chemistry ; 19(24): 7696-700, 2013 Jun 10.
Article in English | MEDLINE | ID: mdl-23649888

ABSTRACT

Ironing out the details: Proline and pyrrolidine derivatives (Hayashi- Jørgensen catalysts) are considered "work horses" in organocatalysis. This report describes a new effective ferrocenyl pyrrolidine catalyst that is able to perform well in benchmark organocatalytic reactions (see figure). The ferrocene moiety controls the conformational space and a simple alkyl group effectively covers a face of the derived enamine. This new framework can find applications in organocatalysis, and in general, in new ligand design.


Subject(s)
Amines/chemistry , Ferrous Compounds/chemical synthesis , Proline/analogs & derivatives , Pyrrolidines/chemical synthesis , Catalysis , Ferrous Compounds/chemistry , Molecular Structure , Proline/chemical synthesis , Proline/chemistry , Pyrrolidines/chemistry , Stereoisomerism
11.
J Chem Theory Comput ; 9(2): 857-62, 2013 Feb 12.
Article in English | MEDLINE | ID: mdl-26588728

ABSTRACT

Type II topoisomerase (topoII) is a metalloenzyme targeted by clinical antibiotics and anticancer agents. Here, we integrate existing structural data with molecular simulation and propose a model for the yet uncharacterized structure of the reactant state of topoII. This model describes a canonical two-metal-ion mechanism and suggests how the metals could rearrange at the catalytic pocket during enzymatic turnover, explaining also experimental evidence for topoII inhibition. These results call for further experimental validation.

12.
PLoS Pathog ; 8(5): e1002667, 2012.
Article in English | MEDLINE | ID: mdl-22570611

ABSTRACT

Capnocytophaga canimorsus is a usual member of dog's mouths flora that causes rare but dramatic human infections after dog bites. We determined the structure of C. canimorsus lipid A. The main features are that it is penta-acylated and composed of a "hybrid backbone" lacking the 4' phosphate and having a 1 phosphoethanolamine (P-Etn) at 2-amino-2-deoxy-d-glucose (GlcN). C. canimorsus LPS was 100 fold less endotoxic than Escherichia coli LPS. Surprisingly, C. canimorsus lipid A was 20,000 fold less endotoxic than the C. canimorsus lipid A-core. This represents the first example in which the core-oligosaccharide dramatically increases endotoxicity of a low endotoxic lipid A. The binding to human myeloid differentiation factor 2 (MD-2) was dramatically increased upon presence of the LPS core on the lipid A, explaining the difference in endotoxicity. Interaction of MD-2, cluster of differentiation antigen 14 (CD14) or LPS-binding protein (LBP) with the negative charge in the 3-deoxy-D-manno-oct-2-ulosonic acid (Kdo) of the core might be needed to form the MD-2 - lipid A complex in case the 4' phosphate is not present.


Subject(s)
Capnocytophaga/pathogenicity , Endotoxins/chemistry , Endotoxins/metabolism , Lipid A/chemistry , Lipid A/metabolism , Acute-Phase Proteins/metabolism , Animals , Antigens, CD/metabolism , Capnocytophaga/metabolism , Carrier Proteins/metabolism , Cell Line , Dogs , HEK293 Cells , Humans , Interleukin-6/metabolism , Lipopolysaccharide Receptors/metabolism , Macrophages/metabolism , Membrane Glycoproteins/metabolism , Models, Molecular , Protein Structure, Tertiary , Sugar Acids/metabolism , Toll-Like Receptor 4/metabolism , Tumor Necrosis Factor-alpha/metabolism
13.
Mol Microbiol ; 82(1): 251-64, 2011 Oct.
Article in English | MEDLINE | ID: mdl-21883526

ABSTRACT

The human pathogen Mycobacterium tuberculosis requires the ESX-1 secretion system for full virulence. EspR plays a key role in ESX-1 regulation via direct binding and transcriptional activation of the espACD operon. Here, we describe the crystal structures of EspR, a C-terminally truncated form, EspRΔ10, as well as an EspR-DNA complex. EspR forms a dimer with each monomer containing an N-terminal helix-turn-helix DNA binding motif and an atypical C-terminal dimerization domain. Structural studies combined with footprinting experiments, atomic force microscopy and molecular dynamic simulations allow us to propose a model in which a dimer of EspR dimers is the minimal functional unit with two subunits binding two consecutive major grooves. The other two DNA binding domains are thus free to form higher-order oligomers and to bridge distant DNA sites in a cooperative way. These features are reminiscent of nucleoid-associated proteins and suggest a more general regulatory role for EspR than was previously suspected.


Subject(s)
Bacterial Proteins/chemistry , Bacterial Proteins/metabolism , DNA, Bacterial/metabolism , DNA-Binding Proteins/chemistry , DNA-Binding Proteins/metabolism , Genes, Regulator , Mycobacterium tuberculosis/metabolism , Amino Acid Sequence , Bacterial Proteins/genetics , Base Sequence , Binding Sites , Crystallography, X-Ray , DNA, Bacterial/genetics , DNA-Binding Proteins/genetics , Dimerization , Humans , Models, Molecular , Molecular Sequence Data , Mycobacterium tuberculosis/chemistry , Mycobacterium tuberculosis/genetics , Mycobacterium tuberculosis/pathogenicity , Protein Binding , Protein Structure, Tertiary , Tuberculosis/microbiology , Virulence
14.
Front Biosci (Elite Ed) ; 3(3): 1061-78, 2011 06 01.
Article in English | MEDLINE | ID: mdl-21622114

ABSTRACT

The advent of molecular medicine allowed identifying the malfunctioning of subcellular processes as the source of many diseases. Since then, drugs are not only discovered, but actually designed to fulfill a precise task. Modern computational techniques, based on molecular modeling, play a relevant role both in target identification and drug lead development. By flanking and integrating standard experimental techniques, modeling has proven itself as a powerful tool across the drug design process. The success of computational methods depends on a balance between cost (computation time) and accuracy. Thus, the integration of innovative theories and more powerful hardware architectures allows molecular modeling to be used as a reliable tool for rationalizing the results of experiments and accelerating the development of new drug design strategies. We present an overview of the most common quantum chemistry computational approaches, providing for each one a general theoretical introduction to highlight limitations and strong points. We then discuss recent developments in software and hardware resources, which have allowed state-of-the-art of computational quantum chemistry to be applied to drug development.


Subject(s)
Drug Design , Quantum Theory , Models, Molecular
15.
Phys Chem Chem Phys ; 13(9): 3645-8, 2011 Mar 07.
Article in English | MEDLINE | ID: mdl-21243153

ABSTRACT

The photochemical cis-trans isomerization of retinal in rhodopsin is investigated by structure sampling and excited state QM/MM trajectories with surface hopping. The calculations uncover the motions responsible for photoproduct formation and elucidate the reasons behind the efficient photoisomerization in the primary event of visual transduction.


Subject(s)
Hydrogen/chemistry , Rhodopsin/chemistry , Isomerism , Quantum Theory , Retinaldehyde/chemistry
16.
Chemistry ; 16(41): 12462-73, 2010 Nov 02.
Article in English | MEDLINE | ID: mdl-20839181

ABSTRACT

Asymmetric phase-transfer catalysis (PTC) has risen to prominence over the last decade as a straightforward synthetic methodology for the preparation of pharmacologically active compounds in enantiomerically pure form. However, the complex interplay of weak nonbonded interactions (between catalyst and substrate) that could account for the stereoselection in these processes is still unclear, with tentative pictorial mechanistic representations usually proposed. Here we present a full account dealing with the enantioselective phase-transfer-catalyzed intramolecular aza-Michael reaction (IMAMR) of indolyl esters, as a valuable synthetic tool to obtain added-value compounds, such as dihydro-pyrazinoindolinones. A combined computational and experimental investigation has been carried out to elucidate the key mechanistic aspects of this process.


Subject(s)
Indoles/chemical synthesis , Pyrroles/chemical synthesis , Alkylation , Catalysis , Indoles/chemistry , Models, Theoretical , Pyrroles/chemistry , Stereoisomerism
18.
Proc Natl Acad Sci U S A ; 107(31): 13860-5, 2010 Aug 03.
Article in English | MEDLINE | ID: mdl-20643949

ABSTRACT

The needle length of the Yersinia spp. injectisome is determined by Yop secretion protein P (YscP), an early substrate of the injectisome itself. There is a linear correlation between the length of YscP and the length of the needle, suggesting that YscP acts as a molecular ruler. However, it is not known whether one single molecule of YscP suffices to control the length of one needle or whether several molecules of YscP are exported in alternation with the needle subunit YscF until the needle length matches the ruler length, which would stop needle growth. To address this question, three different strains expressing simultaneously a short and a long version of YscP were engineered. The experimentally obtained needle length distribution was compared with the distributions predicted by stochastic modeling of the various possible scenarios. The experimental data are compatible with the single ruler model and not with the scenarios involving more than one ruler per needle.


Subject(s)
Bacterial Proteins/metabolism , Yersinia/cytology , Yersinia/metabolism , Alleles , Bacterial Proteins/genetics , Base Sequence , Protein Transport , Yersinia/genetics
19.
Phys Chem Chem Phys ; 12(19): 5016-23, 2010 May 21.
Article in English | MEDLINE | ID: mdl-20411206

ABSTRACT

We have employed hybrid CASPT2//CASSCF/AMBER calculations to map the (1)L(a)(1pipi*) deactivation path of a single quantum mechanical adenine in a d(A)(10).d(T)(10) double strand in water that is treated at the molecular mechanics level. We find that (a) the L(a) relaxation route is flatter in DNA than in vacuo and (b) the L(a) relaxation energy in DNA is much larger than the stabilization energy of the corresponding L(a) excimer. An intra-monomer relaxation process is found to be compatible with the multiexponential decay recorded in DNA, possibly including the longer (4100 ps) lifetime component.


Subject(s)
Adenine/chemistry , DNA/chemistry , Molecular Dynamics Simulation , Quantum Theory , Thermodynamics
20.
Chemistry ; 16(4): 1312-8, 2010 Jan 25.
Article in English | MEDLINE | ID: mdl-20024991

ABSTRACT

When Ar is a low-aromaticity homo- or heterosystem, the sulfonyl-stabilized anion of nitrobutadienes 4 (which derive from the initial ring opening of 3-nitrothiophene) undergoes a rather surprising addition onto the aromatic ring itself, thereby leading to the construction of an unusual eight-membered sulfur heterocycle condensed with the original Ar ring. The competitiveness of such a pathway with respect to the formation of the thiopyran ring (i.e., addition onto the nitrovinyl moiety) is favored at low temperatures, thus revealing its nature as a kinetically controlled process.

SELECTION OF CITATIONS
SEARCH DETAIL
...