RESUMO
We report the QSAR modeling of cytochrome P450 3A4 (CYP3A4) enzyme inhibition using four large data sets of in vitro data. These data sets consist of marketed drugs and drug-like compounds all tested in four assays measuring the inhibition of the metabolism of four different substrates by the CYP3A4 enzyme. The four probe substrates are benzyloxycoumarin, testosterone, benzyloxyresorufin, and midazolam. We first show that using state-of-the-art QSAR modeling approaches applied to only one of these four data sets does not lead to predictive models that would be useful for in silico filtering of chemical libraries. We then present the development and the testing of a multiple pharmacophore hypothesis (MPH) that is formulated as a conceptual extension of the traditional QSAR approach to modeling the promiscuous binding of a large variety of drugs to CYP3A4. In the simplest form, the MPH approach takes advantage of the multiple substrate data sets and identifies the binding of test compounds as either proximal or distal relative to that of a given substrate. Application of the approach to the in silico filtering of test compounds for potential inhibitors of CYP3A4 is also presented. In addition to an improvement in the QSAR modeling for the inhibition of CYP3A4, the results from this modeling approach provide structural insights into the drug-enzyme interactions. The existence of multiple inhibition data sets in the BioPrint database motivates the original development of the concept of a multiple pharmacophore hypothesis and provides a unique opportunity for formulating alternative strategies of QSAR modeling of the inhibition of the in vitro metabolism of CYP3A4.
Assuntos
Inibidores das Enzimas do Citocromo P-450 , Inibidores Enzimáticos/farmacologia , Modelos Moleculares , Citocromo P-450 CYP3A , Sistema Enzimático do Citocromo P-450/química , Sistema Enzimático do Citocromo P-450/metabolismo , Relação Quantitativa Estrutura-Atividade , Especificidade por SubstratoRESUMO
Computational methods are increasingly used to streamline and enhance the lead discovery and optimization process. However, accurate prediction of absorption, distribution, metabolism and excretion (ADME) and adverse drug reactions (ADR) is often difficult, due to the complexity of underlying physiological mechanisms. Modeling approaches have been hampered by the lack of large, robust and standardized training datasets. In an extensive effort to build such a dataset, the BioPrint database was constructed by systematic profiling of nearly all drugs available on the market, as well as numerous reference compounds. The database is composed of several large datasets: compound structures and molecular descriptors, in vitro ADME and pharmacology profiles, and complementary clinical data including therapeutic use information, pharmacokinetics profiles and ADR profiles. These data have allowed the development of computational tools designed to integrate a program of computational chemistry into library design and lead development. Models based on chemical structure are strengthened by in vitro results that can be used as additional compound descriptors to predict complex in vivo endpoints. The BioPrint pharmacoinformatics platform represents a systematic effort to accelerate the process of drug discovery, improve quantitative structure-activity relationships and develop in vitro/in vivo associations. In this review, we will discuss the importance of training set size and diversity in model development, the implementation of linear and neighborhood modeling approaches, and the use of in silico methods to predict potential clinical liabilities.