ABSTRACT
It has been shown that water solubility and octanol/water partition coefficient for a large diverse set of compounds can be predicted simultaneously using molecular descriptors derived solely from a two dimensional representation of molecular structure. These properties have been modelled using multiple linear regression, artificial neural networks and a statistical method known as canonical correlation analysis. The neural networks give slightly better models both in terms of fitting and prediction presumably due to the fact that they include non-linear terms. The statistical methods, on the other hand, provide information concerning the explanation of variance and allow easy interrogation of the models. Models were fitted using a training set of 552 compounds, a validation set and test set each containing 68 molecules and two separate literature test sets for solubility and partition.
Subject(s)
Drug Design , Models, Chemical , 1-Octanol , Linear Models , Molecular Structure , Neural Networks, Computer , Pesticides/chemistry , Solubility , WaterABSTRACT
The aim of this study was to determine the efficacy of atom-type electrotopological state indices for estimation of the octanol-water partition coefficient (log P) values in a set of 345 drug compounds or related complex chemical structures. Multilinear regression analysis and artificial neural networks were used to construct models based on molecular weights and atom-type electrotopological state indices. Both multilinear regression and artificial neural networks provide reliable log P estimations. For the same set of parameters, application of neural networks provided better prediction ability for training and test sets. The present study indicates that atom-type electrotopological state indices offer valuable parameters for fast evaluation of octanol-water partition coefficients that can be applied to screen large databases of chemical compounds, such as combinatorial libraries.