Your browser doesn't support javascript.
Exploring QSAR models for activity-cliff prediction.
Dablander, Markus; Hanser, Thierry; Lambiotte, Renaud; Morris, Garrett M.
  • Dablander M; Mathematical Institute, University of Oxford, Andrew Wiles Building, Radcliffe Observatory Quarter (550), Woodstock Road, Oxford, OX2 6GG, UK.
  • Hanser T; Lhasa Limited, Granary Wharf House, 2 Canal Wharf, Leeds, LS11 5PS, UK.
  • Lambiotte R; Mathematical Institute, University of Oxford, Andrew Wiles Building, Radcliffe Observatory Quarter (550), Woodstock Road, Oxford, OX2 6GG, UK.
  • Morris GM; Department of Statistics, University of Oxford, 24-29 St Giles', Oxford, OX1 3LB, UK. garrett.morris@stats.ox.ac.uk.
J Cheminform ; 15(1): 47, 2023 Apr 17.
Article in English | MEDLINE | ID: covidwho-2293809
ABSTRACT
INTRODUCTION AND

METHODOLOGY:

Pairs of similar compounds that only differ by a small structural modification but exhibit a large difference in their binding affinity for a given target are known as activity cliffs (ACs). It has been hypothesised that QSAR models struggle to predict ACs and that ACs thus form a major source of prediction error. However, the AC-prediction power of modern QSAR methods and its quantitative relationship to general QSAR-prediction performance is still underexplored. We systematically construct nine distinct QSAR models by combining three molecular representation methods (extended-connectivity fingerprints, physicochemical-descriptor vectors and graph isomorphism networks) with three regression techniques (random forests, k-nearest neighbours and multilayer perceptrons); we then use each resulting model to classify pairs of similar compounds as ACs or non-ACs and to predict the activities of individual molecules in three case studies dopamine receptor D2, factor Xa, and SARS-CoV-2 main protease. RESULTS AND

CONCLUSIONS:

Our results provide strong support for the hypothesis that indeed QSAR models frequently fail to predict ACs. We observe low AC-sensitivity amongst the evaluated models when the activities of both compounds are unknown, but a substantial increase in AC-sensitivity when the actual activity of one of the compounds is given. Graph isomorphism features are found to be competitive with or superior to classical molecular representations for AC-classification and can thus be employed as baseline AC-prediction models or simple compound-optimisation tools. For general QSAR-prediction, however, extended-connectivity fingerprints still consistently deliver the best performance amongs the tested input representations. A potential future pathway to improve QSAR-modelling performance might be the development of techniques to increase AC-sensitivity.
Keywords

Full text: Available Collection: International databases Database: MEDLINE Type of study: Experimental Studies / Prognostic study / Randomized controlled trials Language: English Journal: J Cheminform Year: 2023 Document Type: Article Affiliation country: S13321-023-00708-w

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Type of study: Experimental Studies / Prognostic study / Randomized controlled trials Language: English Journal: J Cheminform Year: 2023 Document Type: Article Affiliation country: S13321-023-00708-w