Your browser doesn't support javascript.
Target-Specific Machine Learning Scoring Function Improved Structure-Based Virtual Screening Performance for SARS-CoV-2 Drugs Development.
Tahir Ul Qamar, Muhammad; Zhu, Xi-Tong; Chen, Ling-Ling; Alhussain, Laila; Alshiekheid, Maha A; Theyab, Abdulrahman; Algahtani, Mohammad.
  • Tahir Ul Qamar M; State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Life Science and Technology, Guangxi University, Nanning 530004, China.
  • Zhu XT; National Key Laboratory of Crop Genetic Improvement, Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
  • Chen LL; State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Life Science and Technology, Guangxi University, Nanning 530004, China.
  • Alhussain L; National Key Laboratory of Crop Genetic Improvement, Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
  • Alshiekheid MA; Department of Biology, College of Science, Qassim University, Buraydah 51452, Saudi Arabia.
  • Theyab A; Department of Botany & Microbiology, College of Science, King Saud University, Riyadh 11451, Saudi Arabia.
  • Algahtani M; Department of Laboratory & Blood Bank, Security Forces Hospital, P.O. Box 14799, Mecca 21955, Saudi Arabia.
Int J Mol Sci ; 23(19)2022 Sep 20.
Article in English | MEDLINE | ID: covidwho-2043766
ABSTRACT
Leveraging machine learning has been shown to improve the accuracy of structure-based virtual screening. Furthermore, a tremendous amount of empirical data is publicly available, which further enhances the performance of the machine learning approach. In this proof-of-concept study, the 3CLpro enzyme of SARS-CoV-2 was used. Structure-based virtual screening relies heavily on scoring functions. It is widely accepted that target-specific scoring functions may perform more effectively than universal scoring functions in real-world drug research and development processes. It would be beneficial to drug discovery to develop a method that can effectively build target-specific scoring functions. In the current study, the bindingDB database was used to retrieve experimental data. Smina was utilized to generate protein-ligand complexes for the extraction of InteractionFingerPrint (IFP) and SimpleInteractionFingerPrint SIFP fingerprints via the open drug discovery tool (oddt). The present study found that randomforestClassifier and randomforestRegressor performed well when used with the above fingerprints along the Molecular ACCess System (MACCS), Extended Connectivity Fingerprint (ECFP4), and ECFP6. It was found that the area under the precision-recall curve was 0.80, which is considered a satisfactory level of accuracy. In addition, our enrichment factor analysis indicated that our trained scoring function ranked molecules correctly compared to smina's generic scoring function. Further molecular dynamics simulations indicated that the top-ranked molecules identified by our developed scoring function were highly stable in the active site, supporting the validity of our developed process. This research may provide a template for developing target-specific scoring functions against specific enzyme targets.
Subject(s)
Keywords

Full text: Available Collection: International databases Database: MEDLINE Main subject: SARS-CoV-2 / COVID-19 Drug Treatment Type of study: Prognostic study Limits: Humans Language: English Year: 2022 Document Type: Article Affiliation country: Ijms231911003

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: SARS-CoV-2 / COVID-19 Drug Treatment Type of study: Prognostic study Limits: Humans Language: English Year: 2022 Document Type: Article Affiliation country: Ijms231911003