Large Language Model-Based Natural Language Encoding Could Be All You Need for Drug Biomedical Association Prediction.

Zhang, Hanyu; Zhou, Yuan; Zhang, Zhichao; Sun, Huaicheng; Pan, Ziqi; Mou, Minjie; Zhang, Wei; Ye, Qing; Hou, Tingjun; Li, Honglin; Hsieh, Chang-Yu; Zhu, Feng

Zhang, Hanyu; Zhou, Yuan; Zhang, Zhichao; Sun, Huaicheng; Pan, Ziqi; Mou, Minjie; Zhang, Wei; Ye, Qing; Hou, Tingjun; Li, Honglin; Hsieh, Chang-Yu; Zhu, Feng.

Afiliación

Zhang H; Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China.
Zhou Y; College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, State Key Laboratory of Advanced Drug Delivery and Release Systems, Zhejiang University, Hangzhou 310058, China.
Zhang Z; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
Sun H; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
Pan Z; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
Mou M; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
Zhang W; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
Ye Q; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
Hou T; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
Li H; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
Hsieh CY; Innovation Center for AI and Drug Discovery, East China Normal University, Shanghai 200062, China.
Zhu F; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.

Anal Chem ; 2024 Jul 16.

Article en En | MEDLINE | ID: mdl-39011990

ABSTRACT

ABSTRACT

Analyzing drug-related interactions in the field of biomedicine has been a critical aspect of drug discovery and development. While various artificial intelligence (AI)-based tools have been proposed to analyze drug biomedical associations (DBAs), their feature encoding did not adequately account for crucial biomedical functions and semantic concepts, thereby still hindering their progress. Since the advent of ChatGPT by OpenAI in 2022, large language models (LLMs) have demonstrated rapid growth and significant success across various applications. Herein, LEDAP was introduced, which uniquely leveraged LLM-based biotext feature encoding for predicting drug-disease associations, drug-drug interactions, and drug-side effect associations. Benefiting from the large-scale knowledgebase pre-training, LLMs had great potential in drug development analysis owing to their holistic understanding of natural language and human topics. LEDAP illustrated its notable competitiveness in comparison with other popular DBA analysis tools. Specifically, even in simple conjunction with classical machine learning methods, LLM-based feature representations consistently enabled satisfactory performance across diverse DBA tasks like binary classification, multiclass classification, and regression. Our findings underpinned the considerable potential of LLMs in drug development research, indicating a catalyst for further progress in related fields.

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Anal Chem Año: 2024 Tipo del documento: Article País de afiliación: China Pais de publicación: Estados Unidos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google