MedChatZH: A tuning LLM for traditional Chinese medicine consultations.

Tan, Yang; Zhang, Zhixing; Li, Mingchen; Pan, Fei; Duan, Hao; Huang, Zijie; Deng, Hua; Yu, Zhuohang; Yang, Chen; Shen, Guoyang; Qi, Peng; Yue, Chengyuan; Liu, Yuxian; Hong, Liang; Yu, Huiqun; Fan, Guisheng; Tang, Yun

Tan, Yang; Zhang, Zhixing; Li, Mingchen; Pan, Fei; Duan, Hao; Huang, Zijie; Deng, Hua; Yu, Zhuohang; Yang, Chen; Shen, Guoyang; Qi, Peng; Yue, Chengyuan; Liu, Yuxian; Hong, Liang; Yu, Huiqun; Fan, Guisheng; Tang, Yun.

Afiliación

Tan Y; Department of Computer Science and Technology, East China University of Science and Technology, Shanghai, 200237, China; Shanghai Artificial Intelligence Laboratory, Shanghai, 200240, China; Chongqing Artificial Intelligence Research Institute of Shanghai Jiao Tong University, 200240, China.
Zhang Z; Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.
Li M; Department of Computer Science and Technology, East China University of Science and Technology, Shanghai, 200237, China; Shanghai Artificial Intelligence Laboratory, Shanghai, 200240, China; Chongqing Artificial Intelligence Research Institute of Shanghai Jiao Tong University, 200240, China.
Pan F; Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.
Duan H; Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.
Huang Z; Department of Computer Science and Technology, East China University of Science and Technology, Shanghai, 200237, China.
Deng H; Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.
Yu Z; Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.
Yang C; Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.
Shen G; Chongqing Artificial Intelligence Research Institute of Shanghai Jiao Tong University, 200240, China.
Qi P; Chongqing Artificial Intelligence Research Institute of Shanghai Jiao Tong University, 200240, China.
Yue C; Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.
Liu Y; The University of Sydney, Sydney, 2050, Australia.
Hong L; Shanghai Artificial Intelligence Laboratory, Shanghai, 200240, China; Chongqing Artificial Intelligence Research Institute of Shanghai Jiao Tong University, 200240, China; School of Physics and Astronomy & School of Pharmacy, Shanghai Jiao Tong University, Shanghai, 200240, China.
Yu H; Department of Computer Science and Technology, East China University of Science and Technology, Shanghai, 200237, China.
Fan G; Department of Computer Science and Technology, East China University of Science and Technology, Shanghai, 200237, China.
Tang Y; Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China. Electronic address: ytang234@ecust.edu.cn.

Comput Biol Med ; 172: 108290, 2024 Apr.

Article en En | MEDLINE | ID: mdl-38503097

ABSTRACT

ABSTRACT

Generative Large Language Models (LLMs) have achieved significant success in various natural language processing tasks, including Question-Answering (QA) and dialogue systems. However, most models are trained on English data and lack strong generalization in providing answers in Chinese. This limitation is especially evident in specialized domains like traditional Chinese medical QA, where performance suffers due to the absence of fine-tuning and high-quality datasets. To address this, we introduce MedChatZH, a dialogue model optimized for Chinese medical QA based on transformer decoder with LLaMA architecture. Continued pre-training on a curated corpus of Chinese medical books is followed by fine-tuning with a carefully selected medical instruction dataset, resulting in MedChatZH outperforming several Chinese dialogue baselines on a real-world medical dialogue dataset. Our model, code, and dataset are publicly available on GitHub (https//github.com/tyang816/MedChatZH) to encourage further research in traditional Chinese medicine and LLMs.

Asunto(s)

Educación Médica; Medicina Tradicional China; Lenguaje; Derivación y Consulta; Procesamiento de Lenguaje Natural; Inteligencia Artificial

Palabras clave

Dialogue model; Fine-tuning; Generative large language models (LLMs); Question-answering (QA); Traditional Chinese medical QA

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Educación Médica / Medicina Tradicional China Idioma: En Revista: Comput Biol Med Año: 2024 Tipo del documento: Article País de afiliación: China Pais de publicación: Estados Unidos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google