Search | VHL Regional Portal

Hierarchical medical image report adversarial generation with hybrid discriminator.

Zhang, Junsan; Cheng, Ming; Cheng, Qiaoqiao; Shen, Xiuxuan; Wan, Yao; Zhu, Jie; Liu, Mengxuan.

Artif Intell Med ; 151: 102846, 2024 May.

Article in English | MEDLINE | ID: mdl-38547777

ABSTRACT

BACKGROUND AND OBJECTIVES: Generating coherent reports from medical images is an important task for reducing doctors' workload. Unlike traditional image captioning tasks, the task of medical image report generation faces more challenges. Current models for generating reports from medical images often fail to characterize some abnormal findings, and some models generate reports with low quality. In this study, we propose a model to generate high-quality reports from medical images. METHODS: In this paper, we propose a model called Hybrid Discriminator Generative Adversarial Network (HDGAN), which combines Generative Adversarial Network (GAN) with Reinforcement Learning (RL). The HDGAN model consists of a generator, a one-sentence discriminator, and a one-word discriminator. Specifically, the RL reward signals are judged on the one-sentence discriminator and one-word discriminator separately. The one-sentence discriminator can better learn sentence-level structural information, while the one-word discriminator can learn word diversity information effectively. RESULTS: Our approach performs better on the IU-X-ray and COV-CTR datasets than the baseline models. For the ROUGE metric, our method outperforms the state-of-the-art model by 0.36 on the IU-X-ray, 0.06 on the MIMIC-CXR and 0.156 on the COV-CTR. CONCLUSIONS: The compositional framework we proposed can generate more accurate medical image reports at different levels.

Subject(s)

Deep Learning , Diagnostic Imaging , Image Processing, Computer-Assisted , Neural Networks, Computer , Datasets as Topic , Diagnostic Imaging/methods , Image Processing, Computer-Assisted/methods , Radiography, Thoracic , Thorax/diagnostic imaging , Humans

A Novel Deep Learning Model for Medical Report Generation by Inter-Intra Information Calibration.

Zhang, Junsan; Shen, Xiuxuan; Wan, Shaohua; Goudos, Sotirios K; Wu, Jie; Cheng, Ming; Zhang, Weishan.

IEEE J Biomed Health Inform ; 27(10): 5110-5121, 2023 10.

Article in English | MEDLINE | ID: mdl-37018727

ABSTRACT

Automatic generation of medical reports can provide diagnostic assistance to doctors and reduce their workload. To improve the quality of the generated medical reports, injecting auxiliary information through knowledge graphs or templates into the model is widely adopted in previous methods. However, they suffer from two problems: 1) The injected external information is limited in amount and difficult to adequately meet the information needs of medical report generation in content. 2) The injected external information increases the complexity of model and is hard to be reasonably integrated into the generation process of medical reports. Therefore, we propose an Information Calibrated Transformer (ICT) to address the above issues. First, we design a Precursor-information Enhancement Module (PEM), which can effectively extract numerous inter-intra report features from the datasets as the auxiliary information without external injection. And the auxiliary information can be dynamically updated with the training process. Secondly, a combination mode, which consists of PEM and our proposed Information Calibration Attention Module (ICA), is designed and embedded into ICT. In this method, the auxiliary information extracted from PEM is flexibly injected into ICT and the increment of model parameters is small. The comprehensive evaluations validate that the ICT is not only superior to previous methods in the X-Ray datasets, IU-X-Ray and MIMIC-CXR, but also successfully be extended to a CT COVID-19 dataset COV-CTR.

Subject(s)

COVID-19 , Deep Learning , Humans , Calibration , Electric Power Supplies , Knowledge

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL