CTDUNet: A Multimodal CNN-Transformer Dual U-Shaped Network with Coordinate Space Attention for Camellia oleifera Pests and Diseases Segmentation in Complex Environments.
Guo, Ruitian; Zhang, Ruopeng; Zhou, Hao; Xie, Tunjun; Peng, Yuting; Chen, Xili; Yu, Guo; Wan, Fangying; Li, Lin; Zhang, Yongzhong; Liu, Ruifeng.
Affiliation
  • Guo R; School of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha 410004, China.
  • Zhang R; School of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha 410004, China.
  • Zhou H; School of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha 410004, China.
  • Xie T; School of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha 410004, China.
  • Peng Y; School of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha 410004, China.
  • Chen X; School of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha 410004, China.
  • Yu G; School of Business, Central South University of Forestry and Technology, Changsha 410004, China.
  • Wan F; School of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha 410004, China.
  • Li L; School of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha 410004, China.
  • Zhang Y; School of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha 410004, China.
  • Liu R; School of Forestry, Central South University of Forestry and Technology, Changsha 410004, China.
Plants (Basel); 13(16), 2024 Aug 15.
Article in English | MEDLINE | ID: mdl-39204710
ABSTRACT
Camellia oleifera is a crop of high economic value, yet it is particularly susceptible to various diseases and pests that significantly reduce its yield and quality. Consequently, the precise segmentation and classification of diseased Camellia leaves are vital for managing pests and diseases effectively. Deep learning exhibits significant advantages in the segmentation of plant diseases and pests, particularly in complex image processing and automated feature extraction. However, when single-modal models are used to segment Camellia oleifera diseases, three critical challenges arise: (A) lesions may closely resemble the complex background in color; (B) small sections of diseased leaves overlap; and (C) multiple diseases may be present on a single leaf. These factors considerably hinder segmentation accuracy. A novel multimodal model, the CNN-Transformer Dual U-shaped Network (CTDUNet), built on a CNN-Transformer architecture, is proposed to integrate image and text information. The model first uses text data to compensate for the shortcomings of single-modal image features, enhancing its ability to distinguish lesions from environmental characteristics even when the two closely resemble one another. Additionally, we introduce Coordinate Space Attention (CSA), which focuses on the positional relationships between targets, thereby improving the segmentation of overlapping leaf edges. Furthermore, cross-attention (CA) is employed to align image and text features effectively, preserving local information and enhancing the perception and differentiation of various diseases. The CTDUNet model was evaluated on a self-built multimodal dataset and compared against several models, including DeeplabV3+, UNet, PSPNet, Segformer, HrNet, and Language meets Vision Transformer (LViT).
The experimental results demonstrate that CTDUNet achieved a mean Intersection over Union (mIoU) of 86.14%, surpassing both multimodal models and the best single-modal model by 3.91% and 5.84%, respectively. Additionally, CTDUNet exhibits well-balanced performance in the multi-class segmentation of Camellia oleifera diseases and pests. These results indicate that fusing image and text information can be applied successfully to the segmentation of Camellia diseases.
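For readers unfamiliar with the mIoU metric reported above, the following is a minimal sketch of how it is commonly computed for a single pair of label maps. It is not the authors' evaluation code: the function name `mean_iou`, the nested-list input format, and the choice to skip classes that appear in neither map are all assumptions made for illustration. Evaluations over a whole dataset usually accumulate intersections and unions across all images before dividing.

```python
from typing import Sequence


def mean_iou(pred: Sequence[Sequence[int]],
             target: Sequence[Sequence[int]],
             num_classes: int) -> float:
    """Mean Intersection over Union for one predicted/ground-truth label map.

    Hypothetical helper: each map is a 2D grid of integer class ids in
    the range [0, num_classes).
    """
    intersection = [0] * num_classes
    union = [0] * num_classes

    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p == t:
                # Pixel belongs to both the predicted and true region of class p.
                intersection[p] += 1
                union[p] += 1
            else:
                # Pixel enlarges the union of the predicted class and of the
                # true class, but adds to neither intersection.
                union[p] += 1
                union[t] += 1

    # Assumption: classes absent from both maps are excluded from the mean.
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0


if __name__ == "__main__":
    prediction = [[0, 1],
                  [1, 2]]
    ground_truth = [[0, 1],
                    [2, 2]]
    # Per-class IoU here: class 0 = 1/1, class 1 = 1/2, class 2 = 1/2.
    print(round(mean_iou(prediction, ground_truth, num_classes=3), 4))
```

In the small example, one mislabeled pixel lowers the IoU of both the predicted class and the true class, which is why mIoU penalizes confusion between similar-looking lesion types more visibly than plain pixel accuracy does.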

Full text: 1 Collections: 01-international Database: MEDLINE Language: En Journal: Plants (Basel) Publication year: 2024 Document type: Article Affiliation country: China Publication country: Switzerland
