Hierarchical Graph Pattern Understanding for Zero-Shot Video Object Segmentation.

Pei, Gensheng; Shen, Fumin; Yao, Yazhou; Chen, Tao; Hua, Xian-Sheng; Shen, Heng-Tao.

IEEE Trans Image Process ; 32: 5909-5920, 2023.

Article in English | MEDLINE | ID: mdl-37883290

ABSTRACT

The optical flow guidance strategy is ideal for obtaining motion information of objects in the video. It is widely utilized in video segmentation tasks. However, existing optical flow-based methods have a significant dependency on optical flow, which results in poor performance when the optical flow estimation fails for a particular scene. The temporal consistency provided by the optical flow could be effectively supplemented by modeling in a structural form. This paper proposes a new hierarchical graph neural network (GNN) architecture, dubbed hierarchical graph pattern understanding (HGPU), for zero-shot video object segmentation (ZS-VOS). Inspired by the strong ability of GNNs in capturing structural relations, HGPU innovatively leverages motion cues (i.e., optical flow) to enhance the high-order representations from the neighbors of target frames. Specifically, a hierarchical graph pattern encoder with message aggregation is introduced to acquire different levels of motion and appearance features in a sequential manner. Furthermore, a decoder is designed for hierarchically parsing and understanding the transformed multi-modal contexts to achieve more accurate and robust results. HGPU achieves state-of-the-art performance on four publicly available benchmarks (DAVIS-16, YouTube-Objects, Long-Videos and DAVIS-17). Code and pre-trained model can be found at https://github.com/NUST-Machine-Intelligence-Laboratory/HGPU.

Automated object recognition in high-resolution optical remote sensing imagery.

Yao, Yazhou; Chen, Tao; Bi, Hanbo; Cai, Xinhao; Pei, Gensheng; Yang, Guoye; Yan, Zhiyuan; Sun, Xian; Xu, Xing; Zhang, Hai.

Natl Sci Rev ; 10(6): nwad122, 2023 Jun.

Article in English | MEDLINE | ID: mdl-37324647

ABSTRACT

This paper reports the background and results of the Automated Object Recognition in Optical Remote Sensing Imagery track, one of the tracks in the 2022 International Algorithm Case Competition, and summarizes the challenges, champion solutions, and future directions.

Hierarchical Co-Attention Propagation Network for Zero-Shot Video Object Segmentation.

Pei, Gensheng; Yao, Yazhou; Shen, Fumin; Huang, Dan; Huang, Xingguo; Shen, Heng-Tao.

IEEE Trans Image Process ; 32: 2348-2359, 2023.

Article in English | MEDLINE | ID: mdl-37074884

ABSTRACT

Zero-shot video object segmentation (ZS-VOS) aims to segment foreground objects in a video sequence without prior knowledge of these objects. However, existing ZS-VOS methods often struggle to distinguish between foreground and background or to keep track of the foreground in complex scenarios. The common practice of introducing motion information, such as optical flow, can lead to overreliance on optical flow estimation. To address these challenges, we propose an encoder-decoder-based hierarchical co-attention propagation network (HCPN) capable of tracking and segmenting objects. Specifically, our model is built upon multiple collaborative evolutions of the parallel co-attention module (PCM) and the cross co-attention module (CCM). PCM captures common foreground regions among adjacent appearance and motion features, while CCM further exploits and fuses cross-modal motion features returned by PCM. Our method is progressively trained to achieve hierarchical spatio-temporal feature propagation across the entire video. Experimental results demonstrate that our HCPN outperforms all previous methods on public benchmarks, showcasing its effectiveness for ZS-VOS. Code and pre-trained model can be found at https://github.com/NUST-Machine-Intelligence-Laboratory/HCPN.
