Pesquisa | Portal Regional da BVS (teste)

ShuffleTrans: Patch-wise weight shuffle for transparent object segmentation.

Zhang, Boxiang; Wang, Zunran; Ling, Yonggen; Guan, Yuanyuan; Zhang, Shenghao; Li, Wenhui; Wei, Lei; Zhang, Chunxu.

Neural Netw ; 167: 199-212, 2023 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-37659116

RESUMO

Transparent objects widely exist in the world. The task of transparent object segmentation is challenging as the object lacks its own texture. The cue of shape information therefore gets more critical. Most existing methods, however, rely on the mechanism of simple convolution, which is good at local cues and performs weakly on global cues like shape. To solve this problem, an operation named Patch-wise Weight Shuffle is proposed to bring in the global context cue by being combined with the dynamic convolution. A network ShuffleTrans that recognizes shape better is then designed based on this operation. Besides, fitter for this task, two auxiliary modules are presented in ShuffleTrans: a Boundary and Direction Refinement Module which collects two additional information, and a Channel Attention Enhancement Module that assists the above operation. Experiments on four texture-less object segmentation datasets and two normal datasets verify the effectiveness and generality of the method. Especially, the ShuffleTrans achieved 74.93% mIoU on the Trans10k v2 test set, which is more accurate than existing methods.

Assuntos

Sinais (Psicologia) , Processamento de Imagem Assistida por Computador

Self-supervised Learning of Detailed 3D Face Reconstruction.

Chen, Yajing; Wu, Fanzi; Wang, Zeyu; Song, Yibing; Ling, Yonggen; Bao, Linchao.

IEEE Trans Image Process ; PP2020 Aug 27.

Artigo em Inglês | MEDLINE | ID: mdl-32853150

RESUMO

In this paper, we present an end-to-end learning framework for detailed 3D face reconstruction from a single image1. Our approach uses a 3DMM-based coarse model and a displacement map in UV-space to represent a 3D face. Unlike previous work addressing the problem, our learning framework does not require supervision of surrogate ground-truth 3D models computed with traditional approaches. Instead, we utilize the input image itself as supervision during learning. In the first stage, we combine a photometric loss and a facial perceptual loss between the input face and the rendered face, to regress a 3DMM-based coarse model. In the second stage, both the input image and the regressed texture of the coarse model are unwrapped into UV-space, and then sent through an image-toimage translation network to predict a displacement map in UVspace. The displacement map and the coarse model are used to render a final detailed face, which again can be compared with the original input image to serve as a photometric loss for the second stage. The advantage of learning displacement map in UV-space is that face alignment can be explicitly done during the unwrapping, thus facial details are easier to learn from large amount of data. Extensive experiments demonstrate the superiority of the proposed method over previous work.

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA