Search | VHL Regional Portal (test)

1.

Learning Skill Characteristics From Manipulations.

Zhou, Xiao-Hu; Xie, Xiao-Liang; Liu, Shi-Qi; Ni, Zhen-Liang; Zhou, Yan-Jie; Li, Rui-Qi; Gui, Mei-Jiang; Fan, Chen-Chen; Feng, Zhen-Qiu; Bian, Gui-Bin; Hou, Zeng-Guang.

IEEE Trans Neural Netw Learn Syst ; 34(12): 9727-9741, 2023 Dec.

Article in English | MEDLINE | ID: mdl-35333726

ABSTRACT

Percutaneous coronary intervention (PCI) has increasingly become the main treatment for coronary artery disease. The procedure requires considerable experience and dexterous manipulation. However, there are few techniques to model PCI skill so far. In this study, a learning framework with local and ensemble learning is proposed to learn skill characteristics of different skill-level subjects from their PCI manipulations. Ten interventional cardiologists (four experts and six novices) were recruited to deliver a medical guidewire to two target arteries on a porcine model for in vivo studies. Simultaneously, translation and twist manipulations of the thumb, forefinger, and wrist were acquired with electromagnetic (EM) and fiber-optic bend (FOB) sensors, respectively. These behavior data were then processed with wavelet packet decomposition (WPD) at 1-10 levels for feature extraction. The feature vectors were further fed into three candidate individual classifiers in the local learning layer. Furthermore, the local learning results from different manipulation behaviors were fused in the ensemble learning layer with three rule-based ensemble learning algorithms. In subject-dependent skill characteristics learning, the ensemble learning achieves 100% accuracy, significantly outperforming the best local result (90%). Ensemble learning also maintains 73% accuracy in subject-independent schemes. These promising results demonstrate the great potential of the proposed method to facilitate skill learning in surgical robotics and skill assessment in clinical practice.
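As an illustration of the WPD feature-extraction step mentioned in this abstract, the following is a minimal sketch only. The abstract does not name the wavelet or the features, so the Haar wavelet, the sub-band energy features, and the function names `haar_split` and `wpd_energy_features` are all assumptions made for this example.

```python
import math


def haar_split(signal):
    """One Haar analysis step: return (approximation, detail), each half as long."""
    s = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal) - 1, 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal) - 1, 2)]
    return approx, detail


def wpd_energy_features(signal, levels):
    """Split every node at every level (a full wavelet packet tree) and
    return the energy of each of the 2**levels terminal sub-bands.

    Assumption: Haar wavelet and energy features; the paper may use others.
    """
    if len(signal) % (2 ** levels) != 0:
        raise ValueError("signal length must be divisible by 2**levels")
    nodes = [list(signal)]
    for _ in range(levels):
        next_nodes = []
        for node in nodes:
            approx, detail = haar_split(node)
            next_nodes.extend([approx, detail])
        nodes = next_nodes
    return [sum(x * x for x in node) for node in nodes]


# Hypothetical motion trace, not data from the study.
signal = [1.0, 2.0, 3.0, 4.0, 4.0, 3.0, 2.0, 1.0]
features = wpd_energy_features(signal, levels=2)
```

Because the Haar transform is orthonormal, the sub-band energies sum to the energy of the input signal, which gives a quick sanity check on the decomposition.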

Subjects

Percutaneous Coronary Intervention; Robotics; Humans; Animals; Swine; Neural Networks, Computer; Algorithms; Learning

2.

Spatio-Temporal Causal Transformer for Multi-Grained Surgical Phase Recognition.

Chen, Hua-Bin; Li, Zhen; Fu, Pan; Ni, Zhen-Liang; Bian, Gui-Bin.

Annu Int Conf IEEE Eng Med Biol Soc ; 2022: 1663-1666, 2022 07.

Article in English | MEDLINE | ID: mdl-36086459

ABSTRACT

Automatic surgical phase recognition plays a key role in surgical workflow analysis and overall optimization in clinical work. In complicated surgical procedures, similar inter-class appearance and drastic variability in phase duration still make this a challenging task. In this paper, a spatio-temporal transformer is proposed for online surgical phase recognition at different granularities. To extract rich spatial information, a spatial transformer is used to model the global spatial dependencies at each time index. To overcome the variability in phase duration, a temporal transformer captures the multi-scale temporal context of different time indexes with a dual pyramid pattern. Our method is thoroughly validated on the public Cholec80 dataset with 7 coarse-grained phases and the CATARACTS2020 dataset with 19 fine-grained phases, outperforming state-of-the-art approaches with 91.4% and 84.2% accuracy, respectively, while using only 24.5M parameters.

Subjects

Algorithms; Workflow

3.

Space Squeeze Reasoning and Low-Rank Bilinear Feature Fusion for Surgical Image Segmentation.

Ni, Zhen-Liang; Bian, Gui-Bin; Li, Zhen; Zhou, Xiao-Hu; Li, Rui-Qi; Hou, Zeng-Guang.

IEEE J Biomed Health Inform ; 26(7): 3209-3217, 2022 07.

Article in English | MEDLINE | ID: mdl-35226612

ABSTRACT

Surgical image segmentation is critical for surgical robot control and computer-assisted surgery. In the surgical scene, the local features of objects are highly similar, and the illumination interference is strong, which makes surgical image segmentation challenging. To address the above issues, a bilinear squeeze reasoning network is proposed for surgical image segmentation. In it, the space squeeze reasoning module is proposed, which adopts height pooling and width pooling to squeeze global contexts in the vertical and horizontal directions, respectively. The similarity between each horizontal position and each vertical position is calculated to encode long-range semantic dependencies and establish the affinity matrix. The feature maps are also squeezed from both the vertical and horizontal directions to model channel relations. Guided by channel relations, the affinity matrix is expanded to the same size as the input features. It captures long-range semantic dependencies from different directions, helping address the local similarity issue. Besides, a low-rank bilinear fusion module is proposed to enhance the model's ability to recognize similar features. This module is based on the low-rank bilinear model to capture the inter-layer feature relations. It integrates the location details from low-level features and semantic information from high-level features. Various semantics can be represented more accurately, which effectively improves feature representation. The proposed network achieves state-of-the-art performance on cataract image segmentation dataset CataSeg and robotic image segmentation dataset EndoVis 2018.
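The squeeze-and-compare idea described in this abstract can be sketched as follows. This is a simplified illustration, not the authors' module: average pooling, dot-product similarity, and the function name `squeeze_affinity` are assumptions, and the channel-guided expansion step is omitted.

```python
def squeeze_affinity(features):
    """Build an H x W affinity matrix from a feature map.

    features: nested list indexed [channel][row][col].
    Assumptions: average pooling and dot-product similarity.
    """
    channels = len(features)
    height = len(features[0])
    width = len(features[0][0])
    # Pool across the width: one C-dimensional descriptor per row.
    row_desc = [
        [sum(features[c][i]) / width for c in range(channels)]
        for i in range(height)
    ]
    # Pool across the height: one C-dimensional descriptor per column.
    col_desc = [
        [sum(features[c][i][j] for i in range(height)) / height for c in range(channels)]
        for j in range(width)
    ]
    # Similarity between every row descriptor and every column descriptor.
    return [
        [sum(r * q for r, q in zip(row_desc[i], col_desc[j])) for j in range(width)]
        for i in range(height)
    ]


# Toy feature map with C=1, H=2, W=2 (hypothetical values).
features = [[[1.0, 3.0], [5.0, 7.0]]]
affinity = squeeze_affinity(features)
```

Pooling first means the affinity is computed between H row descriptors and W column descriptors rather than between all H*W pixel pairs, which is why this kind of squeezing keeps long-range context cheap.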

Subjects

Image Processing, Computer-Assisted; Surgery, Computer-Assisted; Humans; Image Processing, Computer-Assisted/methods; Neural Networks, Computer; Semantics

4.

TR-GAN: Multi-Session Future MRI Prediction With Temporal Recurrent Generative Adversarial Network.

Fan, Chen-Chen; Peng, Liang; Wang, Tian; Yang, Hongjun; Zhou, Xiao-Hu; Ni, Zhen-Liang; Wang, Guan'an; Chen, Sheng; Zhou, Yan-Jie; Hou, Zeng-Guang.

IEEE Trans Med Imaging ; 41(8): 1925-1937, 2022 08.

Article in English | MEDLINE | ID: mdl-35148262

ABSTRACT

Magnetic Resonance Imaging (MRI) has been proven to be an efficient way to diagnose Alzheimer's disease (AD). Recent dramatic progress on deep learning greatly promotes the MRI analysis based on data-driven CNN methods using a large-scale longitudinal MRI dataset. However, most of the existing MRI datasets are fragmented due to unexpected quits of volunteers. To tackle this problem, we propose a novel Temporal Recurrent Generative Adversarial Network (TR-GAN) to complete missing sessions of MRI datasets. Unlike existing GAN-based methods, which either fail to generate future sessions or only generate fixed-length sessions, TR-GAN takes all past sessions to recurrently and smoothly generate future ones with variant length. Specifically, TR-GAN adopts recurrent connection to deal with variant input sequence length and flexibly generate future variant sessions. Besides, we also design a multiple scale & location (MSL) module and a SWAP module to encourage the model to better focus on detailed information, which helps to generate high-quality MRI data. Compared with other popular GAN architectures, TR-GAN achieved the best performance in all evaluation metrics of two datasets. After expanding the Whole MRI dataset, the balanced accuracy of AD vs. cognitively normal (CN) vs. mild cognitive impairment (MCI) and stable MCI vs. progressive MCI classification can be increased by 3.61% and 4.00%, respectively.

Subjects

Alzheimer Disease; Cognitive Dysfunction; Alzheimer Disease/diagnostic imaging; Cognitive Dysfunction/diagnostic imaging; Humans; Magnetic Resonance Imaging

5.

A Multilayer and Multimodal-Fusion Architecture for Simultaneous Recognition of Endovascular Manipulations and Assessment of Technical Skills.

Zhou, Xiao-Hu; Xie, Xiao-Liang; Feng, Zhen-Qiu; Hou, Zeng-Guang; Bian, Gui-Bin; Li, Rui-Qi; Ni, Zhen-Liang; Liu, Shi-Qi; Zhou, Yan-Jie.

IEEE Trans Cybern ; 52(4): 2565-2577, 2022 Apr.

Article in English | MEDLINE | ID: mdl-32697730

ABSTRACT

The clinical success of percutaneous coronary intervention (PCI) is highly dependent on the endovascular manipulation skills and dexterous manipulation strategies of interventionalists. However, the analysis of endovascular manipulations and related discussion for technical skill assessment are limited. In this study, a multilayer and multimodal-fusion architecture is proposed to recognize six typical endovascular manipulations. The synchronously acquired multimodal motion signals from ten subjects are used as the inputs of the architecture independently. Six classification-based and two rule-based fusion algorithms are evaluated for performance comparisons. The recognition metrics under the determined architecture are further used to assess technical skills. The experimental results indicate that the proposed architecture achieves an overall accuracy of 96.41%, much higher than that of a single-layer recognition architecture (92.85%). In addition, the multimodal fusion brings significant performance improvement in comparison with single-modal schemes. Furthermore, the K-means-based skill assessment obtains an accuracy of 95% in clustering the attempts made by different skill-level groups. These promising results indicate the great potential of the architecture to facilitate clinical skill assessment and skill learning.

Subjects

Percutaneous Coronary Intervention; Algorithms; Clinical Competence; Humans; Learning

6.

SurgiNet: Pyramid Attention Aggregation and Class-wise Self-Distillation for Surgical Instrument Segmentation.

Ni, Zhen-Liang; Zhou, Xiao-Hu; Wang, Guan-An; Yue, Wen-Qian; Li, Zhen; Bian, Gui-Bin; Hou, Zeng-Guang.

Med Image Anal ; 76: 102310, 2022 02.

Article in English | MEDLINE | ID: mdl-34954623

ABSTRACT

Surgical instrument segmentation plays a promising role in robot-assisted surgery. However, illumination issues often appear in surgical scenes, altering the color and texture of surgical instruments. Changes in visual features make surgical instrument segmentation difficult. To address illumination issues, the SurgiNet is proposed to learn pyramid attention features. The double attention module is designed to capture the semantic dependencies between locations and channels. Based on semantic dependencies, the semantic features in the disturbed area can be inferred for addressing illumination issues. Pyramid attention is aggregated to capture multi-scale features and make predictions more accurate. To perform model compression, class-wise self-distillation is proposed to enhance the representation learning of the network, which performs feature distillation within the class to eliminate interference from other classes. Top-down and multi-stage knowledge distillation is designed to distill class probability maps. By inter-layer supervision, high-level probability maps are applied to calibrate the probability distribution of low-level probability maps. Since class-wise distillation enhances the self-learning of the network, the network can get excellent performance with a lightweight backbone. The proposed network achieves the state-of-the-art performance of 89.14% mIoU on CataIS with only 1.66 GFlops and 2.05 M parameters. It also takes first place on EndoVis 2017 with 66.30% mIoU.

Subjects

Image Processing, Computer-Assisted; Humans; Attention; Semantics; Surgical Instruments

7.

A Unified Framework for Multi-Guidewire Endpoint Localization in Fluoroscopy Images.

Li, Rui-Qi; Xie, Xiao-Liang; Zhou, Xiao-Hu; Liu, Shi-Qi; Ni, Zhen-Liang; Zhou, Yan-Jie; Bian, Gui-Bin; Hou, Zeng-Guang.

IEEE Trans Biomed Eng ; 69(4): 1406-1416, 2022 04.

Article in English | MEDLINE | ID: mdl-34613905

ABSTRACT

OBJECTIVE: In this paper, Keypoint Localization Region-based CNN (KL R-CNN) is proposed, which can simultaneously accomplish guidewire detection and endpoint localization in a unified model. METHODS: KL R-CNN modifies Mask R-CNN by replacing the mask branch with a novel keypoint localization branch. In addition, some settings of Mask R-CNN are modified to generate the keypoint localization results at a higher level of detail. Based on the existing metrics of Average Precision (AP) and Percentage of Correct Keypoints (PCK), a new metric named APPCK is proposed to evaluate the overall performance on the multi-guidewire endpoint localization task. Compared with existing metrics, APPCK is easy to use and its results are more intuitive. RESULTS: Compared with existing methods, KL R-CNN performs better when the threshold is loose, reaching a mean APPCK of 90.65% when the threshold is 9 pixels. CONCLUSION: KL R-CNN achieves state-of-the-art performance on the multi-guidewire endpoint localization task and has application potential. SIGNIFICANCE: KL R-CNN can localize guidewire endpoints in fluoroscopy images, which is a prerequisite for computer-assisted percutaneous coronary intervention. KL R-CNN can also be extended to other multi-instrument localization tasks.
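For readers unfamiliar with PCK, one of the two metrics that APPCK builds on, here is a minimal sketch using an absolute pixel threshold as in the abstract. It assumes predictions are already matched one-to-one with ground-truth keypoints, and it does not implement APPCK, whose definition is not given here. The function name `pck` and the coordinates are hypothetical.

```python
import math


def pck(predicted, ground_truth, threshold):
    """Fraction of predicted keypoints lying within `threshold` pixels
    of their matched ground-truth keypoint."""
    if len(predicted) != len(ground_truth) or not predicted:
        raise ValueError("need two non-empty keypoint lists of equal length")
    correct = sum(
        1
        for (px, py), (gx, gy) in zip(predicted, ground_truth)
        if math.hypot(px - gx, py - gy) <= threshold
    )
    return correct / len(predicted)


# Hypothetical endpoint coordinates in pixels (errors of 5, 2 and 20 pixels).
predicted = [(10.0, 10.0), (50.0, 52.0), (100.0, 120.0)]
ground_truth = [(13.0, 14.0), (50.0, 50.0), (100.0, 100.0)]
score = pck(predicted, ground_truth, threshold=9.0)
```

With a 9-pixel threshold, two of the three example endpoints count as correct; loosening or tightening the threshold changes the score, which is why results such as those in the abstract are reported at a stated threshold.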

Subjects

Image Processing, Computer-Assisted; Percutaneous Coronary Intervention; Catheterization; Fluoroscopy; Image Processing, Computer-Assisted/methods

8.

Bilinear neural network with 3-D attention for brain decoding of motor imagery movements from the human EEG.

Fan, Chen-Chen; Yang, Hongjun; Hou, Zeng-Guang; Ni, Zhen-Liang; Chen, Sheng; Fang, Zhijie.

Cogn Neurodyn ; 15(1): 181-189, 2021 Feb.

Article in English | MEDLINE | ID: mdl-33786088

ABSTRACT

Deep learning has achieved great success in areas such as computer vision and natural language processing. Previous work has used convolutional networks to process EEG signals, matching or exceeding traditional machine learning methods. We propose a novel network structure called QNet. It contains a newly designed attention module, 3D-AM, which is used to learn the attention weights of EEG channels, time points, and feature maps. It provides a way to automatically learn the electrode and time selection. QNet uses a dual-branch structure to fuse bilinear vectors for classification. It is evaluated on four-, three-, and two-class classification on the EEG Motor Movement/Imagery Dataset. Average cross-validation accuracies of 65.82%, 74.75%, and 82.88% were obtained, outperforming the state of the art by 7.24%, 4.93%, and 2.45%, respectively. The article also visualizes the attention weights learned by QNet and shows its possible application for electrode channel selection.

9.

Real-Time Multi-Guidewire Endpoint Localization in Fluoroscopy Images.

Li, Rui-Qi; Xie, Xiao-Liang; Zhou, Xiao-Hu; Liu, Shi-Qi; Ni, Zhen-Liang; Zhou, Yan-Jie; Bian, Gui-Bin; Hou, Zeng-Guang.

IEEE Trans Med Imaging ; 40(8): 2002-2014, 2021 08.

Article in English | MEDLINE | ID: mdl-33788685

ABSTRACT

The real-time localization of the guidewire endpoints is a stepping stone to computer-assisted percutaneous coronary intervention (PCI). However, methods for multi-guidewire endpoint localization in fluoroscopy images are still scarce. In this paper, we introduce a framework for real-time multi-guidewire endpoint localization in fluoroscopy images. The framework consists of two stages, first detecting all guidewire instances in the fluoroscopy image, and then locating the endpoints of each single guidewire instance. In the first stage, a YOLOv3 detector is used for guidewire detection, and a post-processing algorithm is proposed to refine the guidewire detection results. In the second stage, a Segmentation Attention-hourglass (SA-hourglass) network is proposed to predict the endpoint locations of each single guidewire instance. The SA-hourglass network can be generalized to the keypoint localization of other surgical instruments. In our experiments, the SA-hourglass network is applied not only on a guidewire dataset but also on a retinal microsurgery dataset, reaching the mean pixel error (MPE) of 2.20 pixels on the guidewire dataset and the MPE of 5.30 pixels on the retinal microsurgery dataset, both achieving the state-of-the-art localization results. Besides, the inference rate of our framework is at least 20FPS, which meets the real-time requirement of fluoroscopy images (6-12FPS).
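The mean pixel error (MPE) reported in this abstract can be illustrated with a short sketch. It assumes MPE is the mean Euclidean distance, in pixels, between matched predicted and ground-truth endpoints; the function name `mean_pixel_error` and the coordinates are hypothetical.

```python
import math


def mean_pixel_error(predicted, ground_truth):
    """Mean Euclidean distance in pixels between matched keypoint pairs."""
    if len(predicted) != len(ground_truth) or not predicted:
        raise ValueError("need two non-empty keypoint lists of equal length")
    distances = [
        math.hypot(px - gx, py - gy)
        for (px, py), (gx, gy) in zip(predicted, ground_truth)
    ]
    return sum(distances) / len(distances)


# Hypothetical endpoints: one exact hit and one 3-4-5 miss.
predicted = [(0.0, 0.0), (3.0, 4.0)]
ground_truth = [(0.0, 0.0), (0.0, 0.0)]
mpe = mean_pixel_error(predicted, ground_truth)
```

Unlike a thresholded metric, MPE is sensitive to how far off each miss is, so a single large outlier can dominate the average.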

Subjects

Percutaneous Coronary Intervention; Algorithms; Catheterization; Fluoroscopy; Humans

10.

Comparative validation of multi-instance instrument segmentation in endoscopy: Results of the ROBUST-MIS 2019 challenge.

Roß, Tobias; Reinke, Annika; Full, Peter M; Wagner, Martin; Kenngott, Hannes; Apitz, Martin; Hempe, Hellena; Mindroc-Filimon, Diana; Scholz, Patrick; Tran, Thuy Nuong; Bruno, Pierangela; Arbeláez, Pablo; Bian, Gui-Bin; Bodenstedt, Sebastian; Bolmgren, Jon Lindström; Bravo-Sánchez, Laura; Chen, Hua-Bin; González, Cristina; Guo, Dong; Halvorsen, Pål; Heng, Pheng-Ann; Hosgor, Enes; Hou, Zeng-Guang; Isensee, Fabian; Jha, Debesh; Jiang, Tingting; Jin, Yueming; Kirtac, Kadir; Kletz, Sabrina; Leger, Stefan; Li, Zhixuan; Maier-Hein, Klaus H; Ni, Zhen-Liang; Riegler, Michael A; Schoeffmann, Klaus; Shi, Ruohua; Speidel, Stefanie; Stenzel, Michael; Twick, Isabell; Wang, Gutai; Wang, Jiacheng; Wang, Liansheng; Wang, Lu; Zhang, Yujie; Zhou, Yan-Jie; Zhu, Lei; Wiesenfarth, Manuel; Kopp-Schneider, Annette; Müller-Stich, Beat P; Maier-Hein, Lena.

Med Image Anal ; 70: 101920, 2021 05.

Article in English | MEDLINE | ID: mdl-33676097

ABSTRACT

Intraoperative tracking of laparoscopic instruments is often a prerequisite for computer and robotic-assisted interventions. While numerous methods for detecting, segmenting and tracking of medical instruments based on endoscopic video images have been proposed in the literature, key limitations remain to be addressed: Firstly, robustness, that is, the reliable performance of state-of-the-art methods when run on challenging images (e.g. in the presence of blood, smoke or motion artifacts). Secondly, generalization; algorithms trained for a specific intervention in a specific hospital should generalize to other interventions or institutions. In an effort to promote solutions for these limitations, we organized the Robust Medical Instrument Segmentation (ROBUST-MIS) challenge as an international benchmarking competition with a specific focus on the robustness and generalization capabilities of algorithms. For the first time in the field of endoscopic image processing, our challenge included a task on binary segmentation and also addressed multi-instance detection and segmentation. The challenge was based on a surgical data set comprising 10,040 annotated images acquired from a total of 30 surgical procedures from three different types of surgery. The validation of the competing methods for the three tasks (binary segmentation, multi-instance detection and multi-instance segmentation) was performed in three different stages with an increasing domain gap between the training and the test data. The results confirm the initial hypothesis, namely that algorithm performance degrades with an increasing domain gap. While the average detection and segmentation quality of the best-performing algorithms is high, future research should concentrate on detection and segmentation of small, crossing, moving and transparent instrument(s) (parts).

Subjects

Image Processing, Computer-Assisted; Laparoscopy; Algorithms; Artifacts

11.

RASNet: Segmentation for Tracking Surgical Instruments in Surgical Videos Using Refined Attention Segmentation Network.

Ni, Zhen-Liang; Bian, Gui-Bin; Xie, Xiao-Liang; Hou, Zeng-Guang; Zhou, Xiao-Hu; Zhou, Yan-Jie.

Annu Int Conf IEEE Eng Med Biol Soc ; 2019: 5735-5738, 2019 Jul.

Article in English | MEDLINE | ID: mdl-31947155

ABSTRACT

Segmentation for tracking surgical instruments plays an important role in robot-assisted surgery. Segmentation of surgical instruments contributes to capturing accurate spatial information for tracking. In this paper, a novel network, the Refined Attention Segmentation Network, is proposed to simultaneously segment surgical instruments and identify their categories. A U-shaped network, a popular architecture for segmentation, is used. Unlike previous work, an attention module is adopted to help the network focus on key regions, which can improve the segmentation accuracy. To address the class imbalance problem, the weighted sum of the cross-entropy loss and the logarithm of the Jaccard index is used as the loss function. Furthermore, transfer learning is adopted in our network: the encoder is pre-trained on ImageNet. The dataset from the MICCAI EndoVis Challenge 2017 is used to evaluate our network. On this dataset, our network achieves state-of-the-art performance, with 94.65% mean Dice and 90.33% mean IoU.
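The loss described in this abstract can be sketched for the binary case as follows. This is an illustration under stated assumptions: the abstract gives neither the weight nor the exact form, so the weight `alpha`, the soft (probability-based) Jaccard index, the subtraction of the log term, and the function name `ce_log_jaccard_loss` are all choices made for this example.

```python
import math


def ce_log_jaccard_loss(probs, targets, alpha=0.3, eps=1e-7):
    """Cross entropy minus alpha * log(soft Jaccard) over flattened pixels.

    probs: predicted foreground probabilities in [0, 1].
    targets: ground-truth labels, 0 or 1.
    alpha and the soft Jaccard form are assumptions, not values from the paper.
    """
    n = len(probs)
    cross_entropy = -sum(
        t * math.log(p + eps) + (1 - t) * math.log(1 - p + eps)
        for p, t in zip(probs, targets)
    ) / n
    intersection = sum(p * t for p, t in zip(probs, targets))
    union = sum(probs) + sum(targets) - intersection
    jaccard = (intersection + eps) / (union + eps)
    # log(jaccard) <= 0, so subtracting it penalizes poor overlap.
    return cross_entropy - alpha * math.log(jaccard)


# Hypothetical three-pixel example: a good and a poor prediction.
targets = [1, 0, 1]
good = ce_log_jaccard_loss([0.9, 0.1, 0.8], targets)
bad = ce_log_jaccard_loss([0.4, 0.6, 0.3], targets)
```

The overlap term depends on the ratio of intersection to union rather than on per-pixel counts, which is the usual motivation for pairing it with cross entropy when foreground pixels are rare.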

Subjects

Image Processing, Computer-Assisted; Surgical Instruments; Attention
