Search | VHL Regional Portal

A Highly Efficient Model to Study the Semantics of Salient Object Detection.

Cheng, Ming-Ming; Gao, Shang-Hua; Borji, Ali; Tan, Yong-Qiang; Lin, Zheng; Wang, Meng.

IEEE Trans Pattern Anal Mach Intell ; 44(11): 8006-8021, 2022 11.

Article in English | MEDLINE | ID: mdl-34437058

ABSTRACT

CNN-based salient object detection (SOD) methods achieve impressive performance. However, the way semantic information is encoded in them and whether they are category-agnostic is less explored. One major obstacle in studying these questions is the fact that SOD models are built on top of the ImageNet pre-trained backbones which may cause information leakage and feature redundancy. To remedy this, here we first propose an extremely light-weight holistic model tied to the SOD task that can be freed from classification backbones and trained from scratch, and then employ it to study the semantics of SOD models. With the holistic network and representation redundancy reduction by a novel dynamic weight decay scheme, our model has only 100K parameters, â¼ 0.2% of parameters of large models, and performs on par with SOTA on popular SOD benchmarks. Using CSNet, we find that a) SOD and classification methods use different mechanisms, b) SOD models are category insensitive, c) ImageNet pre-training is not necessary for SOD training, and d) SOD models require far fewer parameters than the classification models. The source code is publicly available at https://mmcheng.net/sod100k/.

Subject(s)

Neural Networks, Computer , Semantics , Algorithms

Improved Protein Structure Prediction Using a New Multi-Scale Network and Homologous Templates.

Su, Hong; Wang, Wenkai; Du, Zongyang; Peng, Zhenling; Gao, Shang-Hua; Cheng, Ming-Ming; Yang, Jianyi.

Adv Sci (Weinh) ; 8(24): e2102592, 2021 12.

Article in English | MEDLINE | ID: mdl-34719864

ABSTRACT

The accuracy of de novo protein structure prediction has been improved considerably in recent years, mostly due to the introduction of deep learning techniques. In this work, trRosettaX, an improved version of trRosetta for protein structure prediction is presented. The major improvement over trRosetta consists of two folds. The first is the application of a new multi-scale network, i.e., Res2Net, for improved prediction of inter-residue geometries, including distance and orientations. The second is an attention-based module to exploit multiple homologous templates to increase the accuracy further. Compared with trRosetta, trRosettaX improves the contact precision by 6% and 8% on the free modeling targets of CASP13 and CASP14, respectively. A preliminary version of trRosettaX is ranked as one of the top server groups in CASP14's blind test. Additional benchmark test on 161 targets from CAMEO (between Jun and Sep 2020) shows that trRosettaX achieves an average TM-score ≈0.8, outperforming the top groups in CAMEO. These data suggest the effectiveness of using the multi-scale network and the benefit of incorporating homologous templates into the network. The trRosettaX algorithm is incorporated into the trRosetta server since Nov 2020. The web server, the training and inference codes are available at: https://yanglab.nankai.edu.cn/trRosetta/.

Subject(s)

Computational Biology/methods , Deep Learning , Models, Molecular , Neural Networks, Computer , Protein Conformation , Sequence Analysis, Protein/methods , Datasets as Topic

JCS: An Explainable COVID-19 Diagnosis System by Joint Classification and Segmentation.

Wu, Yu-Huan; Gao, Shang-Hua; Mei, Jie; Xu, Jun; Fan, Deng-Ping; Zhang, Rong-Guo; Cheng, Ming-Ming.

IEEE Trans Image Process ; 30: 3113-3126, 2021.

Article in English | MEDLINE | ID: mdl-33600316

ABSTRACT

Recently, the coronavirus disease 2019 (COVID-19) has caused a pandemic disease in over 200 countries, influencing billions of humans. To control the infection, identifying and separating the infected people is the most crucial step. The main diagnostic tool is the Reverse Transcription Polymerase Chain Reaction (RT-PCR) test. Still, the sensitivity of the RT-PCR test is not high enough to effectively prevent the pandemic. The chest CT scan test provides a valuable complementary tool to the RT-PCR test, and it can identify the patients in the early-stage with high sensitivity. However, the chest CT scan test is usually time-consuming, requiring about 21.5 minutes per case. This paper develops a novel Joint Classification and Segmentation (JCS) system to perform real-time and explainable COVID- 19 chest CT diagnosis. To train our JCS system, we construct a large scale COVID- 19 Classification and Segmentation (COVID-CS) dataset, with 144,167 chest CT images of 400 COVID- 19 patients and 350 uninfected cases. 3,855 chest CT images of 200 patients are annotated with fine-grained pixel-level labels of opacifications, which are increased attenuation of the lung parenchyma. We also have annotated lesion counts, opacification areas, and locations and thus benefit various diagnosis aspects. Extensive experiments demonstrate that the proposed JCS diagnosis system is very efficient for COVID-19 classification and segmentation. It obtains an average sensitivity of 95.0% and a specificity of 93.0% on the classification test set, and 78.5% Dice score on the segmentation test set of our COVID-CS dataset. The COVID-CS dataset and code are available at https://github.com/yuhuan-wu/JCS.

Subject(s)

COVID-19/diagnostic imaging , Deep Learning , Lung/diagnostic imaging , Radiographic Image Interpretation, Computer-Assisted/methods , Adolescent , Adult , Aged , Aged, 80 and over , Databases, Factual , Female , Humans , Male , Middle Aged , SARS-CoV-2 , Tomography, X-Ray Computed , Young Adult

Res2Net: A New Multi-Scale Backbone Architecture.

Gao, Shang-Hua; Cheng, Ming-Ming; Zhao, Kai; Zhang, Xin-Yu; Yang, Ming-Hsuan; Torr, Philip.

IEEE Trans Pattern Anal Mach Intell ; 43(2): 652-662, 2021 02.

Article in English | MEDLINE | ID: mdl-31484108

ABSTRACT

Representing features at multiple scales is of great importance for numerous vision tasks. Recent advances in backbone convolutional neural networks (CNNs) continually demonstrate stronger multi-scale representation ability, leading to consistent performance gains on a wide range of applications. However, most existing methods represent the multi-scale features in a layer-wise manner. In this paper, we propose a novel building block for CNNs, namely Res2Net, by constructing hierarchical residual-like connections within one single residual block. The Res2Net represents multi-scale features at a granular level and increases the range of receptive fields for each network layer. The proposed Res2Net block can be plugged into the state-of-the-art backbone CNN models, e.g., ResNet, ResNeXt, and DLA. We evaluate the Res2Net block on all these models and demonstrate consistent performance gains over baseline models on widely-used datasets, e.g., CIFAR-100 and ImageNet. Further ablation studies and experimental results on representative computer vision tasks, i.e., object detection, class activation mapping, and salient object detection, further verify the superiority of the Res2Net over the state-of-the-art baseline methods. The source code and trained models are available on https://mmcheng.net/res2net/.

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL