Search | VHL Regional Portal

1.

Heterogeneous context interaction network for vehicle re-identification.

Sun, Ke; Pang, Xiyu; Zheng, Meifeng; Nie, Xiushan; Li, Xi; Zhou, Houren; Yin, Yilong.

Neural Netw ; 169: 293-306, 2024 Jan.

Article in English | MEDLINE | ID: mdl-37918272

ABSTRACT

Capturing global and subtle discriminative information using attention mechanisms is essential to address the challenge of inter-class high similarity for vehicle re-identification (Re-ID) task. Mixing self-information of nodes or modeling context based on pairwise dependencies between nodes are the core ideas of current advanced attention mechanisms. This paper aims to explore how to utilize both dependency context and self-context in an efficient way to facilitate attention to learn more effectively. We propose a heterogeneous context interaction (HCI) attention mechanism that infers the weights of nodes from the interactions of global dependency contexts and local self-contexts to enhance the effect of attention learning. To reduce computational complexity, global dependency contexts are modeled by aggregating number-compressed pairwise dependencies, and the interactions of heterogeneous contexts are restricted to a certain range. Based on this mechanism, we propose a heterogeneous context interaction network (HCI-Net), which uses channel heterogeneous context interaction module (CHCI) and spatial heterogeneous context interaction module (SHCI), and introduces a rigid partitioning strategy to extract important global and fine-grained features. In addition, we design a non-similarity constraint (NSC) that forces the HCI-Net to learn diverse subtle discriminative information. The experiment results on two large datasets, VeRi-776 and VehicleID, show that our proposed HCI-Net achieves the state-of-the-art performance. In particular, the mean average precision (mAP) reaches 83.8% on VeRi-776 dataset.

Subject(s)

Machine Learning , Neural Networks, Computer , Motor Vehicles

2.

Reformulating Graph Kernels for Self-Supervised Space-Time Correspondence Learning.

Qin, Zheyun; Lu, Xiankai; Liu, Dongfang; Nie, Xiushan; Yin, Yilong; Shen, Jianbing; Loui, Alexander C.

IEEE Trans Image Process ; 32: 6543-6557, 2023.

Article in English | MEDLINE | ID: mdl-37922168

ABSTRACT

Self-supervised space-time correspondence learning utilizing unlabeled videos holds great potential in computer vision. Most existing methods rely on contrastive learning with mining negative samples or adapting reconstruction from the image domain, which requires dense affinity across multiple frames or optical flow constraints. Moreover, video correspondence prediction models need to uncover more inherent properties of the video, such as structural information. In this work, we propose HiGraph+, a sophisticated space-time correspondence framework based on learnable graph kernels. By treating videos as a spatial-temporal graph, the learning objective of HiGraph+ is issued in a self-supervised manner, predicting the unobserved hidden graph via graph kernel methods. First, we learn the structural consistency of sub-graphs in graph-level correspondence learning. Furthermore, we introduce a spatio-temporal hidden graph loss through contrastive learning that facilitates learning temporal coherence across frames of sub-graphs and spatial diversity within the same frame. Therefore, we can predict long-term correspondences and drive the hidden graph to acquire distinct local structural representations. Then, we learn a refined representation across frames on the node-level via a dense graph kernel. The structural and temporal consistency of the graph forms the self-supervision of model training. HiGraph+ achieves excellent performance and demonstrates robustness in benchmark tests involving object, semantic part, keypoint, and instance labeling propagation tasks. Our algorithm implementations have been made publicly available at https://github.com/zyqin19/HiGraph.

3.

Biomarkers-Aware Asymmetric Bibranch GAN with Adaptive Memory Batch Normalization for Prediction of Anti-VEGF Treatment Response in Neovascular Age-related Macular Degeneration.

Zhao, Peng; Song, Xian; Xi, Xiaoming; Nie, Xiushan; Meng, Xianjing; Qu, Yi; Yin, Yilong.

IEEE J Biomed Health Inform ; PP2023 Aug 07.

Article in English | MEDLINE | ID: mdl-37549082

ABSTRACT

The emergence of anti-vascular endothelial growth factor (anti-VEGF) therapy has revolutionized neovascular age-related macular degeneration (nAMD). Post-therapeutic optical coherence tomography (OCT) imaging facilitates the prediction of therapeutic response to anti-VEGF therapy for nAMD. Although the generative adversarial network (GAN) is a popular generative model for post-therapeutic OCT image generation, it is realistically challenging to gather sufficient pre- and post-therapeutic OCT image pairs, resulting in overfitting. Moreover, the available GAN-based methods ignore local details, such as the biomarkers that are essential for nAMD treatment. To address these issues, a Biomarkers-aware Asymmetric Bibranch GAN (BAABGAN) is proposed to efficiently generate post-therapeutic OCT images. Specifically, one branch is developed to learn prior knowledge with a high degree of transferability from large-scale data, termed the source branch. Then, the source branch transfer knowledge to another branch, which is trained on small-scale paired data, termed the target branch. To boost the transferability, a novel Adaptive Memory Batch Normalization (AMBN) is introduced in the source branch, which learns more effective global knowledge that is impervious to noise via memory mechanism. Also, a novel Adaptive Biomarkers-aware Attention (ABA) module is proposed to encode biomarkers information into latent features of target branches to learn finer local details of biomarkers. The proposed method outperforms traditional GAN models and can produce high-quality post-treatment OCT pictures with limited data sets, as shown by the results of experiments.

4.

Multi-Receptive Field Soft Attention Part Learning for Vehicle Re-Identification.

Pang, Xiyu; Yin, Yilong; Zheng, Yanli.

Entropy (Basel) ; 25(4)2023 Mar 31.

Article in English | MEDLINE | ID: mdl-37190382

ABSTRACT

Vehicle re-identification across multiple cameras is one of the main problems of intelligent transportation systems (ITSs). Since the differences in the appearance between different vehicles of the same model are small and the appearance of the same vehicle changes drastically from different viewpoints, vehicle re-identification is a challenging task. In this paper, we propose a model called multi-receptive field soft attention part learning (MRF-SAPL). The MRF-SAPL model learns semantically diverse vehicle part-level features under different receptive fields through multiple local branches, alleviating the problem of small differences in vehicle appearance. To align vehicle parts from different images, this study uses soft attention to adaptively locate the positions of the parts on the final feature map generated by a local branch and maintain the continuity of the internal semantics of the parts. In addition, to obtain parts with different semantic patterns, we propose a new loss function that punishes overlapping regions, forcing the positions of different parts on the same feature map to not overlap each other as much as possible. Extensive ablation experiments demonstrate the effectiveness of our part-level feature learning method MRF-SAPL, and our model achieves state-of-the-art performance on two benchmark datasets.

5.

Missingness-Pattern-Adaptive Learning With Incomplete Data.

Gong, Yongshun; Li, Zhibin; Liu, Wei; Lu, Xiankai; Liu, Xinwang; Tsang, Ivor W; Yin, Yilong.

IEEE Trans Pattern Anal Mach Intell ; 45(9): 11053-11066, 2023 Sep.

Article in English | MEDLINE | ID: mdl-37030829

ABSTRACT

Many real-world problems deal with collections of data with missing values, e.g., RNA sequential analytics, image completion, video processing, etc. Usually, such missing data is a serious impediment to a good learning achievement. Existing methods tend to use a universal model for all incomplete data, resulting in a suboptimal model for each missingness pattern. In this paper, we present a general model for learning with incomplete data. The proposed model can be appropriately adjusted with different missingness patterns, alleviating competitions between data. Our model is based on observable features only, so it does not incur errors from data imputation. We further introduce a low-rank constraint to promote the generalization ability of our model. Analysis of the generalization error justifies our idea theoretically. In additional, a subgradient method is proposed to optimize our model with a proven convergence rate. Experiments on different types of data show that our method compares favorably with typical imputation strategies and other state-of-the-art models for incomplete data. More importantly, our method can be seamlessly incorporated into the neural networks with the best results achieved. The source code is released at https://github.com/YS-GONG/missingness-patterns.

6.

Towards Accurate and Robust Domain Adaptation Under Multiple Noisy Environments.

Han, Zhongyi; Gui, Xian-Jin; Sun, Haoliang; Yin, Yilong; Li, Shuo.

IEEE Trans Pattern Anal Mach Intell ; 45(5): 6460-6479, 2023 May.

Article in English | MEDLINE | ID: mdl-36251911

ABSTRACT

In many non-stationary environments, machine learning algorithms usually confront the distribution shift scenarios. Previous domain adaptation methods have achieved great success. However, they would lose algorithm robustness in multiple noisy environments where the examples of source domain become corrupted by label noise, feature noise, or open-set noise. In this paper, we report our attempt toward achieving noise-robust domain adaptation. We first give a theoretical analysis and find that different noises have disparate impacts on the expected target risk. To eliminate the effect of source noises, we propose offline curriculum learning minimizing a newly-defined empirical source risk. We suggest a proxy distribution-based margin discrepancy to gradually decrease the noisy distribution distance to reduce the impact of source noises. We propose an energy estimator for assessing the outlier degree of open-set-noise examples to defeat the harmful influence. We also suggest robust parameter learning to mitigate the negative effect further and learn domain-invariant feature representations. Finally, we seamlessly transform these components into an adversarial network that performs efficient joint optimization for them. A series of empirical studies on the benchmark datasets and the COVID-19 screening task show that our algorithm remarkably outperforms the state-of-the-art, with over 10% accuracy improvements in some transfer tasks.

7.

Feature enhancement network for CNV typing in optical coherence tomography images.

Xu, Chuanzhen; Xi, Xiaoming; Yang, Lu; Yang, Xiao; Song, Zuoyong; Nie, Xiushan; Zhang, Limei; Zhang, Yanwei; Chen, Xinjian; Yin, Yilong.

Phys Med Biol ; 67(20)2022 10 12.

Article in English | MEDLINE | ID: mdl-36137536

ABSTRACT

Objective. Choroidal neovascularization (CNV) is a characteristic feature of wet age-related macular degeneration, which is one of the main causes of blindness in the elderly. Automatic classification of CNV in optical coherence tomography images plays an auxiliary role in the clinical treatment of CNV.Approach. This study proposes a feature enhancement network (FE-net) to discriminate between different CNV types with high inter-class similarity. The FE-net consists of two branches: discriminative FE and diverse FE. In the discriminative FE branch, a novel class-specific feature extraction module is introduced to learn class-specific features, and the discriminative loss is introduced to make the learned features more discriminative. In the diverse FE branch, the attention region selection is used to mine the multi-attention features from feature maps in the same class, and the diverse loss is introduced to guarantee that the attention features are different, which can improve the diversity of the learned features.Main results. Experiments were conducted on our CNV dataset, with significant accuracy of 92.33%, 87.45%, 90.10%, and 91.25% on ACC, AUC, SEN, and SPE, respectively.Significance. These results demonstrate that the proposed method can effectively learn the discriminative and diverse features to discriminate subtle differences between different types of CNV. And accurate classification of CNV plays an auxiliary role in clinical treatmen.

Subject(s)

Choroidal Neovascularization , Wet Macular Degeneration , Aged , Choroidal Neovascularization/diagnostic imaging , Choroidal Neovascularization/drug therapy , Fluorescein Angiography , Humans , Tomography, Optical Coherence/methods , Wet Macular Degeneration/drug therapy

8.

Learning Transferable Parameters for Unsupervised Domain Adaptation.

Han, Zhongyi; Sun, Haoliang; Yin, Yilong.

IEEE Trans Image Process ; 31: 6424-6439, 2022.

Article in English | MEDLINE | ID: mdl-35759596

ABSTRACT

Unsupervised domain adaptation (UDA) enables a learning machine to adapt from a labeled source domain to an unlabeled target domain under the distribution shift. Thanks to the strong representation ability of deep neural networks, recent remarkable achievements in UDA resort to learning domain-invariant features. Intuitively, the goal is that a good feature representation and the hypothesis learned from the source domain can generalize well to the target domain. However, the learning processes of domain-invariant features and source hypotheses inevitably involve domain-specific information that would degrade the generalizability of UDA models on the target domain. The lottery ticket hypothesis proves that only partial parameters are essential for generalization. Motivated by it, we find in this paper that only partial parameters are essential for learning domain-invariant information. Such parameters are termed transferable parameters that can generalize well in UDA. In contrast, the rest parameters tend to fit domain-specific details and often cause the failure of generalization, which are termed untransferable parameters. Driven by this insight, we propose Transferable Parameter Learning (TransPar) to reduce the side effect of domain-specific information in the learning process and thus enhance the memorization of domain-invariant information. Specifically, according to the distribution discrepancy degree, we divide all parameters into transferable and untransferable ones in each training iteration. We then perform separate update rules for the two types of parameters. Extensive experiments on image classification and regression tasks (keypoint detection) show that TransPar outperforms prior arts by non-trivial margins. Moreover, experiments demonstrate that TransPar can be integrated into the most popular deep UDA networks and be easily extended to handle any data distribution shift scenarios.

Subject(s)

Neural Networks, Computer

9.

Tri-Branch Convolutional Neural Networks for Top-k Focused Academic Performance Prediction.

Cui, Chaoran; Zong, Jian; Ma, Yuling; Wang, Xinhua; Guo, Lei; Chen, Meng; Yin, Yilong.

IEEE Trans Neural Netw Learn Syst ; PP2022 May 19.

Article in English | MEDLINE | ID: mdl-35588409

ABSTRACT

Academic performance prediction aims to leverage student-related information to predict their future academic outcomes, which is beneficial to numerous educational applications, such as personalized teaching and academic early warning. In this article, we reveal the students' behavior trajectories by mining campus smartcard records, and capture the characteristics inherent in trajectories for academic performance prediction. Particularly, we carefully design a tri-branch convolutional neural network (CNN) architecture, which is equipped with rowwise, columnwise, and depthwise convolutions and attention operations, to effectively capture the persistence, regularity, and temporal distribution of student behavior in an end-to-end manner, respectively. However, different from existing works mainly targeting at improving the prediction performance for the whole students, we propose to cast academic performance prediction as a top-k ranking problem, and introduce a top-k focused loss to ensure the accuracy of identifying academically at-risk students. Extensive experiments were carried out on a large-scale real-world dataset, and we show that our approach substantially outperforms recently proposed methods for academic performance prediction. For the sake of reproducibility, our codes have been released at https://github.com/ZongJ1111/Academic-Performance-Prediction.

10.

Supervised Adaptive Similarity Matrix Hashing.

Shi, Yang; Nie, Xiushan; Liu, Xingbo; Zou, Li; Yin, Yilong.

IEEE Trans Image Process ; 31: 2755-2766, 2022.

Article in English | MEDLINE | ID: mdl-35320101

ABSTRACT

Compact hash codes can facilitate large-scale multimedia retrieval, significantly reducing storage and computation. Most hashing methods learn hash functions based on the data similarity matrix, which is predefined by supervised labels or a distance metric type. However, this predefined similarity matrix cannot accurately reflect the real similarity relationship among images, which results in poor retrieval performance of hashing methods, especially in multi-label datasets and zero-shot datasets that are highly dependent on similarity relationships. Toward this end, this study proposes a new supervised hashing method called supervised adaptive similarity matrix hashing (SASH) via feature-label space consistency. SASH not only learns the similarity matrix adaptively, but also extracts the label correlations by maintaining consistency between the feature and the label space. This correlation information is then used to optimize the similarity matrix. The experiments on three large normal benchmark datasets (including two multi-label datasets) and three large zero-shot benchmark datasets show that SASH has an excellent performance compared with several state-of-the-art techniques.

11.

Learning Binary Semantic Embedding forLarge-Scale Breast Histology Image Analysis.

Liu, Xingbo; Kang, Xiao; Nie, Xiushan; Guo, Jie; Wang, Shaohua; Yin, Yilong.

IEEE J Biomed Health Inform ; PP2022 Mar 23.

Article in English | MEDLINE | ID: mdl-35320109

ABSTRACT

With the progress of clinical imaging innovation and machine learning, the computer-assisted diagnosis of breast histology images has attracted broad attention. Nonetheless, the use of computer-assisted diagnoses has been blocked due to the incomprehensibility of customary classification models. In view of this question, we propose a novel method for Learning Binary Semantic Embedding (LBSE). In this study, bit balance and uncorrela-tion constraints, double supervision, discrete optimization and asymmetric pairwise similarity are seamlessly integrated for learning binary semantic-preserving embedding. Moreover, a fusion-based strategy is carefully designed to handle the intractable problem of parameter setting, saving huge amounts of time for boundary tuning. Based on the above-mentioned proficient and effective embedding, classification and retrieval are simultaneously performed to give interpretable image-based deduction and model helped conclusions for breast histology images. Extensive experiments are conducted on three benchmark datasets to approve the predominance of LBSE in different situations.

12.

MetaKernel: Learning Variational Random Features with Limited Labels.

Du, Yingjun; Sun, Haoliang; Zhen, Xiantong; Xu, Jun; Yin, Yilong; Shao, Ling; Snoek, Cees G M.

IEEE Trans Pattern Anal Mach Intell ; PP2022 Feb 28.

Article in English | MEDLINE | ID: mdl-35226600

ABSTRACT

Few-shot learning deals with the fundamental and challenging problem of learning from a few annotated samples, while being able to generalize well on new tasks. The crux of few-shot learning is to extract prior knowledge from related tasks to enable fast adaptation to a new task with a limited amount of data. In this paper, we propose meta-learning kernels with random Fourier features for few-shot learning, we call MetaKernel. Specically, we propose learning variational random features in a data-driven manner to obtain task-specic kernels by leveraging the shared knowledge provided by related tasks in a meta-learning setting. We treat the random feature basis as the latent variable, which is estimated by variational inference. The shared knowledge from related tasks is incorporated into a context inference of the posterior, which we achieve via a long-short term memory module. To establish more expressive kernels, we deploy conditional normalizing ows based on coupling layers to achieve a richer posterior distribution over random Fourier bases. The resultant kernels are more informative and discriminative, which further improves the few-shot learning. We conduct experiments on both few-shot image classication and regression tasks. The results on fourteen datasets demonstrate MetaKernel consistently better performance than state-of-the-art alternatives.

13.

Multi-modality relation attention network for breast tumor classification.

Yang, Xiao; Xi, Xiaoming; Yang, Lu; Xu, Chuanzhen; Song, Zuoyong; Nie, Xiushan; Qiao, Lishan; Li, Chenglong; Shi, Qinglei; Yin, Yilong.

Comput Biol Med ; 150: 106210, 2022 11.

Article in English | MEDLINE | ID: mdl-37859295

ABSTRACT

Automatic breast image classification plays an important role in breast cancer diagnosis, and multi-modality image fusion may improve classification performance. However, existing fusion methods ignore relevant multi-modality information in favor of improving the discriminative ability of single-modality features. To improve classification performance, this paper proposes a multi-modality relation attention network with consistent regularization for breast tumor classification using diffusion-weighted imaging (DWI) and apparent dispersion coefficient (ADC) images. Within the proposed network, a novel multi-modality relation attention module improves the discriminative ability of single-modality features by exploring the correlation information between two modalities. In addition, a module ensures the classification consistency of ADC and DWI modality, thus improving robustness to noise. Experimental results on our database demonstrate that the proposed method is effective for breast tumor classification, and outperforms existing multi-modality fusion methods. The AUC, accuracy, specificity, and sensitivity are 85.1%, 86.7%, 83.3%, and 88.9% respectively.

Subject(s)

Breast Neoplasms , Mammary Neoplasms, Animal , Humans , Animals , Female , Magnetic Resonance Imaging/methods , Diffusion Magnetic Resonance Imaging/methods , Breast , Breast Neoplasms/diagnostic imaging

14.

SMBFT: A Modified Fuzzy c-Means Algorithm for Superpixel Generation.

Yu, Zhen; Tian, Cuihuan; Ji, Shiyong; Wei, Benzheng; Yin, Yilong.

Comput Math Methods Med ; 2021: 1053242, 2021.

Article in English | MEDLINE | ID: mdl-34659445

ABSTRACT

Most traditional superpixel segmentation methods used binary logic to generate superpixels for natural images. When these methods are used for images with significantly fuzzy characteristics, the boundary pixels sometimes cannot be correctly classified. In order to solve this problem, this paper proposes a Superpixel Method Based on Fuzzy Theory (SMBFT), which uses fuzzy theory as a guide and traditional fuzzy c-means clustering algorithm as a baseline. This method can make full use of the advantage of the fuzzy clustering algorithm in dealing with the images with the fuzzy characteristics. Boundary pixels which have higher uncertainties can be correctly classified with maximum probability. The superpixel has homogeneous pixels. Meanwhile, the paper also uses the surrounding neighborhood pixels to constrain the spatial information, which effectively alleviates the negative effects of noise. The paper tests on the images from Berkeley database and brain MR images from the Brain web. In addition, this paper proposes a comprehensive criterion to measure the weights of two kinds of criterions in choosing superpixel methods for color images. An evaluation criterion for medical image data sets employs the internal entropy of superpixels which is inspired by the concept of entropy in the information theory. The experimental results show that this method has superiorities than traditional methods both on natural images and medical images.

Subject(s)

Algorithms , Fuzzy Logic , Image Interpretation, Computer-Assisted/methods , Brain/diagnostic imaging , Computational Biology , Humans , Image Interpretation, Computer-Assisted/statistics & numerical data , Image Processing, Computer-Assisted/methods , Image Processing, Computer-Assisted/statistics & numerical data , Magnetic Resonance Imaging/methods , Magnetic Resonance Imaging/statistics & numerical data , Pattern Recognition, Automated/methods , Pattern Recognition, Automated/statistics & numerical data

15.

Weighted-Attribute Triplet Hashing for Large-Scale Similar Judicial Case Matching.

Li, Jiamin; Liu, Xingbo; Nie, Xiushan; Ma, Lele; Li, Peng; Zhang, Kai; Yin, Yilong.

Comput Intell Neurosci ; 2021: 6650962, 2021.

Article in English | MEDLINE | ID: mdl-33953738

ABSTRACT

Similar judicial case matching aims to enable an accurate selection of a judicial document that is most similar to the target document from multiple candidates. The core of similar judicial case matching is to calculate the similarity between two fact case documents. Owing to similar judicial case matching techniques, legal professionals can promptly find and judge similar cases in a candidate set. These techniques can also benefit the development of judicial systems. However, the document of judicial cases not only is long in length but also has a certain degree of structural complexity. Meanwhile, a variety of judicial cases are also increasing rapidly; thus, it is difficult to find the document most similar to the target document in a large corpus. In this study, we present a novel similar judicial case matching model, which obtains the weight of judicial feature attributes based on hash learning and realizes fast similar matching by using a binary code. The proposed model extracts the judicial feature attributes vector using the bidirectional encoder representations from transformers (BERT) model and subsequently obtains the weighted judicial feature attributes through learning the hash function. We further impose triplet constraints to ensure that the similarity of judicial case data is well preserved when projected into the Hamming space. Comprehensive experimental results on public datasets show that the proposed method is superior in the task of similar judicial case matching and is suitable for large-scale similar judicial case matching.

Subject(s)

Algorithms

16.

Unifying neural learning and symbolic reasoning for spinal medical report generation.

Han, Zhongyi; Wei, Benzheng; Xi, Xiaoming; Chen, Bo; Yin, Yilong; Li, Shuo.

Med Image Anal ; 67: 101872, 2021 01.

Article in English | MEDLINE | ID: mdl-33142134

ABSTRACT

Automated medical report generation in spine radiology, i.e., given spinal medical images and directly create radiologist-level diagnosis reports to support clinical decision making, is a novel yet fundamental study in the domain of artificial intelligence in healthcare. However, it is incredibly challenging because it is an extremely complicated task that involves visual perception and high-level reasoning processes. In this paper, we propose the neural-symbolic learning (NSL) framework that performs human-like learning by unifying deep neural learning and symbolic logical reasoning for the spinal medical report generation. Generally speaking, the NSL framework firstly employs deep neural learning to imitate human visual perception for detecting abnormalities of target spinal structures. Concretely, we design an adversarial graph network that interpolates a symbolic graph reasoning module into a generative adversarial network through embedding prior domain knowledge, achieving semantic segmentation of spinal structures with high complexity and variability. NSL secondly conducts human-like symbolic logical reasoning that realizes unsupervised causal effect analysis of detected entities of abnormalities through meta-interpretive learning. NSL finally fills these discoveries of target diseases into a unified template, successfully achieving a comprehensive medical report generation. When employed in a real-world clinical dataset, a series of empirical studies demonstrate its capacity on spinal medical report generation and show that our algorithm remarkably exceeds existing methods in the detection of spinal structures. These indicate its potential as a clinical tool that contributes to computer-aided diagnosis.

Subject(s)

Artificial Intelligence , Neural Networks, Computer , Algorithms , Diagnosis, Computer-Assisted , Humans , Spine

17.

IA-net: informative attention convolutional neural network for choroidal neovascularization segmentation in OCT images.

Xi, Xiaoming; Meng, Xianjing; Qin, Zheyun; Nie, Xiushan; Yin, Yilong; Chen, Xinjian.

Biomed Opt Express ; 11(11): 6122-6136, 2020 Nov 01.

Article in English | MEDLINE | ID: mdl-33282479

ABSTRACT

Choroidal neovascularization (CNV) is a characteristic feature of wet age-related macular degeneration (AMD). Quantification of CNV is useful to clinicians in the diagnosis and treatment of CNV disease. Before quantification, CNV lesion should be delineated by automatic CNV segmentation technology. Recently, deep learning methods have achieved significant success for medical image segmentation. However, some CNVs are small objects which are hard to discriminate, resulting in performance degradation. In addition, it's difficult to train an effective network for accurate segmentation due to the complicated characteristics of CNV in OCT images. In order to tackle these two challenges, this paper proposed a novel Informative Attention Convolutional Neural Network (IA-net) for automatic CNV segmentation in OCT images. Considering that the attention mechanism has the ability to enhance the discriminative power of the interesting regions in the feature maps, the attention enhancement block is developed by introducing the additional attention constraint. It has the ability to force the model to pay high attention on CNV in the learned feature maps, improving the discriminative ability of the learned CNV features, which is useful to improve the segmentation performance on small CNV. For accurate pixel classification, the novel informative loss is proposed with the incorporation of an informative attention map. It can focus training on a set of informative samples that are difficult to be predicted. Therefore, the trained model has the ability to learn enough information to classify these informative samples, further improving the performance. The experimental results on our database demonstrate that the proposed method outperforms traditional CNV segmentation methods.

18.

Automatic Segmentation and Measurement on Knee Computerized Tomography Images for Patellar Dislocation Diagnosis.

Sun, Limin; Kong, Qi; Huang, Yan; Yang, Jiushan; Wang, Shaoshan; Zou, Ruiqi; Yin, Yilong; Peng, Jingliang.

Comput Math Methods Med ; 2020: 1782531, 2020.

Article in English | MEDLINE | ID: mdl-32454878

ABSTRACT

Traditionally, for diagnosing patellar dislocation, clinicians make manual geometric measurements on computerized tomography (CT) images taken in the knee area, which is often complex and error-prone. Therefore, we develop a prototype CAD system for automatic measurement and diagnosis. We firstly segment the patella and the femur regions on the CT images and then measure two geometric quantities, patellar tilt angle (PTA), and patellar lateral shift (PLS) automatically on the segmentation results, which are finally used to assist in diagnoses. The proposed quantities are proved valid and the proposed algorithms are proved effective by experiments.

Subject(s)

Knee/diagnostic imaging , Patellar Dislocation/diagnostic imaging , Radiographic Image Interpretation, Computer-Assisted/statistics & numerical data , Tomography, X-Ray Computed/statistics & numerical data , Algorithms , Computational Biology , Databases, Factual/statistics & numerical data , Femur/diagnostic imaging , Humans , Patella/diagnostic imaging , Radiographic Image Enhancement/methods

19.

Model Optimization Boosting Framework for Linear Model Hash Learning.

Liu, Xingbo; Nie, Xiushan; Zhou, Quan; Nie, Liqiang; Yin, Yilong.

IEEE Trans Image Process ; 2020 Feb 05.

Article in English | MEDLINE | ID: mdl-32031939

ABSTRACT

Efficient hashing techniques have attracted extensive research interests in both storage and retrieval of highdimensional data, such as images and videos. In existing hashing methods, a linear model is commonly utilized owing to its efficiency. To obtain better accuracy, linear-based hashing methods focus on designing a generalized linear objective function with different constraints or penalty terms that consider the inherent characteristics and neighborhood information of samples. Differing from existing hashing methods, in this study, we propose a self-improvement framework called Model Boost (MoBoost) to improve model parameter optimization for linear-based hashing methods without adding new constraints or penalty terms. In the proposed MoBoost, for a linear-based hashing method, we first repeatedly execute the hashing method to obtain several hash codes to training samples. Then, utilizing two novel fusion strategies, these codes are fused into a single set. We also propose two new criteria to evaluate the goodness of hash bits during the fusion process. Based on the fused set of hash codes, we learn new parameters for the linear hash function that can significantly improve the accuracy. In general, the proposed MoBoost can be adopted by existing linear-based hashing methods, achieving more precise and stable performance compared to the original methods, and adopting the proposed MoBoost will incur negligible time and space costs. To evaluate the proposed MoBoost, we performed extensive experiments on four benchmark datasets, and the results demonstrate superior performance.

20.

Multiscale Rotation-Invariant Convolutional Neural Networks for Lung Texture Classification.

Wang, Qiangchang; Zheng, Yuanjie; Yang, Gongping; Jin, Weidong; Chen, Xinjian; Yin, Yilong.

IEEE J Biomed Health Inform ; 22(1): 184-195, 2018 01.

Article in English | MEDLINE | ID: mdl-28333649

ABSTRACT

We propose a new multiscale rotation-invariant convolutional neural network (MRCNN) model for classifying various lung tissue types on high-resolution computed tomography. MRCNN employs Gabor-local binary pattern that introduces a good property in image analysis-invariance to image scales and rotations. In addition, we offer an approach to deal with the problems caused by imbalanced number of samples between different classes in most of the existing works, accomplished by changing the overlapping size between the adjacent patches. Experimental results on a public interstitial lung disease database show a superior performance of the proposed method to state of the art.

Subject(s)

Image Interpretation, Computer-Assisted/methods , Lung Diseases, Interstitial/diagnostic imaging , Lung/diagnostic imaging , Neural Networks, Computer , Pattern Recognition, Automated/methods , Algorithms , Humans

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL