Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 10 de 10
Filter
Add more filters










Publication year range
1.
Neural Netw ; 176: 106340, 2024 Apr 25.
Article in English | MEDLINE | ID: mdl-38713967

ABSTRACT

Vision transformers have achieved remarkable success in computer vision tasks by using multi-head self-attention modules to capture long-range dependencies within images. However, the high inference computation cost poses a new challenge. Several methods have been proposed to address this problem, mainly by slimming patches. In the inference stage, these methods classify patches into two classes, one to keep and the other to discard in multiple layers. This approach results in additional computation at every layer where patches are discarded, which hinders inference acceleration. In this study, we tackle the patch slimming problem from a different perspective by proposing a life regression module that determines the lifespan of each image patch in one go. During inference, the patch is discarded once the current layer index exceeds its life. Our proposed method avoids additional computation and parameters in multiple layers to enhance inference speed while maintaining competitive performance. Additionally, our approach1 requires fewer training epochs than other patch slimming methods.

2.
Commun Chem ; 7(1): 85, 2024 Apr 17.
Article in English | MEDLINE | ID: mdl-38632308

ABSTRACT

Effective transfer learning for molecular property prediction has shown considerable strength in addressing insufficient labeled molecules. Many existing methods either disregard the quantitative relationship between source and target properties, risking negative transfer, or require intensive training on target tasks. To quantify transferability concerning task-relatedness, we propose Principal Gradient-based Measurement (PGM) for transferring molecular property prediction ability. First, we design an optimization-free scheme to calculate a principal gradient for approximating the direction of model optimization on a molecular property prediction dataset. We have analyzed the close connection between the principal gradient and model optimization through mathematical proof. PGM measures the transferability as the distance between the principal gradient obtained from the source dataset and that derived from the target dataset. Then, we perform PGM on various molecular property prediction datasets to build a quantitative transferability map for source dataset selection. Finally, we evaluate PGM on multiple combinations of transfer learning tasks across 12 benchmark molecular property prediction datasets and demonstrate that it can serve as fast and effective guidance to improve the performance of a target task. This work contributes to more efficient discovery of drugs, materials, and catalysts by offering a task-relatedness quantification prior to transfer learning and understanding the relationship between chemical properties.

3.
Article in English | MEDLINE | ID: mdl-38625782

ABSTRACT

The study of cultural artifact provenance, tracing ownership and preservation, holds significant importance in archaeology and art history. Modern technology has advanced this field, yet challenges persist, including recognizing evidence from diverse sources, integrating sociocultural context, and enhancing interactive automation for comprehensive provenance analysis. In collaboration with art historians, we examined the handscroll, a traditional Chinese painting form that provides a rich source of historical data and a unique opportunity to explore history through cultural artifacts. We present a three-tiered methodology encompassing artifact, contextual, and provenance levels, designed to create a "Biography" for handscroll. Our approach incorporates the application of image processing techniques and language models to extract, validate, and augment elements within handscroll using various cultural heritage databases. To facilitate efficient analysis of non-contiguous extracted elements, we have developed a distinctive layout. Additionally, we introduce ScrollTimes, a visual analysis system tailored to support the three-tiered analysis of handscroll, allowing art historians to interactively create biographies tailored to their interests. Validated through case studies and expert interviews, our approach offers a window into history, fostering a holistic understanding of handscroll provenance and historical significance.

4.
Stud Health Technol Inform ; 310: 906-910, 2024 Jan 25.
Article in English | MEDLINE | ID: mdl-38269940

ABSTRACT

Lymph node metastasis is of paramount importance for patient treatment decision-making, prognosis evaluation, and clinical trial enrollment. However, accurate preoperative diagnosis remains challenging. In this study, we proposed a multi-task network to learn the primary tumor pathological features using the pT stage prediction task and leverage these features to facilitate lymph node metastasis prediction. We conducted experiments using electronic medical record data from 681 patients with non-small cell lung cancer. The proposed method achieved a 0.768 area under the receiver operating characteristic curve (AUC) value with a 0.073 standard deviation (SD) and a 0.448 average precision (AP) value with a 0.113 SD for lymph node metastasis prediction, which significantly outperformed the baseline models. Based on the results, we can conclude that the proposed multi-task method can effectively learn representations about tumor pathological conditions to support lymph node metastasis prediction.


Subject(s)
Carcinoma, Non-Small-Cell Lung , Lung Neoplasms , Humans , Lymphatic Metastasis , Learning , Electronic Health Records
5.
Article in English | MEDLINE | ID: mdl-38083421

ABSTRACT

Lung cancer is one of the most dangerous cancers all over the world. Surgical resection remains the only potentially curative option for patients with lung cancer. However, this invasive treatment often causes various complications, which seriously endanger patient health. In this study, we proposed a novel multi-label network, namely a hierarchy-driven multi-label network with label constraints (HDMN-LC), to predict the risk of complications of lung cancer patients. In this method, we first divided all complications into pulmonary and cardiovascular complication groups and employed the hierarchical cluster algorithm to analyze the hierarchies between these complications. After that, we employed the hierarchies to drive the network architecture design so that related complications could share more hidden features. And then, we combined all complications and employed an auxiliary task to predict whether any complications would occur to impose the bottom layer to learn general features. Finally, we proposed a regularization term to constrain the relationship between specific and combined complication labels to improve performance. We conducted extensive experiments on real clinical data of 593 patients. Experimental results indicate that the proposed method outperforms the single-label, multi-label baseline methods, with an average AUC value of 0.653. And the results also prove the effectiveness of hierarchy-driven network architecture and label constraints. We conclude that the proposed method can predict complications for lung cancer patients more effectively than the baseline methods.Clinical relevance-This study presents a novel multi-label network that can more accurately predict the risk of specific postoperative complications for lung cancer patients. The method can help clinicians identify high-risk patients more accurately and timely so that interventions can be implemented in advance to ensure patient safety.


Subject(s)
Lung Neoplasms , Humans , Lung Neoplasms/surgery , Algorithms , Postoperative Complications/diagnosis , Postoperative Complications/etiology , Learning , Pattern Recognition, Automated/methods
6.
Article in English | MEDLINE | ID: mdl-38117621

ABSTRACT

Weakly supervised semantic segmentation (WSSS) is a challenging yet important research field in vision community. In WSSS, the key problem is to generate high-quality pseudo segmentation masks (PSMs). Existing approaches mainly depend on the discriminative object part to generate PSMs, which would inevitably miss object parts or involve surrounding image background, as the learning process is unaware of the full object structure. In fact, both the discriminative object part and the full object structure are critical for deriving of high-quality PSMs. To fully explore these two information cues, we build a novel end-to-end learning framework, alternate self-dual teaching (ASDT), based on a dual-teacher single-student network architecture. The information interaction among different network branches is formulated in the form of knowledge distillation (KD). Unlike the conventional KD, the knowledge of the two teacher models would inevitably be noisy under weak supervision. Inspired by the Pulse Width (PW) modulation, we introduce a PW wave-like selection signal to alleviate the influence of the imperfect knowledge from either teacher model on the KD process. Comprehensive experiments on the PASCAL VOC 2012 and COCO-Stuff 10K demonstrate the effectiveness of the proposed ASDT framework, and new state-of-the-art results are achieved.

7.
Article in English | MEDLINE | ID: mdl-37436859

ABSTRACT

Most existing methods that cope with noisy labels usually assume that the classwise data distributions are well balanced. They are difficult to deal with the practical scenarios where training samples have imbalanced distributions, since they are not able to differentiate noisy samples from tail classes' clean samples. This article makes an early effort to tackle the image classification task in which the provided labels are noisy and have a long-tailed distribution. To deal with this problem, we propose a new learning paradigm which can screen out noisy samples by matching between inferences on weak and strong data augmentations. A leave-noise-out regularization (LNOR) is further introduced to eliminate the effect of the recognized noisy samples. Besides, we propose a prediction penalty based on the online classwise confidence levels to avoid the bias toward easy classes which are dominated by head classes. Extensive experiments on five datasets including CIFAR-10, CIFAR-100, MNIST, FashionMNIST, and Clothing1M demonstrate that the proposed method outperforms the existing algorithms for learning with long-tailed distribution and label noise.

8.
IEEE Trans Med Imaging ; 42(6): 1720-1734, 2023 06.
Article in English | MEDLINE | ID: mdl-37021848

ABSTRACT

Convolutional neural networks (CNNs) have made enormous progress in medical image segmentation. The learning of CNNs is dependent on a large amount of training data with fine annotations. The workload of data labeling can be significantly relieved via collecting imperfect annotations which only match the underlying ground truths coarsely. However, label noises which are systematically introduced by the annotation protocols, severely hinders the learning of CNN-based segmentation models. Hence, we devise a novel collaborative learning framework in which two segmentation models cooperate to combat label noises in coarse annotations. First, the complementary knowledge of two models is explored by making one model clean training data for the other model. Secondly, to further alleviate the negative impact of label noises and make sufficient usage of the training data, the specific reliable knowledge of each model is distilled into the other model with augmentation-based consistency constraints. A reliability-aware sample selection strategy is incorporated for guaranteeing the quality of the distilled knowledge. Moreover, we employ joint data and model augmentations to expand the usage of reliable knowledge. Extensive experiments on two benchmarks showcase the superiority of our proposed method against existing methods under annotations with different noise levels. For example, our approach can improve existing methods by nearly 3% DSC on the lung lesion segmentation dataset LIDC-IDRI under annotations with 80% noise ratio. Code is available at: https://github.com/Amber-Believe/ReliableMutualDistillation.


Subject(s)
Distillation , Neural Networks, Computer , Reproducibility of Results , Image Processing, Computer-Assisted
9.
IEEE Trans Cybern ; 53(11): 7263-7274, 2023 Nov.
Article in English | MEDLINE | ID: mdl-36251898

ABSTRACT

Part-level attribute parsing is a fundamental but challenging task, which requires the region-level visual understanding to provide explainable details of body parts. Most existing approaches address this problem by adding a regional convolutional neural network (RCNN) with an attribute prediction head to a two-stage detector, in which attributes of body parts are identified from localwise part boxes. However, localwise part boxes with limit visual clues (i.e., part appearance only) lead to unsatisfying parsing results, since attributes of body parts are highly dependent on comprehensive relations among them. In this article, we propose a knowledge-embedded RCNN (KE-RCNN) to identify attributes by leveraging rich knowledge, including implicit knowledge (e.g., the attribute "above-the-hip" for a shirt requires visual/geometry relations of shirt-hip) and explicit knowledge (e.g., the part of "shorts" cannot have the attribute of "hoodie" or "lining"). Specifically, the KE-RCNN consists of two novel components, that is: 1) implicit knowledge-based encoder (IK-En) and 2) explicit knowledge-based decoder (EK-De). The former is designed to enhance part-level representation by encoding part-part relational contexts into part boxes, and the latter one is proposed to decode attributes with a guidance of prior knowledge about part-attribute relations. In this way, the KE-RCNN is plug-and-play, which can be integrated into any two-stage detectors, for example, Attribute-RCNN, Cascade-RCNN, HRNet-based RCNN, and SwinTransformer-based RCNN. Extensive experiments conducted on two challenging benchmarks, for example, Fashionpedia and Kinetics-TPS, demonstrate the effectiveness and generalizability of the KE-RCNN. In particular, it achieves higher improvements over all existing methods, reaching around 3% of AP allIoU+F1 on Fashionpedia and around 4% of Accp on Kinetics-TPS. Code and models are publicly available at: https://github.com/sota-joson/KE-RCNN.

10.
Front Cardiovasc Med ; 9: 812276, 2022.
Article in English | MEDLINE | ID: mdl-35463786

ABSTRACT

Objective: To compare the performance, clinical feasibility, and reliability of statistical and machine learning (ML) models in predicting heart failure (HF) events. Background: Although ML models have been proposed to revolutionize medicine, their promise in predicting HF events has not been investigated in detail. Methods: A systematic search was performed on Medline, Web of Science, and IEEE Xplore for studies published between January 1, 2011 to July 14, 2021 that developed or validated at least one statistical or ML model that could predict all-cause mortality or all-cause readmission of HF patients. Prediction Model Risk of Bias Assessment Tool was used to assess the risk of bias, and random effect model was used to evaluate the pooled c-statistics of included models. Result: Two-hundred and two statistical model studies and 78 ML model studies were included from the retrieved papers. The pooled c-index of statistical models in predicting all-cause mortality, ML models in predicting all-cause mortality, statistical models in predicting all-cause readmission, ML models in predicting all-cause readmission were 0.733 (95% confidence interval 0.724-0.742), 0.777 (0.752-0.803), 0.678 (0.651-0.706), and 0.660 (0.633-0.686), respectively, indicating that ML models did not show consistent superiority compared to statistical models. The head-to-head comparison revealed similar results. Meanwhile, the immoderate use of predictors limited the feasibility of ML models. The risk of bias analysis indicated that ML models' technical pitfalls were more serious than statistical models'. Furthermore, the efficacy of ML models among different HF subgroups is still unclear. Conclusions: ML models did not achieve a significant advantage in predicting events, and their clinical feasibility and reliability were worse.

SELECTION OF CITATIONS
SEARCH DETAIL
...