Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 54
Filter
1.
Article in English | MEDLINE | ID: mdl-38837924

ABSTRACT

Building learning systems possessing adaptive flexibility to different tasks is critical and challenging. In this article, we propose a novel and general meta-learning framework, called meta-modulation (MeMo), to foster the adaptation capability of a base learner across different tasks where only a few training data are available per task. For one independent task, MeMo proceeds like a "feedback regulation system", which achieves an adaptive modulation on the so-called definitive embeddings of query data to maximize the corresponding task objective. Specifically, we devise a type of efficient feedback information, definitive embedding feedback (DEF), to mathematize and quantify the unsuitability between the few training data and the base learner as well as the promising adjustment direction to reduce this unsuitability. The DEFs are encoded into high-level representation and temporarily stored as task-specific modulator templates by a modulation encoder. For coming query data, we develop an attention mechanism acting upon these modulator templates and combine both task/data-level modulation to generate the final data-specific meta-modulator. This meta-modulator is then used to modulate the query's embedding for correct decision-making. Our framework is scalable for various base learner models like multi-layer perceptron (MLP), long short-term memory (LSTM), convolutional neural network (CNN), and transformer, and applicable to different learning problems like language modeling and image recognition. Experimental results on a 2-D point synthetic dataset and various benchmarks in language and vision domains demonstrate the effectiveness and competitiveness of our framework.

2.
Evol Bioinform Online ; 20: 11769343241257344, 2024.
Article in English | MEDLINE | ID: mdl-38826865

ABSTRACT

In diploid organisms, half of the chromosomes in each cell come from the father and half from the mother. Through previous studies, it was found that the paternal chromosome and the maternal chromosome can be regulated and expressed independently, leading to the emergence of allele specific expression (ASE). In this study, we analyzed the differential expression of alleles in the high-altitude population and the normal population based on the RNA sequencing data. Through gene cluster analysis and protein interaction network analysis, we found some changes occurred at the gene level, and some negative effects. During the study, we realized that the calmodulin homology domain may have a certain correlation with long-term survival at high altitude. The plateau environment is characterized by hypoxia, low air pressure, strong ultraviolet radiation, and low temperature. Accordingly, the genetic changes in the process of adaptation are mainly reflected in these characteristics. High altitude generation living is also highly related to cancer, immune disease, cardiovascular disease, neurological disease, endocrine disease, and other diseases. Therefore, the medical system in high altitude areas should pay more attention to these diseases.

3.
Article in English | MEDLINE | ID: mdl-38809743

ABSTRACT

Self-supervised Learning (SSL) including the mainstream contrastive learning has achieved great success in learning visual representations without data annotations. However, most methods mainly focus on the instance level information (ie, the different augmented images of the same instance should have the same feature or cluster into the same class), but there is a lack of attention on the relationships between different instances. In this paper, we introduce a novel SSL paradigm, which we term as relational self-supervised learning (ReSSL) framework that learns representations by modeling the relationship between different instances. Specifically, our proposed method employs sharpened distribution of pairwise similarities among different instances as relation metric, which is thus utilized to match the feature embeddings of different augmentations. To boost the performance, we argue that weak augmentations matter to represent a more reliable relation, and leverage momentum strategy for practical efficiency. The designed asymmetric predictor head and an InfoNCE warm-up strategy enhance the robustness to hyper-parameters and benefit the resulting performance. Experimental results show that our proposed ReSSL substantially outperforms the state-of-the-art methods across different network architectures, including various lightweight networks (eg, EfficientNet and MobileNet).

4.
IEEE Trans Pattern Anal Mach Intell ; 46(5): 3972-3980, 2024 May.
Article in English | MEDLINE | ID: mdl-38224500

ABSTRACT

Backpropagation (BP) is widely used for calculating gradients in deep neural networks (DNNs). Applied often along with stochastic gradient descent (SGD) or its variants, BP is considered as a de-facto choice in a variety of machine learning tasks including DNN training and adversarial attack/defense. Recently, a linear variant of BP named LinBP was introduced for generating more transferable adversarial examples for performing black-box attacks, by (Guo et al. 2020). Although it has been shown empirically effective in black-box attacks, theoretical studies and convergence analyses of such a method is lacking. This paper serves as a complement and somewhat an extension to Guo et al. (2020) paper, by providing theoretical analyses on LinBP in neural-network-involved learning tasks, including adversarial attack and model training. We demonstrate that, somewhat surprisingly, LinBP can lead to faster convergence in these tasks in the same hyper-parameter settings, compared to BP. We confirm our theoretical results with extensive experiments.

5.
IEEE Trans Pattern Anal Mach Intell ; 45(11): 13235-13249, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37819812

ABSTRACT

Recently, with the applications of algorithms in various risky scenarios, algorithmic fairness has been a serious concern and received lots of interest in machine learning community. In this article, we focus on the bipartite ranking scenario, where the instances come from either the positive or negative class and the goal is to learn a ranking function that ranks positive instances higher than negative ones. We are interested in whether the learned ranking function can cause systematic disparity across different protected groups defined by sensitive attributes. While there could be a trade-off between fairness and performance, we propose a model agnostic post-processing framework xOrder for achieving fairness in bipartite ranking and maintaining the algorithm classification performance. In particular, we optimize a weighted sum of the utility as identifying an optimal warping path across different protected groups and solve it through a dynamic programming process. xOrder is compatible with various classification models and ranking fairness metrics, including supervised and unsupervised fairness metrics. In addition to binary groups, xOrder can be applied to multiple protected groups. We evaluate our proposed algorithm on four benchmark data sets and two real-world patient electronic health record repositories. xOrder consistently achieves a better balance between the algorithm utility and ranking fairness on a variety of datasets with different metrics. From the visualization of the calibrated ranking scores, xOrder mitigates the score distribution shifts of different groups compared with baselines. Moreover, additional analytical results verify that xOrder achieves a robust performance when faced with fewer samples and a bigger difference between training and testing ranking score distributions.

6.
J Chem Inf Model ; 63(19): 5971-5980, 2023 10 09.
Article in English | MEDLINE | ID: mdl-37589216

ABSTRACT

Many material properties are manifested in the morphological appearance and characterized using microscopic images, such as scanning electron microscopy (SEM). Polymer miscibility is a key physical quantity of polymer materials and is commonly and intuitively judged using SEM images. However, human observation and judgment of the images is time-consuming, labor-intensive, and hard to be quantified. Computer image recognition with machine learning methods can make up for the defects of artificial judging, giving accurate and quantitative judgment. We achieve automatic miscibility recognition utilizing a convolutional neural network and transfer learning methods, and the model obtains up to 94% accuracy. We also put forward a quantitative criterion for polymer miscibility with this model. The proposed method can be widely applied to the quantitative characterization of the microstructure and properties of various materials.


Subject(s)
Neural Networks, Computer , Polymers , Humans , Machine Learning
7.
IEEE Trans Pattern Anal Mach Intell ; 45(7): 8936-8953, 2023 Jul.
Article in English | MEDLINE | ID: mdl-37015571

ABSTRACT

Searching for a more compact network width recently serves as an effective way of channel pruning for the deployment of convolutional neural networks (CNNs) under hardware constraints. To fulfil the searching, a one-shot supernet is usually leveraged to efficiently evaluate the performance w.r.t. different network widths. However, current methods mainly follow a unilaterally augmented (UA) principle for the evaluation of each width, which induces the training unfairness of channels in supernet. In this article, we introduce a new supernet called Bilaterally Coupled Network (BCNet) to address this issue. In BCNet, each channel is fairly trained and responsible for the same amount of network widths, thus each network width can be evaluated more accurately. Besides, we propose to reduce the redundant search space and present the BCNetV2 as the enhanced supernet to ensure rigorous training fairness over channels. Furthermore, we leverage a stochastic complementary strategy for training the BCNet, and propose a prior initial population sampling method to boost the performance of the evolutionary search. We also propose a new open-source width search benchmark on macro structures named Channel-Bench-Macro for the better comparisons of the width search algorithms with MobileNet- and ResNet-like architectures. Extensive experiments on the benchmark datasets demonstrate that our method can achieve state-of-the-art performance.

8.
Article in English | MEDLINE | ID: mdl-37023168

ABSTRACT

Deep convolutional neural networks are shown to be overkill with high parametric and computational redundancy in many application scenarios, and an increasing number of works have explored model pruning to obtain lightweight and efficient networks. However, most existing pruning approaches are driven by empirical heuristics and rarely consider the joint impact of channels, leading to unguaranteed and suboptimal performance. In this article, we propose a novel channel pruning method via class-aware trace ratio optimization (CATRO) to reduce the computational burden and accelerate the model inference. Utilizing class information from a few samples, CATRO measures the joint impact of multiple channels by feature space discriminations and consolidates the layerwise impact of preserved channels. By formulating channel pruning as a submodular set function maximization problem, CATRO solves it efficiently via a two-stage greedy iterative optimization procedure. More importantly, we present theoretical justifications on convergence of CATRO and performance of pruned networks. Experimental results demonstrate that CATRO achieves higher accuracy with similar computation cost or lower computation cost with similar accuracy than other state-of-the-art channel pruning algorithms. In addition, because of its class-aware property, CATRO is suitable to prune efficient networks adaptively for various classification subtasks, enhancing handy deployment and usage of deep networks in real-world applications.

9.
Article in English | MEDLINE | ID: mdl-36306294

ABSTRACT

Studying the relationship between linear discriminant analysis (LDA) and least squares regression (LSR) is of great theoretical and practical significance. It is well-known that the two-class LDA is equivalent to an LSR problem, and directly casting multiclass LDA as an LSR problem, however, becomes more challenging. Recent study reveals that the equivalence between multiclass LDA and LSR can be established based on a special class indicator matrix, but under a mild condition which may not hold under the scenarios with low-dimensional or oversampled data. In this article, we show that the equivalence between multiclass LDA and LSR can be established based on arbitrary linearly independent class indicator vectors and without any condition. In addition, we show that LDA is also equivalent to a constrained LSR based on the data-dependent indicator vectors. It can be concluded that under exactly the same mild condition, such two regressions are both equivalent to the null space LDA method. Illuminated by the equivalence of LDA and LSR, we propose a direct LDA classifier to replace the conventional framework of LDA plus extra classifier. Extensive experiments well validate the above theoretic analysis.

10.
IEEE Trans Cybern ; PP2022 Sep 30.
Article in English | MEDLINE | ID: mdl-36179009

ABSTRACT

Markerless vision-based teleoperation that leverages innovations in computer vision offers the advantages of allowing natural and noninvasive finger motions for multifingered robot hands. However, current pose estimation methods still face inaccuracy issues due to the self-occlusion of the fingers. Herein, we develop a novel vision-based hand-arm teleoperation system that captures the human hands from the best viewpoint and at a suitable distance. This teleoperation system consists of an end-to-end hand pose regression network and a controlled active vision system. The end-to-end pose regression network (Transteleop), combined with an auxiliary reconstruction loss function, captures the human hand through a low-cost depth camera and predicts joint commands of the robot based on the image-to-image translation method. To obtain the optimal observation of the human hand, an active vision system is implemented by a robot arm at the local site that ensures the high accuracy of the proposed neural network. Human arm motions are simultaneously mapped to the slave robot arm under relative control. Quantitative network evaluation and a variety of complex manipulation tasks, for example, tower building, pouring, and multitable cup stacking, demonstrate the practicality and stability of the proposed teleoperation system.

12.
Front Immunol ; 13: 804034, 2022.
Article in English | MEDLINE | ID: mdl-35250976

ABSTRACT

OBJECTIVE: Interstitial lung diseases (ILDs) secondary to anti-synthetase syndrome (ASS) greatly influence the prognoses of patients with ASS. Here we aimed to investigate the peripheral immune responses to understand the pathogenesis of this condition. METHODS: We performed single-cell RNA sequencing (scRNA-seq) of peripheral blood mononuclear cells (PBMCs) from 5 patients with ASS-ILD and 3 healthy donors (HDs). Flow cytometry of PBMCs was performed to replenish the results of scRNA-seq. RESULTS: We used scRNA-seq to depict a high-resolution visualization of cellular landscape in PBMCs from patients with ASS-ILD. Patients showed upregulated interferon responses among NK cells, monocytes, T cells, and B cells. And the ratio of effector memory CD8 T cells to naïve CD8 T cells was significantly higher in patients than that in HDs. Additionally, Th1, Th2, and Th17 cell differentiation signaling pathways were enriched in T cells. Flow cytometry analyses showed increased proportions of Th17 cells and Th2 cells, and decreased proportion of Th1 cells in patients with ASS-ILD when compared with HDs, evaluated by the expression patterns of chemokine receptors. CONCLUSIONS: The scRNA-seq data analyses reveal that ASS-ILD is characterized by upregulated interferon responses, altered CD8 T cell homeostasis, and involvement of differentiation signaling pathways of CD4 T cells. The flow cytometry analyses show that the proportions of Th17 cells and Th2 cells are increased and the proportion of Th1 cells is decreased in patients with ASS-ILD. These findings may provide foundations of novel therapeutic targets for patients with this condition.


Subject(s)
Lung Diseases, Interstitial , Transcriptome , Humans , Immunity , Interferons/metabolism , Leukocytes, Mononuclear/metabolism , Ligases/metabolism
13.
IEEE Trans Pattern Anal Mach Intell ; 44(2): 1002-1019, 2022 02.
Article in English | MEDLINE | ID: mdl-32780696

ABSTRACT

Multi-task learning (MTL) refers to the paradigm of learning multiple related tasks together. In contrast, in single-task learning (STL) each individual task is learned independently. MTL often leads to better trained models because they can leverage the commonalities among related tasks. However, because MTL algorithms can "leak" information from different models across different tasks, MTL poses a potential security risk. Specifically, an adversary may participate in the MTL process through one task and thereby acquire the model information for another task. The previously proposed privacy-preserving MTL methods protect data instances rather than models, and some of them may underperform in comparison with STL methods. In this paper, we propose a privacy-preserving MTL framework to prevent information from each model leaking to other models based on a perturbation of the covariance matrix of the model matrix. We study two popular MTL approaches for instantiation, namely, learning the low-rank and group-sparse patterns of the model matrix. Our algorithms can be guaranteed not to underperform compared with STL methods. We build our methods based upon tools for differential privacy, and privacy guarantees, utility bounds are provided, and heterogeneous privacy budgets are considered. The experiments demonstrate that our algorithms outperform the baseline methods constructed by existing privacy-preserving MTL methods on the proposed model-protection problem.


Subject(s)
Algorithms , Privacy , Learning
14.
IEEE Trans Pattern Anal Mach Intell ; 44(8): 4404-4418, 2022 08.
Article in English | MEDLINE | ID: mdl-33625977

ABSTRACT

Remarkable gains in deep learning usually benefit from large-scale supervised data. Ensuring the intra-class modality diversity in training set is critical for generalization capability of cutting-edge deep models, but it burdens human with heavy manual labor on data collection and annotation. In addition, some rare or unexpected modalities are new for the current model, causing reduced performance under such emerging modalities. Inspired by the achievements in speech recognition, psychology and behavioristics, we present a practical solution, self-reinforcing unsupervised matching (SUM), to annotate the images with 2D structure-preserving property in an emerging modality by cross-modality matching. Specifically, we propose a dynamic programming algorithm, dynamic position warping (DPW), to reveal the underlying element correspondence relationship between two matrix-form data in an order-preserving fashion, and devise a local feature adapter (LoFA) to allow for cross-modality similarity measurement. On these bases, we develop a two-tier self-reinforcing learning mechanism on both feature level and image level to optimize the LoFA. The proposed SUM framework requires no any supervision in emerging modality and only one template in seen modality, providing a promising route towards incremental learning and continual learning. Extensive experimental evaluation on two proposed challenging one-template visual matching tasks demonstrate its efficiency and superiority.


Subject(s)
Algorithms , Humans
15.
IEEE Trans Pattern Anal Mach Intell ; 44(10): 7167-7174, 2022 10.
Article in English | MEDLINE | ID: mdl-34161238

ABSTRACT

This paper serves as a survey of recent advances in large margin training and its theoretical foundations, mostly for (nonlinear) deep neural networks (DNNs) that are probably the most prominent machine learning models for large-scale data in the community over the past decade. We generalize the formulation of classification margins from classical research to latest DNNs, summarize theoretical connections between the margin, network generalization, and robustness, and introduce recent efforts in enlarging the margins for DNNs comprehensively. Since the viewpoint of different methods is discrepant, we categorize them into groups for ease of comparison and discussion in the paper. Hopefully, our discussions and overview inspire new research work in the community that aim to improve the performance of DNNs, and we also point to directions where the large margin principle can be verified to provide theoretical evidence why certain regularizations for DNNs function well in practice. We managed to shorten the paper such that the crucial spirit of large margin learning and related methods are better emphasized.


Subject(s)
Algorithms , Neural Networks, Computer , Machine Learning
16.
Comput Intell Neurosci ; 2021: 6639865, 2021.
Article in English | MEDLINE | ID: mdl-33628214

ABSTRACT

This study investigated the influence of competitive state on cerebral cortex activity of professional shooters with 10 m air rifle before shooting. Generally, professional athletes have higher neural efficiency compared with ordinary people. We recruited 11 national shooters to complete 60 shots under both noncompetitive and competitive shooting conditions, and simultaneously collected their electroencephalogram (EEG) and electrocardiogram (ECG) information. Theta, alpha, and beta power were computed in the last three seconds preceding each shot from average-reference 29-channel EEG, while EEG characteristics under two conditions were analyzed. The results showed a significant linear correlation between shooting accuracy and EEG power of anterior frontal, central, temporal, and occipital regions in beta and theta bands. In addition, the theta power in occipital regions, alpha power in frontal-central and left occipital regions, and beta power in frontal and mid-occipital regions were higher than those in noncompetitive state. However, heart rate (HR) and shooting accuracy did not change significantly under the two conditions. These findings reveal the changes of cortical activity underlying competition shooting as well as providing further understanding of the neural mechanisms of the shooting process and lay a foundation for the subsequent neuromodulation research.


Subject(s)
Electroencephalography , Firearms , Athletes , Cerebral Cortex , Heart Rate , Humans
17.
IEEE Trans Image Process ; 30: 2669-2681, 2021.
Article in English | MEDLINE | ID: mdl-33476270

ABSTRACT

Convolutional neural networks have been proven effective in a variety of image restoration tasks. Most state-of-the-art solutions, however, are trained using images with a single particular degradation level, and their performance deteriorates drastically when applied to other degradation settings. In this paper, we propose deep likelihood network (DL-Net), aiming at generalizing off-the-shelf image restoration networks to succeed over a spectrum of degradation levels. We slightly modify an off-the-shelf network by appending a simple recursive module, which is derived from a fidelity term, for disentangling the computation for multiple degradation levels. Extensive experimental results on image inpainting, interpolation, and super-resolution show the effectiveness of our DL-Net.

18.
IEEE Trans Neural Netw Learn Syst ; 32(4): 1433-1447, 2021 Apr.
Article in English | MEDLINE | ID: mdl-32310797

ABSTRACT

Few-shot learning (FSL) focuses on distilling transferrable knowledge from existing experience to cope with novel concepts for which the labeled data are scarce. A typical assumption in FSL is that the training examples of novel classes are all clean with no outlier interference. In many realistic applications where examples are provided by users, however, data are potentially noisy or unreadable. In this context, we introduce a novel research topic, robust FSL (RFSL), where we aim to address two types of outliers within user-provided data: the representation outlier (RO) and the label outlier (LO). Moreover, we introduce a metric for estimating robustness and use it to investigate the performance of several advanced methods to FSL when faced with user-provided outliers. In addition, we propose robust attentive profile networks (RapNets) to achieve outlier suppression. The results of a comprehensive evaluation of benchmark data sets demonstrate the shortcomings of current FSL methods and the superiority of the proposed RapNets when dealing with RFSL problems, establishing a benchmark for follow-up studies.

19.
IEEE Trans Pattern Anal Mach Intell ; 43(4): 1129-1139, 2021 04.
Article in English | MEDLINE | ID: mdl-31634825

ABSTRACT

The tremendous recent success of deep neural networks (DNNs) has sparked a surge of interest in understanding their predictive ability. Unlike the human visual system which is able to generalize robustly and learn with little supervision, DNNs normally require a massive amount of data to learn new concepts. In addition, research works also show that DNNs are vulnerable to adversarial examples-maliciously generated images which seem perceptually similar to the natural ones but are actually formed to fool learning models, which means the models have problem generalizing to unseen data with certain type of distortions. In this paper, we analyze the generalization ability of DNNs comprehensively and attempt to improve it from a geometric point of view. We propose adversarial margin maximization (AMM), a learning-based regularization which exploits an adversarial perturbation as a proxy. It encourages a large margin in the input space, just like the support vector machines. With a differentiable formulation of the perturbation, we train the regularized DNNs simply through back-propagation in an end-to-end manner. Experimental results on various datasets (including MNIST, CIFAR-10/100, SVHN and ImageNet) and different DNN architectures demonstrate the superiority of our method over previous state-of-the-arts. Code and models for reproducing our results will be made publicly available.

20.
IEEE Trans Pattern Anal Mach Intell ; 43(12): 4469-4476, 2021 12.
Article in English | MEDLINE | ID: mdl-32750801

ABSTRACT

This paper analyzes regularization terms proposed recently for improving the adversarial robustness of deep neural networks (DNNs), from a theoretical point of view. Specifically, we study possible connections between several effective methods, including input-gradient regularization, Jacobian regularization, curvature regularization, and a cross-Lipschitz functional. We investigate them on DNNs with general rectified linear activations, which constitute one of the most prevalent families of models for image classification and a host of other machine learning applications. We shed light on essential ingredients of these regularizations and re-interpret their functionality. Through the lens of our study, more principled and efficient regularizations can possibly be invented in the near future.


Subject(s)
Algorithms , Neural Networks, Computer , Machine Learning
SELECTION OF CITATIONS
SEARCH DETAIL
...