Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 254
Filter
1.
Brief Bioinform ; 25(4)2024 May 23.
Article in English | MEDLINE | ID: mdl-38801703

ABSTRACT

Micro ribonucleic acids (miRNAs) play a pivotal role in governing the human transcriptome in various biological phenomena. Hence, the accumulation of miRNA expression dysregulation frequently assumes a noteworthy role in the initiation and progression of complex diseases. However, accurate identification of dysregulated miRNAs still faces challenges at the current stage. Several bioinformatics tools have recently emerged for forecasting the associations between miRNAs and diseases. Nonetheless, the existing reference tools mainly identify the miRNA-disease associations in a general state and fall short of pinpointing dysregulated miRNAs within a specific disease state. Additionally, no studies adequately consider miRNA-miRNA interactions (MMIs) when analyzing the miRNA-disease associations. Here, we introduced a systematic approach, called IDMIR, which enabled the identification of expression dysregulated miRNAs through an MMI network under the gene expression context, where the network's architecture was designed to implicitly connect miRNAs based on their shared biological functions within a particular disease context. The advantage of IDMIR is that it uses gene expression data for the identification of dysregulated miRNAs by analyzing variations in MMIs. We illustrated the excellent predictive power for dysregulated miRNAs of the IDMIR approach through data analysis on breast cancer and bladder urothelial cancer. IDMIR could surpass several existing miRNA-disease association prediction approaches through comparison. We believe the approach complements the deficiencies in predicting miRNA-disease association and may provide new insights and possibilities for diagnosing and treating diseases. The IDMIR approach is now available as a free R package on CRAN (https://CRAN.R-project.org/package=IDMIR).


Subject(s)
Computational Biology , Gene Regulatory Networks , MicroRNAs , Urinary Bladder Neoplasms , Humans , MicroRNAs/genetics , MicroRNAs/metabolism , Computational Biology/methods , Urinary Bladder Neoplasms/genetics , Urinary Bladder Neoplasms/metabolism , Breast Neoplasms/genetics , Breast Neoplasms/metabolism , Gene Expression Profiling , Female , Gene Expression Regulation, Neoplastic
3.
IEEE Trans Med Imaging ; PP2024 Apr 01.
Article in English | MEDLINE | ID: mdl-38557623

ABSTRACT

Deep reinforcement learning (DRL) has demonstrated impressive performance in medical image segmentation, particularly for low-contrast and small medical objects. However, current DRL-based segmentation methods face limitations due to the optimization of error propagation in two separate stages and the need for a significant amount of labeled data. In this paper, we propose a novel deep generative adversarial reinforcement learning (DGARL) approach that, for the first time, enables end-to-end semi-supervised medical image segmentation in the DRL domain. DGARL ingeniously establishes a pipeline that integrates DRL and generative adversarial networks (GANs) to optimize both detection and segmentation tasks holistically while mutually enhancing each other. Specifically, DGARL introduces two innovative components to facilitate this integration in semi-supervised settings. First, a task-joint GAN with two discriminators links the detection results to the GAN's segmentation performance evaluation, allowing simultaneous joint evaluation and feedback. This ensures that DRL and GAN can be directly optimized based on each other's results. Second, a bidirectional exploration DRL integrates backward exploration and forward exploration to ensure the DRL agent explores the correct direction when forward exploration is disabled due to lack of explicit rewards. This mitigates the issue of unlabeled data being unable to provide rewards and rendering DRL unexplorable. Comprehensive experiments on three generalization datasets, comprising a total of 640 patients, demonstrate that our novel DGARL achieves 85.02% Dice and improves at least 1.91% for brain tumors, achieves 73.18% Dice and improves at least 4.28% for liver tumors, and achieves 70.85% Dice and improves at least 2.73% for pancreas compared to the ten most recent advanced methods, our results attest to the superiority of DGARL. Code is available at GitHub.

4.
Nat Commun ; 15(1): 3403, 2024 Apr 22.
Article in English | MEDLINE | ID: mdl-38649683

ABSTRACT

The corpus callosum, historically considered primarily for homotopic connections, supports many heterotopic connections, indicating complex interhemispheric connectivity. Understanding this complexity is crucial yet challenging due to diverse cell-specific wiring patterns. Here, we utilized public AAV bulk tracing and single-neuron tracing data to delineate the anatomical connection patterns of mouse brains and conducted wide-field calcium imaging to assess functional connectivity across various brain states in male mice. The single-neuron data uncovered complex and dense interconnected patterns, particularly for interhemispheric-heterotopic connections. We proposed a metric "heterogeneity" to quantify the complexity of the connection patterns. Computational modeling of these patterns suggested that the heterogeneity of upstream projections impacted downstream homotopic functional connectivity. Furthermore, higher heterogeneity observed in interhemispheric-heterotopic projections would cause lower strength but higher stability in functional connectivity than their intrahemispheric counterparts. These findings were corroborated by our wide-field functional imaging data, underscoring the important role of heterotopic-projection heterogeneity in interhemispheric communication.


Subject(s)
Corpus Callosum , Neurons , Animals , Corpus Callosum/physiology , Male , Mice , Neurons/physiology , Neural Pathways/physiology , Connectome , Brain/physiology , Computer Simulation , Models, Neurological , Nerve Net/physiology , Calcium/metabolism
5.
Elife ; 122024 Apr 18.
Article in English | MEDLINE | ID: mdl-38635322

ABSTRACT

Cortical folding is an important feature of primate brains that plays a crucial role in various cognitive and behavioral processes. Extensive research has revealed both similarities and differences in folding morphology and brain function among primates including macaque and human. The folding morphology is the basis of brain function, making cross-species studies on folding morphology important for understanding brain function and species evolution. However, prior studies on cross-species folding morphology mainly focused on partial regions of the cortex instead of the entire brain. Previously, our research defined a whole-brain landmark based on folding morphology: the gyral peak. It was found to exist stably across individuals and ages in both human and macaque brains. Shared and unique gyral peaks in human and macaque are identified in this study, and their similarities and differences in spatial distribution, anatomical morphology, and functional connectivity were also dicussed.


Subject(s)
Brain , Macaca , Animals , Humans
6.
Neural Netw ; 174: 106219, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38442489

ABSTRACT

Extrapolating future events based on historical information in temporal knowledge graphs (TKGs) holds significant research value and practical applications. In this field, the methods currently utilized can be classified as either embedding-based or logical rule-based. Embedding-based methods depend on learned entity and relation embeddings for prediction, but they suffer from the lack of interpretability due to the opaque reasoning process. On the other hand, logical rule-based methods face scalability challenges as they heavily rely on predefined logical rules. To overcome these limitations, we propose a hybrid model that combines embedding-based and logical rule-based methods to capture deep causal logic. Our model, called the Inductive Reasoning Model based on Interpretable Logical Rule (ILR-IR), aims to provide interpretable insights while effectively predicting future events in TKGs. ILR-IR delves into historical information, extracting valuable insights from logical rules embedded within relations and interaction preferences between entities. By considering both logical rules and interaction preferences, ILR-IR offers a comprehensive perspective for predicting future events. In addition, we propose the incorporation of a one-class augmented matching loss during optimization, which serves to enhance performance of the model during training. We evaluate ILR-IR on multiple datasets, including ICEWS14, ICEWS0515, and ICEWS18. Experimental results demonstrate that ILR-IR outperforms state-of-the-art baselines, showcasing its superior performance in TKG extrapolation reasoning. Moreover, ILR-IR demonstrates remarkable generalization capabilities, even when applied to related datasets that share a common relation vocabulary. This suggests that our proposed model exhibits robust zero-shot reasoning abilities. For interested parties, we have made our code publicly available at https://github.com/mxadorable/ILR-IR.


Subject(s)
Pattern Recognition, Automated , Problem Solving , Learning , Generalization, Psychological , Knowledge
7.
Waste Manag ; 179: 120-129, 2024 Apr 30.
Article in English | MEDLINE | ID: mdl-38471250

ABSTRACT

Traditional cathode recycling methods have become outdated amid growing concerns for high-value output and environmental friendliness in spent Li-ion battery (LIB) recycling. Our study presents a closed-loop approach that involves selective sulfurization roasting, water leaching, and regeneration, efficiently transforming spent ternary Li batteries (i.e., NCM) into high-performance cathode materials. By combining experimental investigations with density functional theory (DFT) calculations, we elucidate the mechanisms within the NCM-C-S roasting system, providing a theoretical foundation for selective sulfidation. Utilizing in situ X-ray diffraction techniques and a series of consecutive experiments, the study meticulously tracks the evolution of regenerating cathode materials that use transition metal sulfides as their primary raw materials. The Li-rich regenerated NCM exhibits exceptional electrochemical performance, including long-term cycling, high-rate capabilities, reversibility, and stability. The closed-loop approach highlights the sustainability and environmental friendliness of this recycling process, with potential applications in other cathode materials, such as LiCoO2 and LiMn2O4. Compared with traditional methods, this short process approach avoids the complexity of leaching, solvent extraction, and reverse extraction, significantly increasing metal utilization and Li recovery rates while reducing pollution and resource waste.


Subject(s)
Lithium , Metals , Electric Power Supplies , Electrodes , Recycling , Ions
8.
Med Image Anal ; 94: 103136, 2024 May.
Article in English | MEDLINE | ID: mdl-38489895

ABSTRACT

Decoding brain states under different cognitive tasks from functional magnetic resonance imaging (fMRI) data has attracted great attention in the neuroimaging filed. However, the well-known temporal dependency in fMRI sequences has not been fully exploited in existing studies, due to the limited temporal-modeling capacity of the backbone machine learning algorithms and rigid training sample organization strategies upon which the brain decoding methods are built. To address these limitations, we propose a novel method for fine-grain brain state decoding, namely, group deep bidirectional recurrent neural network (Group-DBRNN) model. We first propose a training sample organization strategy that consists of a group-task sample generation module and a multiple-scale random fragment strategy (MRFS) module to collect training samples that contain rich task-relevant brain activity contrast (i.e., the comparison of neural activity patterns between different tasks) and maintain the temporal dependency. We then develop a novel decoding model by replacing the unidirectional RNNs that are widely used in existing brain state decoding studies with bidirectional stacked RNNs to better capture the temporal dependency, and by introducing a multi-task interaction layer (MTIL) module to effectively model the task-relevant brain activity contrast. Our experimental results on the Human Connectome Project task fMRI dataset (7 tasks consisting of 23 task sub-type states) show that the proposed model achieves an average decoding accuracy of 94.7% over the 23 fine-grain sub-type states. Meanwhile, our extensive interpretations of the intermediate features learned in the proposed model via visualizations and quantitative assessments of their discriminability and inter-subject alignment evidence that the proposed model can effectively capture the temporal dependency and task-relevant contrast.


Subject(s)
Brain , Connectome , Humans , Brain/diagnostic imaging , Neural Networks, Computer , Connectome/methods , Algorithms , Magnetic Resonance Imaging/methods
9.
Small ; : e2310978, 2024 Mar 21.
Article in English | MEDLINE | ID: mdl-38513253

ABSTRACT

Rechargeable sodium chloride (Na-Cl2) batteries have emerged as promising alternatives for next-generation energy storage due to their superior energy density and sodium abundance. However, their practical applications are hindered by the sluggish chlorine cathode kinetics related to the aggregation of NaCl and its difficult transformation into Cl2. Herein, the study, for the first time from the perspective of electrode level in Na-Cl2 batteries, proposes a free-standing carbon cathode host with customized vertical channels to facilitate the SOCl2 transport and regulate the NaCl deposition. Accordingly, electrode kinetics are significantly enhanced, and the deposited NaCl is distributed evenly across the whole electrode, avoiding the blockage of pores in the carbon host, and facilitating its oxidation to Cl2. With this low-polarization cathode, the Na-Cl2 batteries can deliver a practically high areal capacity approaching 4 mAh cm-2 and a long cycle life of over 170 cycles. This work demonstrates the significance of pore engineering in electrodes for mediating chlorine conversion kinetics in rechargeable alkali-metal-Cl2 batteries.

10.
IEEE Trans Image Process ; 33: 2851-2866, 2024.
Article in English | MEDLINE | ID: mdl-38358877

ABSTRACT

Gaze estimation is an important fundamental task in computer vision and medical research. Existing works have explored various effective paradigms and modules for precisely predicting eye gazes. However, the uncertainty for gaze estimation, e.g., input uncertainty and annotation uncertainty, have been neglected in previous research. Existing models use a deterministic function to estimate the gaze, which cannot reflect the actual situation in gaze estimation. To address this issue, we propose a probabilistic framework for gaze estimation by modeling the input uncertainty and annotation uncertainty. We first utilize probabilistic embeddings to model the input uncertainty, representing the input image as a Gaussian distribution in the embedding space. Based on the input uncertainty modeling, we give an instance-wise uncertainty estimation to measure the confidence of prediction results, which is critical in practical applications. Then, we propose a new label distribution learning method, probabilistic annotations, to model the annotation uncertainty, representing the raw hard labels as Gaussian distributions. In addition, we develop an Embedding Distribution Smoothing (EDS) module and a hard example mining method to improve the consistency between embedding distribution and label distribution. We conduct extensive experiments, demonstrating that the proposed approach achieves significant improvements over baseline and state-of-the-art methods on two widely used benchmark datasets, GazeCapture and MPIIFaceGaze, as well as our collected dataset using mobile devices.

11.
J Environ Manage ; 353: 120148, 2024 Feb 27.
Article in English | MEDLINE | ID: mdl-38306856

ABSTRACT

Arsenic-bearing neutralization (ABN) sludge is a classical hazardous waste commonly found in nonferrous metallurgy. However, the current storage of these hazardous wastes not only has to pay costly hazardous waste taxes but also poses significant risks to both the environment and human health. To address these issues and achieve the comprehensive utilization and minimization of ABN sludge, this study proposes a new combined process. The process involves selective reduction roasting, leaching, and carbonation, through which, the arsenate and gypsum in the ABN sludge were recovered in the form of As(s), high-purity CaCO3, and H2S. The selective reduction behaviors of arsenate and gypsum were investigated through thermodynamic analysis and roasting experiments. The results indicated that the 95.35 % arsenate and 96.55 % gypsum in the sludge were selectively reduced to As4(g) and CaS at 950 °C by carbothermic reduction. The As4(g) was condensed to As(s) and enriched in the dust (As, 96.78 wt %). In the leaching process, H2S gas was adopted to promote the leaching of CaS, and resulted in 97.41 % of CaS in the roasted product was selectively leached in the form of Ca(HS)2, leading to a 74.11 % reduction in the weight of the ABN sludge. Then, the Ca(HS)2 was subjected to capture CO2 for the separation of Ca2+ and S2-. The result depicted that 99.69 % of Ca2+ and 99.12 % of S2- were separated as high-purity (99.12 wt %) CaCO3 and H2S (24.89 vol %) by controlling the terminal carbonation pH to below 6.55. The generated H2S can be economically converted to sulfur by the Clause process. The whole process realized the comprehensive resource recovery and the minimization of the sludge, which provides an alternative solution for the clean treatment of hazardous ABN waste.


Subject(s)
Arsenic , Humans , Arsenic/analysis , Sewage , Arsenates , Calcium Sulfate , Hazardous Waste
12.
J Neuroinflammation ; 21(1): 10, 2024 Jan 04.
Article in English | MEDLINE | ID: mdl-38178152

ABSTRACT

Myasthenia gravis is an autoimmune disease characterized by pathogenic antibodies that target structures of the neuromuscular junction. However, some patients also experience autonomic dysfunction, anxiety, depression, and other neurological symptoms, suggesting the complex nature of the neurological manifestations. With the aim of explaining the symptoms related to the central nervous system, we utilized a rat model to investigate the impact of dopamine signaling in the central nervous and peripheral circulation. We adopted several screening methods, including western blot, quantitative PCR, mass spectrum technique, immunohistochemistry, immunofluorescence staining, and flow cytometry. In this study, we observed increased and activated dopamine signaling in both the central nervous system and peripheral circulation of myasthenia gravis rats. Furthermore, changes in the expression of two key molecules, Claudin5 and CD31, in endothelial cells of the blood-brain barrier were also examined in these rats. We also confirmed that dopamine incubation reduced the expression of ZO1, Claudin5, and CD31 in endothelial cells by inhibiting the Wnt/ß-catenin signaling pathway. Overall, this study provides novel evidence suggesting that pathologically elevated dopamine in both the central nervous and peripheral circulation of myasthenia gravis rats impair brain-blood barrier integrity by inhibiting junction protein expression in brain microvascular endothelial cells through the Wnt/ß-catenin pathway.


Subject(s)
Dopamine , Myasthenia Gravis , Humans , Rats , Animals , Dopamine/metabolism , Endothelial Cells/metabolism , Brain , Blood-Brain Barrier/metabolism , Wnt Signaling Pathway/physiology , Myasthenia Gravis/metabolism
13.
Brain Struct Funct ; 229(2): 431-442, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38193918

ABSTRACT

Disentangling functional difference between cortical folding patterns of gyri and sulci provides novel insights into the relationship between brain structure and function. Previous studies using resting-state functional magnetic resonance imaging (rsfMRI) have revealed that sulcal signals exhibit stronger high-frequency but weaker low-frequency components compared to gyral ones, suggesting that gyri may serve as functional integration centers while sulci are segregated local processing units. In this study, we utilize naturalistic paradigm fMRI (nfMRI) to explore the functional difference between gyri and sulci as it has proven to record stronger functional integrations compared to rsfMRI. We adopt a convolutional neural network (CNN) to classify gyral and sulcal fMRI signals in the whole brain (the global model) and within functional brain networks (the local models). The frequency-specific difference between gyri and sulci is then inferred from the power spectral density (PSD) profiles of the learned filters in the CNN model. Our experimental results show that nfMRI shows higher gyral-sulcal PSD contrast effect sizes in the global model compared to rsfMRI. In the local models, the effect sizes are either increased or decreased depending on frequency bands and functional complexity of the FBNs. This study highlights the advantages of nfMRI in depicting the functional difference between gyri and sulci, and provides novel insights into unraveling the relationship between brain structure and function.


Subject(s)
Cerebral Cortex , Magnetic Resonance Imaging , Magnetic Resonance Imaging/methods , Brain/diagnostic imaging , Neural Networks, Computer , Head
14.
Neural Netw ; 171: 159-170, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38091760

ABSTRACT

Nuclei detection is one of the most fundamental and challenging problems in histopathological image analysis, which can localize nuclei to provide effective computer-aided cancer diagnosis, treatment decision, and prognosis. The fully-supervised nuclei detector requires a large number of nuclei annotations on high-resolution digital images, which is time-consuming and needs human annotators with professional knowledge. In recent years, weakly-supervised learning has attracted significant attention in reducing the labeling burden. However, detecting dense nuclei of complex crowded distribution and diverse appearances remains a challenge. To solve this problem, we propose a novel point-supervised dense nuclei detection framework that introduces position-based anchor optimization to complete morphology-based pseudo-label supervision. Specifically, we first generate cellular-level pseudo labels (CPL) for the detection head via a morphology-based mechanism, which can help to build a baseline point-supervised detection network. Then, considering the crowded distribution of the dense nuclei, we propose a mechanism called Position-based Anchor-quality Estimation (PAE), which utilizes the positional deviation between an anchor and its corresponding point label to suppress low-quality detections far from each nucleus. Finally, to better handle the diverse appearances of nuclei, an Adaptive Anchor Selector (AAS) operation is proposed to automatically select positive and negative anchors according to morphological and positional statistical characteristics of nuclei. We conduct comprehensive experiments on two widely used benchmarks, MO and Lizard, using ResNet50 and PVTv2 as backbones. The results demonstrate that the proposed approach has superior capacity compared with other state-of-the-art methods. In particularly, in dense nuclei scenarios, our method can achieve 95.1% performance of the fully-supervised approach. The code is available at https://github.com/NucleiDet/DenseNucleiDet.


Subject(s)
Benchmarking , Diagnosis, Computer-Assisted , Humans , Image Processing, Computer-Assisted , Knowledge , Supervised Machine Learning
15.
IEEE Trans Med Imaging ; 43(3): 928-939, 2024 Mar.
Article in English | MEDLINE | ID: mdl-37847615

ABSTRACT

Neurodegenerative disorders usually happen stage-by-stage rather than overnight. Thus, cross-sectional brain imaging genetic methods could be insufficient to identify genetic risk factors. Repeatedly collecting imaging data over time appears to solve the problem. But most existing imaging genetic methods only use longitudinal imaging phenotypes straightforwardly, ignoring the disease progression trajectory which might be a more stable disease signature. In this paper, we propose a novel sparse multi-task mixed-effects longitudinal imaging genetic method (SMMLING). In our model, disease progression fitting and genetic risk factors identification are conducted jointly. Specifically, SMMLING models the disease progression using longitudinal imaging phenotypes, and then associates fitted disease progression with genetic variations. The baseline status and changing rate, i.e., the intercept and slope, of the progression trajectory thus shoulder the responsibility to discover loci of interest, which would have superior and stable performance. To facilitate the interpretation and stability, we employ l2,1 -norm and the fused group lasso (FGL) penalty to identify loci at both the individual level and group level. SMMLING can be solved by an efficient optimization algorithm which is guaranteed to converge to the global optimum. We evaluate SMMLING on synthetic data and real longitudinal neuroimaging genetic data. Both results show that, compared to existing longitudinal methods, SMMLING can not only decrease the modeling error but also identify more accurate and relevant genetic factors. Most risk loci reported by SMMLING are missed by comparison methods, implicating its superiority in genetic risk factors identification. Consequently, SMMLING could be a promising computational method for longitudinal imaging genetics.


Subject(s)
Alzheimer Disease , Humans , Cross-Sectional Studies , Alzheimer Disease/genetics , Neuroimaging/methods , Brain/diagnostic imaging , Phenotype , Algorithms , Disease Progression , Risk Factors , Magnetic Resonance Imaging/methods
16.
Adv Mater ; 36(5): e2307091, 2024 Feb.
Article in English | MEDLINE | ID: mdl-37857473

ABSTRACT

The ever-growing demand for resources sustainability has promoted the recycle of spent lithium-ion batteries to a strategic position. Direct recycle outperforms either hydrometallurgical or pyrometallurgical approaches due to the high added value and facile treatment processes. However, the traditional direct recycling technologies are only applicable for Ni-poor/middle cathodes. Herein, spent Ni-rich LiNi0.8 Co0.1 Mn0.1 O2 (S-NCM) to performance-enhanced single-crystalline cathode materials is directly recycled using a simple but effective LiOH-NaCl molten salt. The evolution process of the Li-supplement and grain-recrystallization during regeneration is systematically investigated, and the successful recovery of the highly degraded microstructure is comprehensively proven, including significant elimination of Ni2+ and O vacancies. Beneficial from the favorable reconstructed single-crystalline particles, the regenerated NCM (R-NCM) represents remarkably enhanced structural stability, electrochemical activity, O2 and cracks suppression during charge/discharge, thus achieving the excellent performances in long-term cycling and high-rate tests. As a result, R-NCM maintains the 86.5% reversible capacity at 1 C after 200 cycles. Instructively, the present molten salt can be successfully applied for recycling spent NCMs with various Li and Ni compositions (e.g., LiNi0.5 Co0.2 Mn0.3 O2 ).

17.
bioRxiv ; 2024 Feb 05.
Article in English | MEDLINE | ID: mdl-37546923

ABSTRACT

Cortical folding is an important feature of primate brains that plays a crucial role in various cognitive and behavioral processes. Extensive research has revealed both similarities and differences in folding morphology and brain function among primates including macaque and human. The folding morphology is the basis of brain function, making cross-species studies on folding morphology important for understanding brain function and species evolution. However, prior studies on cross-species folding morphology mainly focused on partial regions of the cortex instead of the entire brain. Previously, we defined a whole-brain landmark based on folding morphology: the gyral peak. It was found to exist stably across individuals and ages in both human and macaque brains. In this study, we identified shared and unique gyral peaks in human and macaque, and investigated the similarities and differences in the spatial distribution, anatomical morphology, and functional connectivity of them.

18.
Article in English | MEDLINE | ID: mdl-38117621

ABSTRACT

Weakly supervised semantic segmentation (WSSS) is a challenging yet important research field in vision community. In WSSS, the key problem is to generate high-quality pseudo segmentation masks (PSMs). Existing approaches mainly depend on the discriminative object part to generate PSMs, which would inevitably miss object parts or involve surrounding image background, as the learning process is unaware of the full object structure. In fact, both the discriminative object part and the full object structure are critical for deriving of high-quality PSMs. To fully explore these two information cues, we build a novel end-to-end learning framework, alternate self-dual teaching (ASDT), based on a dual-teacher single-student network architecture. The information interaction among different network branches is formulated in the form of knowledge distillation (KD). Unlike the conventional KD, the knowledge of the two teacher models would inevitably be noisy under weak supervision. Inspired by the Pulse Width (PW) modulation, we introduce a PW wave-like selection signal to alleviate the influence of the imperfect knowledge from either teacher model on the KD process. Comprehensive experiments on the PASCAL VOC 2012 and COCO-Stuff 10K demonstrate the effectiveness of the proposed ASDT framework, and new state-of-the-art results are achieved.

19.
IEEE Trans Image Process ; 32: 6168-6182, 2023.
Article in English | MEDLINE | ID: mdl-37938957

ABSTRACT

An essential need for accurate visual object tracking is to capture better correlations between the tracking target and the search region. However, the dominant Siamese-based trackers are limited to producing dense similarity maps at once via a cross-correlations operation, ignoring to remedy the contamination caused by erroneous or ambiguous matches. In this paper, we propose a novel tracker, termed neighborhood consensus constraint-based siamese tracker (NCSiam), which takes the idea of neighborhood consensus constraint to refine the produced correlation maps. The intuition behind our approach is that we can support the nearby erroneous or ambiguous matches by analyzing a larger context of the scene that contains a unique match. Specifically, we devise a 4D convolution-based multi-level similarity refinement (MLSR) strategy. Taking the primary similarity maps obtained from a cross-correlation as input, MLSR acquires reliable matches by analyzing neighborhood consensus patterns in 4D space, thus enhancing the discriminability between the tracking target and the distractors. Besides, traditional Siamese-based trackers directly perform classification and regression on similarity response maps which discard appearance or semantic information. Therefore, an appearance affinity decoder (AAD) is developed to take full advantage of the semantic information of the search region. To further improve performance, we design a task-specific disentanglement (TSD) module to decouple the learned representations into classification-specific and regression-specific embeddings. Extensive experiments are conducted on six challenging benchmarks, including GOT-10k, TrackingNet, LaSOT, UAV123, OTB2015, and VOT2020. The results demonstrate the effectiveness of our method. The code will be available at https://github.com/laybebe/NCSiam.

20.
IEEE Trans Biomed Eng ; 70(10): 2799-2808, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37695956

ABSTRACT

One-shot organ segmentation (OS2) aims at segmenting the desired organ regions from the input medical imaging data with only one pre-annotated example as the reference. By using the minimal annotation data to facilitate organ segmentation, OS2 receives great attention in the medical image analysis community due to its weak requirement on human annotation. In OS2, one core issue is to explore the mutual information between the support (reference slice) and the query (test slice). Existing methods rely heavily on the similarity between slices, and additional slice allocation mechanisms need to be designed to reduce the impact of the similarity between slices on the segmentation performance. To address this issue, we build a novel support-query interactive embedding (SQIE) module, which is equipped with the channel-wise co-attention, spatial-wise co-attention, and spatial bias transformation blocks to identify "what to look", "where to look", and "how to look" in the input test slice. By combining the three mechanisms, we can mine the interactive information of the intersection area and the disputed area between slices, and establish the feature connection between the target in slices with low similarity. We also propose a self-supervised contrastive learning framework, which transforms knowledge from the physical position to the embedding space to facilitate the self-supervised interactive embedding of the query and support slices. Comprehensive experiments on two large benchmarks demonstrate the superior capacity of the proposed approach when compared with the current alternatives and baseline models.

SELECTION OF CITATIONS
SEARCH DETAIL
...