Your browser doesn't support javascript.
loading
Montrer: 20 | 50 | 100
Résultats 1 - 14 de 14
Filtrer
1.
Article de Chinois | WPRIM | ID: wpr-1019896

RÉSUMÉ

Objective The objective of this study is to improve the accuracy of automatic identification in complex background herbal slice images.The goal is to achieve accurate recognition of herbal slice images in the presence of complex backgrounds.Methods The experiment was conducted on a collected and organized dataset of Tibetan herbal slice images.The RGB,HOG,and LBP features of the slices were analyzed.An improved HOG algorithm was used to fuse multiple features,and a deep learning network was utilized for image recognition.Results The proposed method of multi-feature fusion combined with deep learning achieved an identification accuracy of 91.68%on 3610 Tibetan herbal slice images with complex backgrounds.Furthermore,the average identification accuracy for 20 common traditional Chinese medicine slices,such as Chuan Beimu,Hawthorn,and Pinellia,reached 98.00%.This method outperformed existing methods for identifying herbal slices in complex backgrounds,indicating its feasibility and wide applicability for the identification of other traditional Chinese herbal medicines.Conclusion The fusion of multiple features effectively captures distinguishing characteristics of herbal slices in complex backgrounds.It exhibits high recognition rates for Tibetan herbal slices with complex and heavily occluded backgrounds,and can be successfully applied to the recognition of natural scene-based traditional Chinese herbal medicines and herbal slices.This approach shows promising prospects for practical applications.

2.
Article de Chinois | WPRIM | ID: wpr-1031682

RÉSUMÉ

@#Objective To propose a heart sound segmentation method based on multi-feature fusion network. Methods Data were obtained from the CinC/PhysioNet 2016 Challenge dataset (a total of 3 153 recordings from 764 patients, about 91.93% of whom were male, with an average age of 30.36 years). Firstly the features were extracted in time domain and time-frequency domain respectively, and reduced redundant features by feature dimensionality reduction. Then, we selected optimal features separately from the two feature spaces that performed best through feature selection. Next, the multi-feature fusion was completed through multi-scale dilated convolution, cooperative fusion, and channel attention mechanism. Finally, the fused features were fed into a bidirectional gated recurrent unit (BiGRU) network to heart sound segmentation results. Results The proposed method achieved precision, recall and F1 score of 96.70%, 96.99%, and 96.84% respectively. Conclusion The multi-feature fusion network proposed in this study has better heart sound segmentation performance, which can provide high-accuracy heart sound segmentation technology support for the design of automatic analysis of heart diseases based on heart sounds.

3.
Article de Chinois | WPRIM | ID: wpr-1026228

RÉSUMÉ

In response to the challenges of varied sizes and diverse shapes of colorectal polyps,especially with blurred boundaries that often complicates localization and smaller polyps being particularly prone to oversight,a colorectal polyp segmentation algorithm integrating Transformer and convolution is proposed.Transformer is employed to extract global features from images for ensuring the network's capability for global modeling and improving the localization capability for both main polyp regions and vague boundaries.Subsequently,convolution is introduced to augment the network's ability to process polyp details,refining boundary segmentation and enhancing the capture capability for small-sized polyps.Finally,a deep fusion of the features extracted by Transformer and convolution is carried out to realize feature complementarity.The experimental evaluation using CVC-ClinicDB and Kvasir-SEG datasets show that the algorithm has similarity coefficients of 95.4%and 93.2%,and mean intersection over union of 91.3%and 88.6%,respectively.Further tests on the generalization capability of the algorithm are conducted on CVC-ColonDB,CVC-T,and ETIS datasets,in which similarity coefficients of 81.3%,90.9%and 80.1%are obtained.The results indicate a notable improvement in the accuracy of polyp segmentation achieved by the proposed algorithm.

4.
Article de Chinois | WPRIM | ID: wpr-981562

RÉSUMÉ

The recurrent neural network architecture improves the processing ability of time-series data. However, issues such as exploding gradients and poor feature extraction limit its application in the automatic diagnosis of mild cognitive impairment (MCI). This paper proposed a research approach for building an MCI diagnostic model using a Bayesian-optimized bidirectional long short-term memory network (BO-BiLSTM) to address this problem. The diagnostic model was based on a Bayesian algorithm and combined prior distribution and posterior probability results to optimize the BO-BiLSTM network hyperparameters. It also used multiple feature quantities that fully reflected the cognitive state of the MCI brain, such as power spectral density, fuzzy entropy, and multifractal spectrum, as the input of the diagnostic model to achieve automatic MCI diagnosis. The results showed that the feature-fused Bayesian-optimized BiLSTM network model achieved an MCI diagnostic accuracy of 98.64% and effectively completed the diagnostic assessment of MCI. In conclusion, based on this optimization, the long short-term neural network model has achieved automatic diagnostic assessment of MCI, providing a new diagnostic model for intelligent diagnosis of MCI.


Sujet(s)
Humains , Théorème de Bayes , , Algorithmes , Encéphale , Dysfonctionnement cognitif/diagnostic
5.
Journal of Medical Biomechanics ; (6): E324-E330, 2023.
Article de Chinois | WPRIM | ID: wpr-987954

RÉSUMÉ

Objective Aiming at the problems of lacking initiative in upper limb rehabilitation training equipment, single training mode, and low active participation of patients, an upper limb continuous motion estimation algorithm model based on multi-modal information fusion was proposed, so to realize accurate estimation of elbow joint torque. Methods Firstly, the surface electromyography (sEMG) signal and posture signal of participants were collected at four angular velocities, and the time domain characteristics of the signal were extracted. The principal component analysis was adopted to multi-feature fusion. The back propagation neural network (BPNN) was optimized through the additional momentum and the adaptive learning rate method. The particle swarm optimization (PSO) algorithm was used to optimize the neural network and a continuous motion estimation model based on PSO-BPNN was constructed. Finally, the joint torque calculated by the second type of Lagrangian equation was used as the accurate value to train the model. The performance of the model was compared with the traditional BP neural network model. Results The root mean square error (RMSE) of the traditional BP neural network model was 558.9 N·m, and the R2 coefficient was 77.19%, Whereas the RMSE and the R2 coefficient of the optimized model were 113.6 mN·m and 99.12%, respectively.Thereby, the accuracy of torque estimation was improved apparently. Conclusions The method for continuous motion estimation of the elbow joint proposed in this study can estimate the motion intention accurately, and provide a practical scheme for the active control of upper exoskeleton rehabilitation robot.

6.
Article de Chinois | WPRIM | ID: wpr-1008915

RÉSUMÉ

Magnetic resonance imaging(MRI) can obtain multi-modal images with different contrast, which provides rich information for clinical diagnosis. However, some contrast images are not scanned or the quality of the acquired images cannot meet the diagnostic requirements due to the difficulty of patient's cooperation or the limitation of scanning conditions. Image synthesis techniques have become a method to compensate for such image deficiencies. In recent years, deep learning has been widely used in the field of MRI synthesis. In this paper, a synthesis network based on multi-modal fusion is proposed, which firstly uses a feature encoder to encode the features of multiple unimodal images separately, and then fuses the features of different modal images through a feature fusion module, and finally generates the target modal image. The similarity measure between the target image and the predicted image in the network is improved by introducing a dynamic weighted combined loss function based on the spatial domain and K-space domain. After experimental validation and quantitative comparison, the multi-modal fusion deep learning network proposed in this paper can effectively synthesize high-quality MRI fluid-attenuated inversion recovery (FLAIR) images. In summary, the method proposed in this paper can reduce MRI scanning time of the patient, as well as solve the clinical problem of missing FLAIR images or image quality that is difficult to meet diagnostic requirements.


Sujet(s)
Humains , Apprentissage profond , Imagerie par résonance magnétique/méthodes , Traitement d'image par ordinateur/méthodes
7.
Chinese Journal of Medical Physics ; (6): 1477-1485, 2023.
Article de Chinois | WPRIM | ID: wpr-1026167

RÉSUMÉ

In view of numerous subtle features in fundus disease images,small sample sizes,and difficulties in diagnosis,both deep learning and medical imaging technologies are used to develop a fundus disease diagnosis model that integrates multi-scale features and hybrid domain attention mechanism.Resnet50 network is taken as the baseline network,and it is modified in the study.The method uses parallel multi-branch architecture to extract the features of fundus diseases under different receptive fields for effectively improving the feature extraction ability and computational efficiency,and adopts hybrid domain attention mechanism to select information that is more critical to the current task for effectively enhancing the classification performance.The test on ODIR dataset shows that the proposed method has a diagnostic accuracy of 93.2%for different fundus diseases,which is 5.2%higher than the baseline network,demonstrating a good diagnostic performance.

8.
Journal of Biomedical Engineering ; (6): 1065-1073, 2022.
Article de Chinois | WPRIM | ID: wpr-970643

RÉSUMÉ

The effective classification of multi-task motor imagery electroencephalogram (EEG) is helpful to achieve accurate multi-dimensional human-computer interaction, and the high frequency domain specificity between subjects can improve the classification accuracy and robustness. Therefore, this paper proposed a multi-task EEG signal classification method based on adaptive time-frequency common spatial pattern (CSP) combined with convolutional neural network (CNN). The characteristics of subjects' personalized rhythm were extracted by adaptive spectrum awareness, and the spatial characteristics were calculated by using the one-versus-rest CSP, and then the composite time-domain characteristics were characterized to construct the spatial-temporal frequency multi-level fusion features. Finally, the CNN was used to perform high-precision and high-robust four-task classification. The algorithm in this paper was verified by the self-test dataset containing 10 subjects (33 ± 3 years old, inexperienced) and the dataset of the 4th 2018 Brain-Computer Interface Competition (BCI competition Ⅳ-2a). The average accuracy of the proposed algorithm for the four-task classification reached 93.96% and 84.04%, respectively. Compared with other advanced algorithms, the average classification accuracy of the proposed algorithm was significantly improved, and the accuracy range error between subjects was significantly reduced in the public dataset. The results show that the proposed algorithm has good performance in multi-task classification, and can effectively improve the classification accuracy and robustness.


Sujet(s)
Humains , Adulte , Imagination , , /méthodes , Électroencéphalographie/méthodes , Algorithmes , Interfaces cerveau-ordinateur , Traitement du signal assisté par ordinateur
9.
Article de Chinois | WPRIM | ID: wpr-888196

RÉSUMÉ

As the most common active brain-computer interaction paradigm, motor imagery brain-computer interface (MI-BCI) suffers from the bottleneck problems of small instruction set and low accuracy, and its information transmission rate (ITR) and practical application are severely limited. In this study, we designed 6-class imagination actions, collected electroencephalogram (EEG) signals from 19 subjects, and studied the effect of collaborative brain-computer interface (cBCI) collaboration strategy on MI-BCI classification performance, the effects of changes in different group sizes and fusion strategies on group multi-classification performance are compared. The results showed that the most suitable group size was 4 people, and the best fusion strategy was decision fusion. In this condition, the classification accuracy of the group reached 77%, which was higher than that of the feature fusion strategy under the same group size (77.31%


Sujet(s)
Humains , Encéphale , Interfaces cerveau-ordinateur , Électroencéphalographie , , Imagination
10.
Article de Chinois | WPRIM | ID: wpr-888224

RÉSUMÉ

Speech feature learning is the core and key of speech recognition method for mental illness. Deep feature learning can automatically extract speech features, but it is limited by the problem of small samples. Traditional feature extraction (original features) can avoid the impact of small samples, but it relies heavily on experience and is poorly adaptive. To solve this problem, this paper proposes a deep embedded hybrid feature sparse stack autoencoder manifold ensemble algorithm. Firstly, based on the prior knowledge, the psychotic speech features are extracted, and the original features are constructed. Secondly, the original features are embedded in the sparse stack autoencoder (deep network), and the output of the hidden layer is filtered to enhance the complementarity between the deep features and the original features. Third, the L1 regularization feature selection mechanism is designed to compress the dimensions of the mixed feature set composed of deep features and original features. Finally, a weighted local preserving projection algorithm and an ensemble learning mechanism are designed, and a manifold projection classifier ensemble model is constructed, which further improves the classification stability of feature fusion under small samples. In addition, this paper designs a medium-to-large-scale psychotic speech collection program for the first time, collects and constructs a large-scale Chinese psychotic speech database for the verification of psychotic speech recognition algorithms. The experimental results show that the main innovation of the algorithm is effective, and the classification accuracy is better than other representative algorithms, and the maximum improvement is 3.3%. In conclusion, this paper proposes a new method of psychotic speech recognition based on embedded mixed sparse stack autoencoder and manifold ensemble, which effectively improves the recognition rate of psychotic speech.


Sujet(s)
Humains , Algorithmes , Bases de données factuelles , Troubles psychotiques , Parole , Perception de la parole
11.
Article de Chinois | WPRIM | ID: wpr-879285

RÉSUMÉ

In order to solve the current problems in medical equipment maintenance, this study proposed an intelligent fault diagnosis method for medical equipment based on long short term memory network(LSTM). Firstly, in the case of no circuit drawings and unknown circuit board signal direction, the symptom phenomenon and port electrical signal of 7 different fault categories were collected, and the feature coding, normalization, fusion and screening were preprocessed. Then, the intelligent fault diagnosis model was built based on LSTM, and the fused and screened multi-modal features were used to carry out the fault diagnosis classification and identification experiment. The results were compared with those using port electrical signal, symptom phenomenon and the fusion of the two types. In addition, the fault diagnosis algorithm was compared with BP neural network (BPNN), recurrent neural network (RNN) and convolution neural network (CNN). The results show that based on the fused and screened multi-modal features, the average classification accuracy of LSTM algorithm model reaches 0.970 9, which is higher than that of using port electrical signal alone, symptom phenomenon alone or the fusion of the two types. It also has higher accuracy than BPNN, RNN and CNN, which provides a relatively feasible new idea for intelligent fault diagnosis of similar equipment.


Sujet(s)
Algorithmes , Électricité , Mémoire à court terme ,
12.
Braz. arch. biol. technol ; 64: e21210296, 2021. tab, graf
Article de Anglais | LILACS-Express | LILACS | ID: biblio-1350262

RÉSUMÉ

Abstract Vehicle re-id play a very import role in recent public safety, it has received more and more attention. The local features (e.g. hanging decorations and stickers) are widely used for vehicle re-id, but the same local feature exists in one perspective, but not exactly exists in other perspectives. In this paper, we firstly use experiments to verify that there is a low linear correlation between different dimension global features. Then we propose a new technique which uses global features instead of local features to distinguish the nuances between different vehicles. We design a vehicle re-identification method named a generated multi branch feature fusion method (GMBFF) to make full use of the complementarity between global features with different dimensions. All branches of the proposed GMBFF model are derived from the same model and there are only slight differences among those branches. Each of those branches can extract highly discriminative features with different dimensions. Finally, we fuse the features extracted by these branches. Existing research uses the fusing features for fusion and we use the global vehicle features for fusion. We also propose two different feature fusion methods which are single fusion method (SFM) and multi fusion method (MFM). In SFM, features for fusion with larger dimension occupy more weight in fused features. MFM overcomes the disadvantage of SFM. Finally, we carry out a lot of experiments on two widely used datasets which are VeRi-776 dataset and Vehicle ID dataset. The experimental results show that our proposed method is much better than the state-of-the-art vehicle re-identification methods.

13.
Article de Chinois | WPRIM | ID: wpr-828149

RÉSUMÉ

Lung nodules are the main manifestation of early lung cancer. So accurate detection of lung nodules is of great significance for early diagnosis and treatment of lung cancer. However, the rapid and accurate detection of pulmonary nodules is a challenging task due to the complex background, large detection range of pulmonary computed tomography (CT) images and the different sizes and shapes of pulmonary nodules. Therefore, this paper proposes a multi-scale feature fusion algorithm for the automatic detection of pulmonary nodules to achieve accurate detection of pulmonary nodules. Firstly, a three-layer modular lung nodule detection model was designed on the deep convolutional network (VGG16) for large-scale image recognition. The first-tier module of the network is used to extract the features of pulmonary nodules in CT images and roughly estimate the location of pulmonary nodules. Then the second-tier module of the network is used to fuse multi-scale image features to further enhance the details of pulmonary nodules. The third-tier module of the network was fused to analyze the features of the first-tier and the second-tier module of the network, and the candidate box of pulmonary nodules in multi-scale was obtained. Finally, the candidate box of pulmonary nodules under multi-scale was analyzed with the method of non-maximum suppression, and the final location of pulmonary nodules was obtained. The algorithm is validated by the data of pulmonary nodules on LIDC-IDRI common data set. The average detection accuracy is 90.9%.

14.
Journal of Biomedical Engineering ; (6): 1056-1064, 2020.
Article de Chinois | WPRIM | ID: wpr-879236

RÉSUMÉ

In the process of lower limb rehabilitation training, fatigue estimation is of great significance to improve the accuracy of intention recognition and avoid secondary injury. However, most of the existing methods only consider surface electromyography (sEMG) features but ignore electrocardiogram (ECG) features when performing in fatigue estimation, which leads to the low and unstable recognition efficiency. Aiming at this problem, a method that uses the fusion features of ECG and sEMG signal to estimate the fatigue during lower limb rehabilitation was proposed, and an improved particle swarm optimization-support vector machine classifier (improved PSO-SVM) was proposed and used to identify the fusion feature vector. Finally, the accurate recognition of the three states of relax, transition and fatigue was achieved, and the recognition rates were 98.5%, 93.5%, and 95.5%, respectively. Comparative experiments showed that the average recognition rate of this method was 4.50% higher than that of sEMG features alone, and 13.66% higher than that of the combined features of ECG and sEMG without feature fusion. It is proved that the feature fusion of ECG and sEMG signals in the process of lower limb rehabilitation training can be used for recognizing fatigue more accurately.


Sujet(s)
Humains , Algorithmes , Électrocardiographie , Électromyographie , Fatigue/diagnostic , Membre inférieur , Machine à vecteur de support
SÉLECTION CITATIONS
DÉTAIL DE RECHERCHE