Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 18 de 18
Filter
1.
Bol. latinoam. Caribe plantas med. aromát ; 23(2): 180-198, mar. 2024. ilus, tab, graf
Article in English | LILACS | ID: biblio-1538281

ABSTRACT

India's commercial advancement and development depend heavily on agriculture. A common fruit grown in tropical settings is citrus. A professional judgment is required while analyzing an illness because different diseases have slight variati ons in their symptoms. In order to recognize and classify diseases in citrus fruits and leaves, a customized CNN - based approach that links CNN with LSTM was developed in this research. By using a CNN - based method, it is possible to automatically differenti ate from healthier fruits and leaves and those that have diseases such fruit blight, fruit greening, fruit scab, and melanoses. In terms of performance, the proposed approach achieves 96% accuracy, 98% sensitivity, 96% Recall, and an F1 - score of 92% for ci trus fruit and leave identification and classification and the proposed method was compared with KNN, SVM, and CNN and concluded that the proposed CNN - based model is more accurate and effective at identifying illnesses in citrus fruits and leaves.


El avance y desarrollo comercial de India dependen en gran medida de la agricultura. Un tipo de fruta comunmente cultivada en en tornos tropicales es el cítrico. Se requiere un juicio profesional al analizar una enfermedad porque diferentes enfermedades tienen ligeras variaciones en sus síntomas. Para reconocer y clasificar enfermedades en frutas y hojas de cítricos, se desarrolló e n esta investigación un enfoque personalizado basado en CNN que vincula CNN con LSTM. Al utilizar un método basado en CNN, es posible diferenciar automáticamente entre frutas y hojas más saludables y aquellas que tienen enfermedades como la plaga de frutas , el verdor de frutas, la sarna de frutas y las melanosis. En términos de desempeño, el enfoque propuesto alcanza una precisión del 96%, una sensibilidad del 98%, una recuperación del 96% y una puntuación F1 del 92% para la identificación y clasificación d e frutas y hojas de cítricos, y el método propuesto se comparó con KNN, SVM y CNN y se concluyó que el modelo basado en CNN propuesto es más preciso y efectivo para identificar enfermedades en frutas y hojas de cítricos.


Subject(s)
Citrus/classification , Citrus/parasitology , Neural Networks, Computer , Plant Leaves/classification , Plant Leaves/parasitology , Artificial Intelligence/trends , Fruit/classification , Fruit/growth & development
2.
Journal of Biomedical Engineering ; (6): 458-464, 2023.
Article in Chinese | WPRIM | ID: wpr-981563

ABSTRACT

Sleep staging is the basis for solving sleep problems. There's an upper limit for the classification accuracy of sleep staging models based on single-channel electroencephalogram (EEG) data and features. To address this problem, this paper proposed an automatic sleep staging model that mixes deep convolutional neural network (DCNN) and bi-directional long short-term memory network (BiLSTM). The model used DCNN to automatically learn the time-frequency domain features of EEG signals, and used BiLSTM to extract the temporal features between the data, fully exploiting the feature information contained in the data to improve the accuracy of automatic sleep staging. At the same time, noise reduction techniques and adaptive synthetic sampling were used to reduce the impact of signal noise and unbalanced data sets on model performance. In this paper, experiments were conducted using the Sleep-European Data Format Database Expanded and the Shanghai Mental Health Center Sleep Database, and achieved an overall accuracy rate of 86.9% and 88.9% respectively. When compared with the basic network model, all the experimental results outperformed the basic network, further demonstrating the validity of this paper's model, which can provide a reference for the construction of a home sleep monitoring system based on single-channel EEG signals.


Subject(s)
China , Sleep Stages , Sleep , Electroencephalography , Databases, Factual
3.
Journal of Biomedical Engineering ; (6): 450-457, 2023.
Article in Chinese | WPRIM | ID: wpr-981562

ABSTRACT

The recurrent neural network architecture improves the processing ability of time-series data. However, issues such as exploding gradients and poor feature extraction limit its application in the automatic diagnosis of mild cognitive impairment (MCI). This paper proposed a research approach for building an MCI diagnostic model using a Bayesian-optimized bidirectional long short-term memory network (BO-BiLSTM) to address this problem. The diagnostic model was based on a Bayesian algorithm and combined prior distribution and posterior probability results to optimize the BO-BiLSTM network hyperparameters. It also used multiple feature quantities that fully reflected the cognitive state of the MCI brain, such as power spectral density, fuzzy entropy, and multifractal spectrum, as the input of the diagnostic model to achieve automatic MCI diagnosis. The results showed that the feature-fused Bayesian-optimized BiLSTM network model achieved an MCI diagnostic accuracy of 98.64% and effectively completed the diagnostic assessment of MCI. In conclusion, based on this optimization, the long short-term neural network model has achieved automatic diagnostic assessment of MCI, providing a new diagnostic model for intelligent diagnosis of MCI.


Subject(s)
Humans , Bayes Theorem , Neural Networks, Computer , Algorithms , Brain , Cognitive Dysfunction/diagnosis
4.
Journal of Biomedical Engineering ; (6): 244-248, 2023.
Article in Chinese | WPRIM | ID: wpr-981535

ABSTRACT

Cardiovascular disease is the leading cause of death worldwide, accounting for 48.0% of all deaths in Europe and 34.3% in the United States. Studies have shown that arterial stiffness takes precedence over vascular structural changes and is therefore considered to be an independent predictor of many cardiovascular diseases. At the same time, the characteristics of Korotkoff signal is related to vascular compliance. The purpose of this study is to explore the feasibility of detecting vascular stiffness based on the characteristics of Korotkoff signal. First, the Korotkoff signals of normal and stiff vessels were collected and preprocessed. Then the scattering features of Korotkoff signal were extracted by wavelet scattering network. Next, the long short-term memory (LSTM) network was established as a classification model to classify the normal and stiff vessels according to the scattering features. Finally, the performance of the classification model was evaluated by some parameters, such as accuracy, sensitivity, and specificity. In this study, 97 cases of Korotkoff signal were collected, including 47 cases from normal vessels and 50 cases from stiff vessels, which were divided into training set and test set according to the ratio of 8 : 2. The accuracy, sensitivity and specificity of the final classification model was 86.4%, 92.3% and 77.8%, respectively. At present, non-invasive screening method for vascular stiffness is very limited. The results of this study show that the characteristics of Korotkoff signal are affected by vascular compliance, and it is feasible to use the characteristics of Korotkoff signal to detect vascular stiffness. This study might be providing a new idea for non-invasive detection of vascular stiffness.


Subject(s)
Humans , Vascular Stiffness , Neural Networks, Computer , Cardiovascular Diseases/diagnosis , Sensitivity and Specificity
5.
Journal of Southern Medical University ; (12): 17-28, 2023.
Article in Chinese | WPRIM | ID: wpr-971490

ABSTRACT

OBJECTIVE@#To propose a semi-supervised epileptic seizure prediction model (ST-WGAN-GP-Bi-LSTM) to enhance the prediction performance by improving time-frequency analysis of electroencephalogram (EEG) signals, enhancing the stability of the unsupervised feature learning model and improving the design of back-end classifier.@*METHODS@#Stockwell transform (ST) of the epileptic EEG signals was performed to locate the time-frequency information by adaptive adjustment of the resolution and retaining the absolute phase to obtain the time-frequency inputs. When there was no overlap between the generated data distribution and the real EEG data distribution, to avoid failure of feature learning due to a constant JS divergence, Wasserstein GAN was used as a feature learning model, and the cost function based on EM distance and gradient penalty strategy was adopted to constrain the unsupervised training process to allow the generation of a high-order feature extractor. A temporal prediction model was finally constructed based on a bi-directional long short term memory network (Bi-LSTM), and the classification performance was improved by obtaining the temporal correlation between high-order time-frequency features. The CHB-MIT scalp EEG dataset was used to validate the proposed patient-specific seizure prediction method.@*RESULTS@#The AUC, sensitivity, and specificity of the proposed method reached 90.40%, 83.62%, and 86.69%, respectively. Compared with the existing semi-supervised methods, the propose method improved the original performance by 17.77%, 15.41%, and 53.66%. The performance of this method was comparable to that of a supervised prediction model based on CNN.@*CONCLUSION@#The utilization of ST, WGAN-GP, and Bi-LSTM effectively improves the prediction performance of the semi-supervised deep learning model, which can be used for optimization of unsupervised feature extraction in epileptic seizure prediction.


Subject(s)
Humans , Memory, Short-Term , Seizures/diagnosis , Electroencephalography
6.
Journal of Biomedical Engineering ; (6): 110-117, 2023.
Article in Chinese | WPRIM | ID: wpr-970680

ABSTRACT

The extraction of neuroimaging features of migraine patients and the design of identification models are of great significance for the auxiliary diagnosis of related diseases. Compared with the commonly used image features, this study directly uses time-series signals to characterize the functional state of the brain in migraine patients and healthy controls, which can effectively utilize the temporal information and reduce the computational effort of classification model training. Firstly, Group Independent Component Analysis and Dictionary Learning were used to segment different brain areas for small-sample groups and then the regional average time-series signals were extracted. Next, the extracted time series were divided equally into multiple subseries to expand the model input sample. Finally, the time series were modeled using a bi-directional long-short term memory network to learn the pre-and-post temporal information within each time series to characterize the periodic brain state changes to improve the diagnostic accuracy of migraine. The results showed that the classification accuracy of migraine patients and healthy controls was 96.94%, the area under the curve was 0.98, and the computation time was relatively shorter. The experiments indicate that the method in this paper has strong applicability, and the combination of time-series feature extraction and bi-directional long-short term memory network model can be better used for the classification and diagnosis of migraine. This work provides a new idea for the lightweight diagnostic model based on small-sample neuroimaging data, and contributes to the exploration of the neural discrimination mechanism of related diseases.


Subject(s)
Humans , Time Factors , Migraine Disorders/diagnostic imaging , Magnetic Resonance Imaging , Brain/diagnostic imaging , Neuroimaging
7.
Journal of Biomedical Engineering ; (6): 1089-1096, 2022.
Article in Chinese | WPRIM | ID: wpr-970646

ABSTRACT

Aiming at the problem that the unbalanced distribution of data in sleep electroencephalogram(EEG) signals and poor comfort in the process of polysomnography information collection will reduce the model's classification ability, this paper proposed a sleep state recognition method using single-channel EEG signals (WKCNN-LSTM) based on one-dimensional width kernel convolutional neural networks(WKCNN) and long-short-term memory networks (LSTM). Firstly, the wavelet denoising and synthetic minority over-sampling technique-Tomek link (SMOTE-Tomek) algorithm were used to preprocess the original sleep EEG signals. Secondly, one-dimensional sleep EEG signals were used as the input of the model, and WKCNN was used to extract frequency-domain features and suppress high-frequency noise. Then, the LSTM layer was used to learn the time-domain features. Finally, normalized exponential function was used on the full connection layer to realize sleep state. The experimental results showed that the classification accuracy of the one-dimensional WKCNN-LSTM model was 91.80% in this paper, which was better than that of similar studies in recent years, and the model had good generalization ability. This study improved classification accuracy of single-channel sleep EEG signals that can be easily utilized in portable sleep monitoring devices.


Subject(s)
Memory, Short-Term , Neural Networks, Computer , Sleep , Electroencephalography/methods , Algorithms
8.
Journal of Biomedical Engineering ; (6): 507-515, 2022.
Article in Chinese | WPRIM | ID: wpr-939618

ABSTRACT

The automatic recognition technology of muscle fatigue has widespread application in the field of kinesiology and rehabilitation medicine. In this paper, we used surface electromyography (sEMG) to study the recognition of leg muscle fatigue during circuit resistance training. The purpose of this study was to solve the problem that the sEMG signals have a lot of noise interference and the recognition accuracy of the existing muscle fatigue recognition model is not high enough. First, we proposed an improved wavelet threshold function denoising algorithm to denoise the sEMG signal. Then, we build a muscle fatigue state recognition model based on long short-term memory (LSTM), and used the Holdout method to evaluate the performance of the model. Finally, the denoising effect of the improved wavelet threshold function denoising method proposed in this paper was compared with the denoising effect of the traditional wavelet threshold denoising method. We compared the performance of the proposed muscle fatigue recognition model with that of particle swarm optimization support vector machine (PSO-SVM) and convolutional neural network (CNN). The results showed that the new wavelet threshold function had better denoising performance than hard and soft threshold functions. The accuracy of LSTM network model in identifying muscle fatigue was 4.89% and 2.47% higher than that of PSO-SVM and CNN, respectively. The sEMG signal denoising method and muscle fatigue recognition model proposed in this paper have important implications for monitoring muscle fatigue during rehabilitation training and exercise.


Subject(s)
Electromyography , Memory, Short-Term , Muscle Fatigue , Neural Networks, Computer , Recognition, Psychology
9.
Journal of Biomedical Engineering ; (6): 301-310, 2022.
Article in Chinese | WPRIM | ID: wpr-928226

ABSTRACT

Electrocardiogram (ECG) can visually reflect the physiological electrical activity of human heart, which is important in the field of arrhythmia detection and classification. To address the negative effect of label imbalance in ECG data on arrhythmia classification, this paper proposes a nested long short-term memory network (NLSTM) model for unbalanced ECG signal classification. The NLSTM is built to learn and memorize the temporal characteristics in complex signals, and the focal loss function is used to reduce the weights of easily identifiable samples. Then the residual attention mechanism is used to modify the assigned weights according to the importance of sample characteristic to solve the sample imbalance problem. Then the synthetic minority over-sampling technique is used to perform a simple manual oversampling process on the Massachusetts institute of technology and Beth Israel hospital arrhythmia (MIT-BIH-AR) database to further increase the classification accuracy of the model. Finally, the MIT-BIH arrhythmia database is applied to experimentally verify the above algorithms. The experimental results show that the proposed method can effectively solve the issues of imbalanced samples and unremarkable features in ECG signals, and the overall accuracy of the model reaches 98.34%. It also significantly improves the recognition and classification of minority samples and has provided a new feasible method for ECG-assisted diagnosis, which has practical application significance.


Subject(s)
Humans , Algorithms , Arrhythmias, Cardiac/diagnosis , Electrocardiography , Memory, Short-Term , Neural Networks, Computer , Signal Processing, Computer-Assisted
10.
Journal of Zhejiang University. Medical sciences ; (6): 1-9, 2022.
Article in English | WPRIM | ID: wpr-928651

ABSTRACT

To compare the performance of generalized additive model (GAM) and long short-term memory recurrent neural network (LSTM-RNN) on the prediction of daily admissions of respiratory diseases with comorbid diabetes. Daily data on air pollutants, meteorological factors and hospital admissions for respiratory diseases from Jan 1st, 2014 to Dec 31st, 2019 in Beijing were collected. LSTM-RNN was used to predict the daily admissions of respiratory diseases with comorbid diabetes, and the results were compared with those of GAM. The evaluation indexes were calculated by five-fold cross validation. Compared with the GAM, the prediction errors of LSTM-RNN were significantly lower [root mean squared error (RMSE): 21.21±3.30 vs. 46.13±7.60, <0.01; mean absolute error (MAE): 14.64±1.99 vs. 36.08±6.20, <0.01], and the value was significantly higher (0.79±0.06 vs. 0.57±0.12, <0.01). In gender stratification, RMSE, MAE and values of LSTM-RNN were better than those of GAM in predicting female admission (all <0.05), but there were no significant difference in predicting male admission between two models (all >0.05). In seasonal stratification, RMSE and MAE of LSTM-RNN were lower than those of GAM in predicting warm season admission (all <0.05), but there was no significant difference in value (>0.05). There were no significant difference in RMSE, MAE and between the two models in predicting cold season admission (all >0.05). In the stratification of functional areas, the RMSE, MAE and values of LSTM-RNN were better than those of GAM in predicting core area admission (all <0.05). has lower prediction errors and better fitting than the GAM, which can provide scientific basis for precise allocation of medical resources in polluted weather in advance.


Subject(s)
Female , Humans , Male , Beijing/epidemiology , Diabetes Mellitus/epidemiology , Hospitalization , Memory, Short-Term , Neural Networks, Computer
11.
Chinese Journal of Medical Instrumentation ; (6): 250-255, 2021.
Article in Chinese | WPRIM | ID: wpr-880461

ABSTRACT

Fetal heart rate plays an essential role in maternal and fetal monitoring and fetal health detection. In this study, a method based on Poincare Plot and LSTM is proposed to realize the high performance classification of abnormal fetal heart rate. Firstly, the original fetal heart rate signal of CTU-UHB database is preprocessed via interpolation, then the sequential fetal heart rate signal is converted into Poincare Plot to obtain nonlinear characteristics of the signals, and then SquenzeNet is used to extract the features of Poincare Plot. Finally, the features extracted by SqueezeNet are classified by LSTM. And the accuracy, the true positive rate and the false positive rate are 98.00%, 100.00%, 92.30% respectively on 2 000 test set data. Compared with the traditional fetal heart rate classification method, all respects are improved. The method proposed in this study has good performance in CTU-UHB fetal monitoring database and has certain practical value in the clinical diagnosis of auxiliary fetal heart rate detection.


Subject(s)
Female , Humans , Pregnancy , Databases, Factual , Fetal Monitoring , Fetus , Heart Rate, Fetal
12.
Journal of Biomedical Engineering ; (6): 361-368, 2021.
Article in Chinese | WPRIM | ID: wpr-879285

ABSTRACT

In order to solve the current problems in medical equipment maintenance, this study proposed an intelligent fault diagnosis method for medical equipment based on long short term memory network(LSTM). Firstly, in the case of no circuit drawings and unknown circuit board signal direction, the symptom phenomenon and port electrical signal of 7 different fault categories were collected, and the feature coding, normalization, fusion and screening were preprocessed. Then, the intelligent fault diagnosis model was built based on LSTM, and the fused and screened multi-modal features were used to carry out the fault diagnosis classification and identification experiment. The results were compared with those using port electrical signal, symptom phenomenon and the fusion of the two types. In addition, the fault diagnosis algorithm was compared with BP neural network (BPNN), recurrent neural network (RNN) and convolution neural network (CNN). The results show that based on the fused and screened multi-modal features, the average classification accuracy of LSTM algorithm model reaches 0.970 9, which is higher than that of using port electrical signal alone, symptom phenomenon alone or the fusion of the two types. It also has higher accuracy than BPNN, RNN and CNN, which provides a relatively feasible new idea for intelligent fault diagnosis of similar equipment.


Subject(s)
Algorithms , Electricity , Memory, Short-Term , Neural Networks, Computer
13.
Journal of Biomedical Engineering ; (6): 257-267, 2021.
Article in Chinese | WPRIM | ID: wpr-879273

ABSTRACT

Fetal electrocardiogram signal extraction is of great significance for perinatal fetal monitoring. In order to improve the prediction accuracy of fetal electrocardiogram signal, this paper proposes a fetal electrocardiogram signal extraction method (GA-LSTM) based on genetic algorithm (GA) optimization with long and short term memory (LSTM) network. Firstly, according to the characteristics of the mixed electrocardiogram signal of the maternal abdominal wall, the global search ability of the GA is used to optimize the number of hidden layer neurons, learning rate and training times of the LSTM network, and the optimal combination of parameters is calculated to make the network topology and the mother body match the characteristics of the mixed signals of the abdominal wall. Then, the LSTM network model is constructed using the optimal network parameters obtained by the GA, and the nonlinear transformation of the maternal chest electrocardiogram signals to the abdominal wall is estimated by the GA-LSTM network. Finally, using the non-linear transformation obtained from the maternal chest electrocardiogram signal and the GA-LSTM network model, the maternal electrocardiogram signal contained in the abdominal wall signal is estimated, and the estimated maternal electrocardiogram signal is subtracted from the mixed abdominal wall signal to obtain a pure fetal electrocardiogram signal. This article uses clinical electrocardiogram signals from two databases for experimental analysis. The final results show that compared with the traditional normalized minimum mean square error (NLMS), genetic algorithm-support vector machine method (GA-SVM) and LSTM network methods, the method proposed in this paper can extract a clearer fetal electrocardiogram signal, and its accuracy, sensitivity, accuracy and overall probability have been better improved. Therefore, the method could extract relatively pure fetal electrocardiogram signals, which has certain application value for perinatal fetal health monitoring.


Subject(s)
Female , Humans , Pregnancy , Algorithms , Electrocardiography , Fetal Monitoring , Memory, Short-Term , Support Vector Machine
14.
Journal of Biomedical Engineering ; (6): 10-20, 2021.
Article in Chinese | WPRIM | ID: wpr-879244

ABSTRACT

Heart sound is one of the common medical signals for diagnosing cardiovascular diseases. This paper studies the binary classification between normal or abnormal heart sounds, and proposes a heart sound classification algorithm based on the joint decision of extreme gradient boosting (XGBoost) and deep neural network, achieving a further improvement in feature extraction and model accuracy. First, the preprocessed heart sound recordings are segmented into four status, and five categories of features are extracted from the signals based on segmentation. The first four categories of features are sieved through recursive feature elimination, which is used as the input of the XGBoost classifier. The last category is the Mel-frequency cepstral coefficient (MFCC), which is used as the input of long short-term memory network (LSTM). Considering the imbalance of the data set, these two classifiers are both improved with weights. Finally, the heterogeneous integrated decision method is adopted to obtain the prediction. The algorithm was applied to the open heart sound database of the PhysioNet Computing in Cardiology(CINC) Challenge in 2016 on the PhysioNet website, to test the sensitivity, specificity, modified accuracy and F score. The results were 93%, 89.4%, 91.2% and 91.3% respectively. Compared with the results of machine learning, convolutional neural networks (CNN) and other methods used by other researchers, the accuracy and sensibility have been obviously improved, which proves that the method in this paper could effectively improve the accuracy of heart sound signal classification, and has great potential in the clinical auxiliary diagnosis application of some cardiovascular diseases.


Subject(s)
Algorithms , Databases, Factual , Heart Sounds , Neural Networks, Computer
15.
Asian Pacific Journal of Tropical Medicine ; (12): 564-574, 2021.
Article in Chinese | WPRIM | ID: wpr-951070

ABSTRACT

Objective: To predict the daily incidence and fatality rates based on long short-term memory (LSTM) in 4 age groups of COVID-19 patients in Mazandaran Province, Iran. Methods: To predict the daily incidence and fatality rates by age groups, this epidemiological study was conducted based on the LSTM model. All data of COVID-19 disease were collected daily for training the LSTM model from February 22, 2020 to April 10, 2021 in the Mazandaran University of Medical Sciences. We defined 4 age groups, i.e., patients under 29, between 30 and 49, between 50 and 59, and over 60 years old. Then, LSTM models were applied to predict the trend of daily incidence and fatality rates from 14 to 40 days in different age groups. The results of different methods were compared with each other. Results: This study evaluated 5 0826 patients and 5 109 deaths with COVID-19 daily in 20 cities of Mazandaran Province. Among the patients, 25 240 were females (49.7%), and 25 586 were males (50.3%). The predicted daily incidence rates on April 11, 2021 were 91.76, 155.84, 150.03, and 325.99 per 100 000 people, respectively; for the fourteenth day April 24, 2021, the predicted daily incidence rates were 35.91, 92.90, 83.74, and 225.68 in each group per 100 000 people. Furthermore, the predicted average daily incidence rates in 40 days for the 4 age groups were 34.25, 95.68, 76.43, and 210.80 per 100 000 people, and the daily fatality rates were 8.38, 4.18, 3.40, 22.53 per 100 000 people according to the established LSTM model. The findings demonstrated the daily incidence and fatality rates of 417.16 and 38.49 per 100 000 people for all age groups over the next 40 days. Conclusions: The results highlighted the proper performance of the LSTM model for predicting the daily incidence and fatality rates. It can clarify the path of spread or decline of the COVID-19 outbreak and the priority of vaccination in age groups.

16.
Journal of Biomedical Engineering ; (6): 447-454, 2021.
Article in Chinese | WPRIM | ID: wpr-888200

ABSTRACT

Emotion plays an important role in people's cognition and communication. By analyzing electroencephalogram (EEG) signals to identify internal emotions and feedback emotional information in an active or passive way, affective brain-computer interactions can effectively promote human-computer interaction. This paper focuses on emotion recognition using EEG. We systematically evaluate the performance of state-of-the-art feature extraction and classification methods with a public-available dataset for emotion analysis using physiological signals (DEAP). The common random split method will lead to high correlation between training and testing samples. Thus, we use block-wise


Subject(s)
Humans , Arousal , Electroencephalography , Emotions , Memory, Short-Term , Neural Networks, Computer
17.
Journal of Biomedical Engineering ; (6): 1-9, 2020.
Article in Chinese | WPRIM | ID: wpr-788902

ABSTRACT

Aiming at the problem that the small samples of critical disease in clinic may lead to prognostic models with poor performance of overfitting, large prediction error and instability, the long short-term memory transferring algorithm (transLSTM) was proposed. Based on the idea of transfer learning, the algorithm leverages the correlation between diseases to transfer information of different disease prognostic models, constructs the effictive model of target disease of small samples with the aid of large data of related diseases, hence improves the prediction performance and reduces the requirement for target training sample quantity. The transLSTM algorithm firstly uses the related disease samples to pretrain partial model parameters, and then further adjusts the whole network with the target training samples. The testing results on MIMIC-Ⅲ database showed that compared with traditional LSTM classification algorithm, the transLSTM algorithm had 0.02-0.07 higher AUROC and 0.05-0.14 larger AUPRC, while its number of training iterations was only 39%-64% of the traditional algorithm. The results of application on sepsis revealed that the transLSTM model of only 100 training samples had comparable mortality prediction performance to the traditional model of 250 training samples. In small sample situations, the transLSTM algorithm has significant advantages with higher prediciton accuracy and faster training speed. It realizes the application of transfer learning in the prognostic model of critical disease with small samples.

18.
Academic Journal of Second Military Medical University ; (12): 497-506, 2019.
Article in Chinese | WPRIM | ID: wpr-837969

ABSTRACT

Objective To propose a conditional random field (CRF) model based on the new word segmentation method Re-entity, and to compare with bi-directional long short-term memory neural network (BiLSTM)-CRF and Lattice-long short-term memory neural network (LSTM). Methods After analyzing the existing entity recognition methods, we proposed CRF method based on Re-entity, BiLSTM-CRF and Lattice-LSTM for the China Conference on Knowledge Graph and Semantic Computing in 2018 (CCKS2018) task one: Chinese clinical named entity recognition, and trained character vector sets at different parameter levels based on different corpora. The comparative experiments on model performance were carried out in the different neural network models for each methods. Finally, the comparative study was carried out based on different input lengths such as the sentence level and the text level. Results Re-entity method can improve the performance of CRF model. Lattice-LSTM model based on sentence level achieved a strict F1-measure of 89.75% on this task, which was higher than the highest F1-measure (89.25%) on the task one of CCKS2018. Conclusion The CRF model based on Re-entity can effectively improve the recognition rate of traditional Chinese medicines in electronic medical records by using normalized Chinese clinical drug. Re-entity method can improve the error accumulation caused by word segmentation in data preprocessing. Lattice structure can better combine the latent semantic information of characters and word sequences. At the same time, sentence-level input can effectively improve the recognition accuracy of neural network models.

SELECTION OF CITATIONS
SEARCH DETAIL