Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 24
Filter
Add more filters










Publication year range
1.
PeerJ Comput Sci ; 10: e1834, 2024.
Article in English | MEDLINE | ID: mdl-38660201

ABSTRACT

Identification of the Internet of Things (IoT) devices has become an essential part of network management to secure the privacy of smart homes and offices. With its wide adoption in the current era, IoT has facilitated the modern age in many ways. However, such proliferation also has associated privacy and data security risks. In the case of smart homes and smart offices, unknown IoT devices increase vulnerabilities and chances of data theft. It is essential to identify the connected devices for secure communication. It is very difficult to maintain the list of rules when the number of connected devices increases and human involvement is necessary to check whether any intruder device has approached the network. Therefore, it is required to automate device identification using machine learning methods. In this article, we propose an accuracy boosting model (ABM) using machine learning models of random forest and extreme gradient boosting. Featuring engineering techniques are employed along with cross-validation to accurately identify IoT devices such as lights, smoke detectors, thermostat, motion sensors, baby monitors, socket, TV, security cameras, and watches. The proposed ensemble model utilizes random forest (RF) and extreme gradient boosting (XGB) as base learners with adaptive boosting. The proposed ensemble model is tested with extensive experiments involving the IoT Device Identification dataset from a public repository. Experimental results indicate a higher accuracy of 91%, precision of 93%, recall of 93%, and F1 score of 93%.

2.
Diagnostics (Basel) ; 13(19)2023 Oct 02.
Article in English | MEDLINE | ID: mdl-37835856

ABSTRACT

Breast cancer is a common cause of female mortality in developing countries. Early detection and treatment are crucial for successful outcomes. Breast cancer develops from breast cells and is considered a leading cause of death in women. This disease is classified into two subtypes: invasive ductal carcinoma (IDC) and ductal carcinoma in situ (DCIS). The advancements in artificial intelligence (AI) and machine learning (ML) techniques have made it possible to develop more accurate and reliable models for diagnosing and treating this disease. From the literature, it is evident that the incorporation of MRI and convolutional neural networks (CNNs) is helpful in breast cancer detection and prevention. In addition, the detection strategies have shown promise in identifying cancerous cells. The CNN Improvements for Breast Cancer Classification (CNNI-BCC) model helps doctors spot breast cancer using a trained deep learning neural network system to categorize breast cancer subtypes. However, they require significant computing power for imaging methods and preprocessing. Therefore, in this research, we proposed an efficient deep learning model that is capable of recognizing breast cancer in computerized mammograms of varying densities. Our research relied on three distinct modules for feature selection: the removal of low-variance features, univariate feature selection, and recursive feature elimination. The craniocaudally and medial-lateral views of mammograms are incorporated. We tested it with a large dataset of 3002 merged pictures gathered from 1501 individuals who had digital mammography performed between February 2007 and May 2015. In this paper, we applied six different categorization models for the diagnosis of breast cancer, including the random forest (RF), decision tree (DT), k-nearest neighbors (KNN), logistic regression (LR), support vector classifier (SVC), and linear support vector classifier (linear SVC). The simulation results prove that our proposed model is highly efficient, as it requires less computational power and is highly accurate.

3.
Humanit Soc Sci Commun ; 10(1): 233, 2023.
Article in English | MEDLINE | ID: mdl-37200567

ABSTRACT

The Food, Beverage & Tobacco (F&B) industry is an essential sector in the competitive economy. Procurement of production factors mainly depends on sales forecasting and the supply chain of raw materials. However, the conflict between Russia and Ukraine has jeopardized the global supply chain. As the conflict worsened, the world faced a food crisis, which was already a significant challenge due to the Covid-19 pandemic. Understanding how conflict-related disruptions in global food markets might affect the stock return of the F&B industry of South Korea, this study forecasts the stock returns on the KOSDAQ F&B sector. This paper highlights that the conflict resulted in immediate and far-reaching consequences on the global food supply chain and future crop harvesting in South Korea. As numerous algorithms have been widely used in predicting stock market returns, we use Autoregressive Integrated Moving Average (ARIMA) model for the prediction. Using daily returns from the KOSDAQ F&B industry from January 1999 to October 2022, the study proposes an ARIMA (2,2,3) model to forecast future movements of the stock returns. With an RMSE of 0.012, the prediction performance holds good using the ARIMA model. The results show a negative trend observed in the F&B sector returns for a few months, implying that sector stock returns decline as the conflict between Russia and Ukraine becomes more pronounced. This study also suggests that South Korea has massive scope to stabilize the demand for healthy, safe food, give more attention to domestic agribusiness, and make itself a self-sufficient agri-economy.

4.
Medicine (Baltimore) ; 102(19): e33796, 2023 May 12.
Article in English | MEDLINE | ID: mdl-37171314

ABSTRACT

Deep learning is an advanced machine learning technique that is used in several medical fields to diagnose diseases and predict therapeutic outcomes. In this study, using anteroposterior ankle radiographs, we developed a convolutional neural network (CNN) model to diagnose osteochondral lesions of the talus (OLTs) using ankle radiographs as input data. We evaluated whether a CNN model trained on anteroposterior ankle radiographs could help diagnose the presence of OLT. We retrospectively collected 379 cases (OLT cases = 133, non-OLT cases = 246) of anteroposterior ankle radiographs taken at a university hospital between January 2010 and December 2020. The OLT was diagnosed using ankle magnetic resonance images of each patient. Among the 379 cases, 70% of the included data were randomly selected as the training set, 10% as the validation set, and the remaining 20% were assigned to the test set to evaluate the model performance. To accurately classify OLT and non-OLT, we cropped the area of the ankle on anteroposterior ankle radiographs, resized the image to 224 × 224, and used it as the input data. We then used the Visual Geometry Group Network model to determine whether the input image was OLT or non-OLT. The performance of the CNN model for the area under the curve, accuracy, positive predictive value, and negative predictive value on the test data were 0.774 (95% confidence interval [CI], 0.673-0.875), 81.58% (95% CI, 0.729-0.903), 80.95% (95% CI, 0.773-0.846), and 81.82% (95% CI, 0.804-0.832), respectively. A CNN model trained on anteroposterior ankle radiographs achieved meaningful accuracy in diagnosing OLT and demonstrated that it could help diagnose OLT.


Subject(s)
Talus , Humans , Talus/diagnostic imaging , Talus/pathology , Ankle , Retrospective Studies , Radiography , Neural Networks, Computer
5.
Cancers (Basel) ; 15(3)2023 Jan 22.
Article in English | MEDLINE | ID: mdl-36765642

ABSTRACT

Breast cancer is one of the most common invasive cancers in women and it continues to be a worldwide medical problem since the number of cases has significantly increased over the past decade. Breast cancer is the second leading cause of death from cancer in women. The early detection of breast cancer can save human life but the traditional approach for detecting breast cancer disease needs various laboratory tests involving medical experts. To reduce human error and speed up breast cancer detection, an automatic system is required that would perform the diagnosis accurately and timely. Despite the research efforts for automated systems for cancer detection, a wide gap exists between the desired and provided accuracy of current approaches. To overcome this issue, this research proposes an approach for breast cancer prediction by selecting the best fine needle aspiration features. To enhance the prediction accuracy, several feature selection techniques are applied to analyze their efficacy, such as principal component analysis, singular vector decomposition, and chi-square (Chi2). Extensive experiments are performed with different features and different set sizes of features to investigate the optimal feature set. Additionally, the influence of imbalanced and balanced data using the SMOTE approach is investigated. Six classifiers including random forest, support vector machine, gradient boosting machine, logistic regression, multilayer perceptron, and K-nearest neighbors (KNN) are tuned to achieve increased classification accuracy. Results indicate that KNN outperforms all other classifiers on the used dataset with 20 features using SVD and with the 15 most important features using a PCA with a 100% accuracy score.

6.
Medicine (Baltimore) ; 101(44): e31510, 2022 Nov 04.
Article in English | MEDLINE | ID: mdl-36343061

ABSTRACT

Deep learning is an advanced machine learning approach used in diverse areas such as image analysis, bioinformatics, and natural language processing. In the current study, using only one knee magnetic resonance image of each patient, we attempted to develop a convolutional neural network (CNN) to diagnose anterior cruciate ligament (ACL) tear. We retrospectively recruited 164 patients who had knee injury and underwent knee magnetic resonance imaging evaluation. Of 164 patients, 83 patients' ACLs were torn (20 patients, partial tear; 63 patients, complete tear), whereas 81 patients' ACLs were intact. We used a CNN algorithm. Of the included subjects, 79% were assigned randomly to the training set and the remaining 21% were assigned to the test set to measure the model performance. The area under the curve was 0.941 (95% CI, 0.862-1.000) for the classification of intact and tears of the ACL. We demonstrated that a CNN model trained using one knee magnetic resonance image of each patient could be helpful in diagnosing ACL tear.


Subject(s)
Anterior Cruciate Ligament Injuries , Lacerations , Humans , Anterior Cruciate Ligament/diagnostic imaging , Anterior Cruciate Ligament/pathology , Retrospective Studies , Arthroscopy , Anterior Cruciate Ligament Injuries/diagnostic imaging , Anterior Cruciate Ligament Injuries/pathology , Rupture/pathology , Magnetic Resonance Imaging/methods , Lacerations/pathology , Neural Networks, Computer
7.
Medicina (Kaunas) ; 58(11)2022 Nov 21.
Article in English | MEDLINE | ID: mdl-36422232

ABSTRACT

Background and Objectives: This study investigated the usefulness of deep neural network (DNN) models based on 18F-fluorodeoxyglucose positron emission tomography (FDG-PET) and blood inflammatory markers to assess the therapeutic response in pyogenic vertebral osteomyelitis (PVO). Materials and Methods: This was a retrospective study with prospectively collected data. Seventy-four patients diagnosed with PVO underwent clinical assessment for therapeutic responses based on clinical features during antibiotic therapy. The decisions of the clinical assessment were confirmed as 'Cured' or 'Non-cured'. FDG-PETs were conducted concomitantly regardless of the decision at each clinical assessment. We developed DNN models depending on the use of attributes, including C-reactive protein (CRP), erythrocyte sedimentation ratio (ESR), and maximum standardized FDG uptake values of PVO lesions (SUVmax), and we compared their performances to predict PVO remission. Results: The 126 decisions (80 'Cured' and 46 'Non-cured' patients) were randomly assigned with training and test sets (7:3). We trained DNN models using a training set and evaluated their performances for a test set. DNN model 1 had an accuracy of 76.3% and an area under the receiver operating characteristic curve (AUC) of 0.768 [95% confidence interval, 0.625-0.910] using CRP and ESR, and these values were 79% and 0.804 [0.674-0.933] for DNN model 2 using ESR and SUVmax, 86.8% and 0.851 [0.726-0.976] for DNN model 3 using CRP and SUVmax, and 89.5% and 0.902 [0.804-0.999] for DNN model 4 using ESR, CRP, and SUVmax, respectively. Conclusions: The DNN models using SUVmax showed better performances when predicting the remission of PVO compared to CRP and ESR. The best performance was obtained in the DNN model using all attributes, including CRP, ESR, and SUVmax, which may be helpful for predicting the accurate remission of PVO.


Subject(s)
Fluorodeoxyglucose F18 , Osteomyelitis , Humans , Retrospective Studies , Positron-Emission Tomography/methods , Osteomyelitis/diagnostic imaging , Osteomyelitis/drug therapy , Neural Networks, Computer , C-Reactive Protein
8.
Digit Health ; 8: 20552076221109530, 2022.
Article in English | MEDLINE | ID: mdl-35898288

ABSTRACT

Vaccination for the COVID-19 pandemic has raised serious concerns among the public and various rumours are spread regarding the resulting illness, adverse reactions, and death. Such rumours can damage the campaign against the COVID-19 and should be dealt with accordingly. One prospective solution is to use machine learning-based models to predict the death risk for vaccinated people by utilizing the available data. This study focuses on the prognosis of three significant events including 'not survived', 'recovered', and 'not recovered' based on the adverse events followed by the second dose of the COVID-19 vaccine. Extensive experiments are performed to analyse the efficacy of the proposed Extreme Regression- Voting Classifier model in comparison with machine learning models with Term Frequency-Inverse Document Frequency, Bag of Words, and Global Vectors, and deep learning models like Convolutional Neural Network, Long Short Term Memory, and Bidirectional Long Short Term Memory. Experiments are carried out on the original, as well as, a balanced dataset using Synthetic Minority Oversampling Approach. Results reveal that the proposed voting classifier in combination with TF-IDF outperforms with a 0.85 accuracy score on the SMOTE-balanced dataset. In line with this, the validation of the proposed voting classifier on binary classification shows state-of-the-art results with a 0.98 accuracy.

9.
Sci Rep ; 12(1): 12508, 2022 07 22.
Article in English | MEDLINE | ID: mdl-35869103

ABSTRACT

Educational Data Mining is widely used for predicting student's performance. It's a challenging task because a plethora of features related to demographics, personality traits, socio-economic, and environmental may affect students' performance. Such varying features may depend on the level of study, program offered, nature of subject, and geographical location. This study attempted to predict the final semester's results of students studying Doctor of Veterinary Medicine (DVM) based on their pre-admission academic achievements, demographics, and first semester performance. The imbalanced data led to non-generic prediction models, so it was addressed through synthetic minority oversampling technique. Among five prediction models, the Support Vector Machine led the best with 92% accuracy. The decision tree model identified key features affecting students' performance. The analysis led to the conclusion that marks obtained in Biology, Islamiat, and Urdu at Matric and English at Intermediate level affected the students' performance in their final semester. The findings provide useful information to predict students' performance and guidelines for academic institutes' management regarding improving students' achievement. It is speculated that adoption of digital transformation may help reduce difficulty faced in data collection and analysis.


Subject(s)
Academic Success , Students , Demography , Educational Status , Humans
10.
Eur Neurol ; 85(6): 460-466, 2022.
Article in English | MEDLINE | ID: mdl-35738236

ABSTRACT

BACKGROUND: Deep learning techniques can outperform traditional machine learning techniques and learn from unstructured and perceptual data, such as images and languages. We evaluated whether a convolutional neural network (CNN) model using whole axial brain T2-weighted magnetic resonance (MR) images as input data can help predict motor outcomes of the upper and lower limbs at the chronic stage in stroke patients. METHODS: We collected MR images taken at the early stage of stroke in 1,233 consecutive stroke patients. We categorized modified Brunnstrom classification (MBC) scores of ≥5 and functional ambulatory category (FAC) scores of ≥4 at 6 months after stroke as favorable outcomes in the upper and lower limbs, respectively, and MBC scores of <5 and FAC scores of <4 as poor outcomes. We applied a CNN to train the image data. Of the 1,233 patients, 70% (863 patients) were randomly selected for the training set and the remaining 30% (370 patients) were assigned to the validation set. RESULTS: In the prediction of upper limb motor function on the validation dataset, the area under the curve (AUC) was 0.768, and for lower limb motor function, the AUC was 0.828. CONCLUSION: We showed that a CNN model trained using whole-brain axial T2-weighted MR images of stroke patients would help predict upper and lower limb motor function at the chronic stage.


Subject(s)
Deep Learning , Stroke , Humans , Magnetic Resonance Imaging/methods , Algorithms , Brain/diagnostic imaging , Stroke/diagnostic imaging
11.
PLoS One ; 17(6): e0270327, 2022.
Article in English | MEDLINE | ID: mdl-35767542

ABSTRACT

COVID-19 vaccination raised serious concerns among the public and people are mind stuck by various rumors regarding the resulting illness, adverse reactions, and death. Such rumors are dangerous to the campaign against the COVID-19 and should be dealt with accordingly and timely. One prospective solution is to use machine learning-based models to predict the death risk for vaccinated people and clarify people's perceptions regarding death risk. This study focuses on the prediction of the death risks associated with vaccinated people followed by a second dose for two reasons; first to build consensus among people to get the vaccines; second, to reduce the fear regarding vaccines. Given that, this study utilizes the COVID-19 VAERS dataset that records adverse events after COVID-19 vaccination as 'recovered', 'not recovered', and 'survived'. To obtain better prediction results, a novel voting classifier extreme regression-voting classifier (ER-VC) is introduced. ER-VC ensembles extra tree classifier and logistic regression using soft voting criterion. To avoid model overfitting and get better results, two data balancing techniques synthetic minority oversampling (SMOTE) and adaptive synthetic sampling (ADASYN) have been applied. Moreover, three feature extraction techniques term frequency-inverse document frequency (TF-IDF), bag of words (BoW), and global vectors (GloVe) have been used for comparison. Both machine learning and deep learning models are deployed for experiments. Results obtained from extensive experiments reveal that the proposed model in combination with TF-TDF has shown robust results with a 0.85 accuracy when trained on the SMOTE-balanced dataset. In line with this, validation of the proposed voting classifier on binary classification shows state-of-the-art results with a 0.98 accuracy. Results show that machine learning models can predict the death risk with high accuracy and can assist the authors in taking timely measures.


Subject(s)
COVID-19 Vaccines/adverse effects , COVID-19 , Adverse Drug Reaction Reporting Systems , COVID-19/prevention & control , Humans , Politics , Prospective Studies
12.
Sci Rep ; 12(1): 7421, 2022 05 06.
Article in English | MEDLINE | ID: mdl-35523906

ABSTRACT

The molecular weight and isoelectric point of the proteins are very important parameters that control their subcellular localization and subsequent function. Although the genome sequence data of the plant kingdom improved enormously, the proteomic details have been poorly elaborated. Therefore, we have calculated the molecular weight and isoelectric point of the plant proteins and reported them in this database. A database, PlantMWpIDB, containing protein data from 342 plant proteomes was created to provide information on plant proteomes for hypothesis formulation in basic research and for biotechnological applications. The Molecular weight and isoelectric point (pI) are important molecular parameters of proteins that are useful when conducting protein studies involving 2D gel electrophoresis, liquid chromatography-mass spectrometry, and X-ray protein crystallography. PlantMWpIDB provides an easy-to-use and efficient interface for search options and generates a summary of basic protein parameters. The database represents a virtual 2D proteome map of plants, and the molecular weight and pI of a protein can be obtained by searching on the name of a protein, a keyword, or by a list of accession numbers. The PlantMWpIDB database also allows one to query protein sequences. The database can be found in the following link https://plantmwpidb.com/ . The individual 2D virtual proteome map of the plant kingdom will enable us to understand the proteome diversity between different species. Further, the molecular weight and isoelectric point of individual proteins can enable us to understand their functional significance in different species.


Subject(s)
Proteome , Proteomics , Databases, Protein , Electrophoresis, Gel, Two-Dimensional , Isoelectric Focusing , Isoelectric Point , Molecular Weight , Plants/metabolism , Proteome/metabolism , Proteomics/methods
13.
BMC Musculoskelet Disord ; 23(1): 510, 2022 May 30.
Article in English | MEDLINE | ID: mdl-35637451

ABSTRACT

BACKGROUND: Deep learning (DL) is an advanced machine learning approach used in diverse areas, such as image analysis, bioinformatics, and natural language processing. A convolutional neural network (CNN) is a representative DL model that is advantageous for image recognition and classification. In this study, we aimed to develop a CNN to detect meniscal tears and classify tear types using coronal and sagittal magnetic resonance (MR) images of each patient. METHODS: We retrospectively collected 599 cases (medial meniscus tear = 384, lateral meniscus tear = 167, and medial and lateral meniscus tear = 48) of knee MR images from patients with meniscal tears and 449 cases of knee MR images from patients without meniscal tears. To develop the DL model for evaluating the presence of meniscal tears, all the collected knee MR images of 1048 cases were used. To develop the DL model for evaluating the type of meniscal tear, 538 cases with meniscal tears (horizontal tear = 268, complex tear = 147, radial tear = 48, and longitudinal tear = 75) and 449 cases without meniscal tears were used. Additionally, a CNN algorithm was used. To measure the model's performance, 70% of the included data were randomly assigned to the training set, and the remaining 30% were assigned to the test set. RESULTS: The area under the curves (AUCs) of our model were 0.889, 0.817, and 0.924 for medial meniscal tears, lateral meniscal tears, and medial and lateral meniscal tears, respectively. The AUCs of the horizontal, complex, radial, and longitudinal tears were 0.761, 0.850, 0.601, and 0.858, respectively. CONCLUSION: Our study showed that the CNN model has the potential to be used in diagnosing the presence of meniscal tears and differentiating the types of meniscal tears.


Subject(s)
Knee Injuries , Tibial Meniscus Injuries , Humans , Knee Injuries/diagnosis , Magnetic Resonance Imaging/methods , Menisci, Tibial/pathology , Neural Networks, Computer , Retrospective Studies , Rupture/pathology , Tibial Meniscus Injuries/diagnostic imaging , Tibial Meniscus Injuries/pathology
14.
PeerJ Comput Sci ; 8: e914, 2022.
Article in English | MEDLINE | ID: mdl-35494818

ABSTRACT

The Internet Movie Database (IMDb), being one of the popular online databases for movies and personalities, provides a wide range of movie reviews from millions of users. This provides a diverse and large dataset to analyze users' sentiments about various personalities and movies. Despite being helpful to provide the critique of movies, the reviews on IMDb cannot be read as a whole and requires automated tools to provide insights on the sentiments in such reviews. This study provides the implementation of various machine learning models to measure the polarity of the sentiments presented in user reviews on the IMDb website. For this purpose, the reviews are first preprocessed to remove redundant information and noise, and then various classification models like support vector machines (SVM), Naïve Bayes classifier, random forest, and gradient boosting classifiers are used to predict the sentiment of these reviews. The objective is to find the optimal process and approach to attain the highest accuracy with the best generalization. Various feature engineering approaches such as term frequency-inverse document frequency (TF-IDF), bag of words, global vectors for word representations, and Word2Vec are applied along with the hyperparameter tuning of the classification models to enhance the classification accuracy. Experimental results indicate that the SVM obtains the highest accuracy when used with TF-IDF features and achieves an accuracy of 89.55%. The sentiment classification accuracy of the models is affected due to the contradictions in the user sentiments in the reviews and assigned labels. For tackling this issue, TextBlob is used to assign a sentiment to the dataset containing reviews before it can be used for training. Experimental results on TextBlob assigned sentiments indicate that an accuracy of 92% can be obtained using the proposed model.

15.
J Korean Med Sci ; 37(6): e42, 2022 Feb 14.
Article in English | MEDLINE | ID: mdl-35166079

ABSTRACT

BACKGROUND: Videofluoroscopic swallowing study (VFSS) is currently considered the gold standard to precisely diagnose and quantitatively investigate dysphagia. However, VFSS interpretation is complex and requires consideration of several factors. Therefore, considering the expected impact on dysphagia management, this study aimed to apply deep learning to detect the presence of penetration or aspiration in VFSS of patients with dysphagia automatically. METHODS: The VFSS data of 190 participants with dysphagia were collected. A total of 10 frame images from one swallowing process were selected (five high-peak images and five low-peak images) for the application of deep learning in a VFSS video of a patient with dysphagia. We applied a convolutional neural network (CNN) for deep learning using the Python programming language. For the classification of VFSS findings (normal swallowing, penetration, and aspiration), the classification was determined in both high-peak and low-peak images. Thereafter, the two classifications determined through high-peak and low-peak images were integrated into a final classification. RESULTS: The area under the curve (AUC) for the validation dataset of the VFSS image for the CNN model was 0.942 for normal findings, 0.878 for penetration, and 1.000 for aspiration. The macro average AUC was 0.940 and micro average AUC was 0.961. CONCLUSION: This study demonstrated that deep learning algorithms, particularly the CNN, could be applied for detecting the presence of penetration and aspiration in VFSS of patients with dysphagia.


Subject(s)
Deep Learning , Deglutition Disorders/diagnosis , Deglutition/physiology , Fluoroscopy , Video Recording , Aged , Aged, 80 and over , Female , Humans , Male , Middle Aged
16.
J Ambient Intell Humaniz Comput ; 13(1): 535-547, 2022.
Article in English | MEDLINE | ID: mdl-33527000

ABSTRACT

COVID-19 pandemic is widely spreading over the entire world and has established significant community spread. Fostering a prediction system can help prepare the officials to respond properly and quickly. Medical imaging like X-ray and computed tomography (CT) can play an important role in the early prediction of COVID-19 patients that will help the timely treatment of the patients. The x-ray images from COVID-19 patients reveal the pneumonia infections that can be used to identify the patients of COVID-19. This study presents the use of Convolutional Neural Network (CNN) that extracts the features from chest x-ray images for the prediction. Three filters are applied to get the edges from the images that help to get the desired segmented target with the infected area of the x-ray. To cope with the smaller size of the training dataset, Keras' ImageDataGenerator class is used to generate ten thousand augmented images. Classification is performed with two, three, and four classes where the four-class problem has X-ray images from COVID-19, normal people, virus pneumonia, and bacterial pneumonia. Results demonstrate that the proposed CNN model can predict COVID-19 patients with high accuracy. It can help automate screening of the patients for COVID-19 with minimal contact, especially areas where the influx of patients can not be treated by the available medical staff. The performance comparison of the proposed approach with VGG16 and AlexNet shows that classification results for two and four classes are competitive and identical for three-class classification.

17.
PeerJ Comput Sci ; 7: e645, 2021.
Article in English | MEDLINE | ID: mdl-34541306

ABSTRACT

Sarcasm emerges as a common phenomenon across social networking sites because people express their negative thoughts, hatred and opinions using positive vocabulary which makes it a challenging task to detect sarcasm. Although various studies have investigated the sarcasm detection on baseline datasets, this work is the first to detect sarcasm from a multi-domain dataset that is constructed by combining Twitter and News Headlines datasets. This study proposes a hybrid approach where the convolutional neural networks (CNN) are used for feature extraction while the long short-term memory (LSTM) is trained and tested on those features. For performance analysis, several machine learning algorithms such as random forest, support vector classifier, extra tree classifier and decision tree are used. The performance of both the proposed model and machine learning algorithms is analyzed using the term frequency-inverse document frequency, bag of words approach, and global vectors for word representations. Experimental results indicate that the proposed model surpasses the performance of the traditional machine learning algorithms with an accuracy of 91.60%. Several state-of-the-art approaches for sarcasm detection are compared with the proposed model and results suggest that the proposed model outperforms these approaches concerning the precision, recall and F1 scores. The proposed model is accurate, robust, and performs sarcasm detection on a multi-domain dataset.

18.
PeerJ Comput Sci ; 7: e547, 2021.
Article in English | MEDLINE | ID: mdl-34395856

ABSTRACT

Medical diagnosis through the classification of biomedical attributes is one of the exponentially growing fields in bioinformatics. Although a large number of approaches have been presented in the past, wide use and superior performance of the machine learning (ML) methods in medical diagnosis necessitates significant consideration for automatic diagnostic methods. This study proposes a novel approach called concatenated resampling (CR) to increase the efficacy of traditional ML algorithms. The performance is analyzed leveraging four ML approaches like tree-based ensemble approaches, and linear machine learning approach for automatic diagnosis of inter-vertebral pathologies with increased. Besides, undersampling, over-sampling, and proposed CR techniques have been applied to unbalanced training dataset to analyze the impact of these techniques on the accuracy of each of the classification model. Extensive experiments have been conducted to make comparisons among different classification models using several metrics including accuracy, precision, recall, and F 1 score. Comparative analysis has been performed on the experimental results to identify the best performing classifier along with the application of the re-sampling technique. The results show that the extra tree classifier achieves an accuracy of 0.99 in association with the proposed CR technique.

19.
Sci Rep ; 11(1): 7989, 2021 04 12.
Article in English | MEDLINE | ID: mdl-33846472

ABSTRACT

Deep learning (DL) is an advanced machine learning approach used in diverse areas such as bioinformatics, image analysis, and natural language processing. Here, using brain magnetic resonance imaging (MRI) data obtained at early stages of infarcts, we attempted to develop a convolutional neural network (CNN) to predict the ambulatory outcome of corona radiata infarction at six months after onset. We retrospectively recruited 221 patients with corona radiata infarcts. A favorable outcome of ambulatory function was defined as a functional ambulation category (FAC) score of ≥ 4 (able to walk without a guardian's assistance), and a poor outcome of ambulatory function was defined as an FAC score of < 4. We used a CNN algorithm. Of the included subjects, 69.7% (n = 154) were assigned randomly to the training set and the remaining 30.3% (n = 67) were assigned to the validation set to measure the model performance. The area under the curve was 0.751 (95% CI 0.649-0.852) for the prediction of ambulatory function with the validation dataset using the CNN model. We demonstrated that a CNN model trained using brain MRIs captured at an early stage after corona radiata infarction could be helpful in predicting long-term ambulatory outcomes.


Subject(s)
Brain Infarction/physiopathology , Deep Learning , Walking/physiology , Aged , Area Under Curve , Humans , Prognosis , ROC Curve
20.
PLoS One ; 16(2): e0245909, 2021.
Article in English | MEDLINE | ID: mdl-33630869

ABSTRACT

The spread of Covid-19 has resulted in worldwide health concerns. Social media is increasingly used to share news and opinions about it. A realistic assessment of the situation is necessary to utilize resources optimally and appropriately. In this research, we perform Covid-19 tweets sentiment analysis using a supervised machine learning approach. Identification of Covid-19 sentiments from tweets would allow informed decisions for better handling the current pandemic situation. The used dataset is extracted from Twitter using IDs as provided by the IEEE data port. Tweets are extracted by an in-house built crawler that uses the Tweepy library. The dataset is cleaned using the preprocessing techniques and sentiments are extracted using the TextBlob library. The contribution of this work is the performance evaluation of various machine learning classifiers using our proposed feature set. This set is formed by concatenating the bag-of-words and the term frequency-inverse document frequency. Tweets are classified as positive, neutral, or negative. Performance of classifiers is evaluated on the accuracy, precision, recall, and F1 score. For completeness, further investigation is made on the dataset using the Long Short-Term Memory (LSTM) architecture of the deep learning model. The results show that Extra Trees Classifiers outperform all other models by achieving a 0.93 accuracy score using our proposed concatenated features set. The LSTM achieves low accuracy as compared to machine learning classifiers. To demonstrate the effectiveness of our proposed feature set, the results are compared with the Vader sentiment analysis technique based on the GloVe feature extraction approach.


Subject(s)
COVID-19 , Social Media , Supervised Machine Learning , Deep Learning , Humans , Natural Language Processing , Pandemics , Public Opinion
SELECTION OF CITATIONS
SEARCH DETAIL
...