ABSTRACT
Currently, the number of vehicles in circulation continues to increase steadily, leading to a parallel increase in vehicular accidents. Among the many causes of these accidents, human factors such as driver drowsiness play a fundamental role. In this context, one solution to address the challenge of drowsiness detection is to anticipate drowsiness by alerting drivers in a timely and effective manner. Thus, this paper presents a Convolutional Neural Network (CNN)-based approach for drowsiness detection by analyzing the eye region and the Mouth Aspect Ratio (MAR) for yawning detection. As part of this approach, endpoint delineation is optimized for extraction of the region of interest (ROI) around the eyes. An NVIDIA Jetson Nano-based device and a near-infrared (NIR) camera are used for real-time applications. A Driver Drowsiness Artificial Intelligence (DD-AI) architecture is proposed for the eye state detection procedure. In a performance analysis, the results of the proposed approach were compared with architectures based on InceptionV3, VGG16, and ResNet50V2. The Night-Time Yawning-Microsleep-Eyeblink-Driver Distraction (NITYMED) dataset was used for training, validation, and testing of the architectures. The proposed DD-AI network achieved an accuracy of 99.88% on the NITYMED test data, proving superior to the other networks. In the hardware implementation, tests conducted in a real environment yielded an average accuracy of 96.55% at 14 fps for the DD-AI network, confirming its superior performance.
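The yawning cue above rests on the Mouth Aspect Ratio (MAR). The abstract does not give the exact landmark scheme or threshold, so the following is a hedged sketch assuming dlib-style 68-point facial landmarks (inner-mouth points 60-67); the function name and threshold logic are illustrative only:

```python
import numpy as np

def mouth_aspect_ratio(mouth):
    """MAR over dlib-style inner-mouth landmarks (points 60-67),
    passed in as an (8, 2) array of (x, y) coordinates.
    A MAR held above a tuned threshold for several consecutive
    frames is commonly treated as a yawn."""
    # vertical distances between upper and lower inner-lip points
    a = np.linalg.norm(mouth[1] - mouth[7])
    b = np.linalg.norm(mouth[2] - mouth[6])
    c = np.linalg.norm(mouth[3] - mouth[5])
    # horizontal distance between the mouth corners
    d = np.linalg.norm(mouth[0] - mouth[4])
    return (a + b + c) / (2.0 * d)
```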
Subject(s)
Automobile Driving; Neural Networks, Computer; Humans; Mouth/physiology; Eye; Sleep Stages/physiology; Sleepiness; Artificial Intelligence; Accidents, Traffic
ABSTRACT
Most natural disasters result from geodynamic events such as landslides and slope collapse. These failures cause catastrophes that directly impact the environment and cause financial and human losses. Visual inspection is the primary method for detecting failures in geotechnical structures, but on-site visits can be risky due to unstable soil. In addition, the geometry of these structures and their hostile, remote installation conditions can make monitoring them unfeasible. When a fast and secure evaluation is required, analysis by computational methods becomes feasible. In this study, a convolutional neural network (CNN) approach to computer vision is applied to identify defects in the surface of geotechnical structures, aided by unmanned aerial vehicles (UAVs) and mobile devices, aiming to reduce the reliance on human-led on-site inspections. However, computer vision algorithms remain underexplored in this field due to particularities of geotechnical engineering, such as limited public datasets and redundant images. Thus, this study obtained images of surface failure indicators from slopes near a Brazilian national road, assisted by UAV and mobile devices. We then proposed a custom, low-complexity CNN architecture to build an image-based binary classifier that detects faults in geotechnical surfaces. The model achieved a satisfactory average accuracy of 94.26%. An area under the curve (AUC) of 0.99 from the receiver operating characteristic (ROC) curve and the confusion matrix on a testing dataset confirm these results, suggesting that the model's capability to distinguish between the 'damage' and 'intact' classes is excellent and that it enables the identification of failure indicators. Early detection of failure indicators on the surface of slopes can facilitate proper maintenance and alarms and prevent disasters, as the integrity of the soil directly affects the structures built around and above it.
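The abstract does not detail the layers of the custom low-complexity CNN, so the following Keras sketch is only an assumed configuration (input size, filter counts, and dropout rate are placeholders) of an image-based binary 'damage' vs. 'intact' classifier of the kind described:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_slope_classifier(input_shape=(128, 128, 3)):
    # Small stack of conv/pool blocks ending in a single sigmoid
    # unit for the binary 'damage' vs. 'intact' decision.
    model = models.Sequential([
        layers.Input(shape=input_shape),
        layers.Conv2D(16, 3, activation="relu"),
        layers.MaxPooling2D(),
        layers.Conv2D(32, 3, activation="relu"),
        layers.MaxPooling2D(),
        layers.Conv2D(64, 3, activation="relu"),
        layers.GlobalAveragePooling2D(),
        layers.Dense(64, activation="relu"),
        layers.Dropout(0.3),
        layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy", tf.keras.metrics.AUC()])
    return model
```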
ABSTRACT
This work proposes an affordable, non-wearable system to detect falls of people in need of care. The proposal uses artificial vision based on deep learning techniques implemented on a Raspberry Pi 4 (4 GB RAM) with a High-Definition IR-CUT camera. The CNN architecture classifies detected people into five classes: fallen, crouching, sitting, standing, and lying down. When a fall is detected, the system sends an alert notification to mobile devices through the Telegram instant messaging platform. The system was evaluated on real daily indoor activities under different conditions: outfit, lighting, and distance from the camera. Results show a good trade-off between performance and cost of the system. The obtained performance metrics are: precision of 96.4%, specificity of 96.6%, accuracy of 94.8%, and sensitivity of 93.1%. Regarding privacy concerns, even though this system uses a camera, the video is not recorded or monitored by anyone, and pictures are only sent in case of fall detection. This work can contribute to reducing the fatal consequences of falls in people in need of care by providing them with prompt attention. Such a low-cost solution would be desirable, particularly in developing countries with limited or no medical alert systems and few resources.
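The alert step maps naturally onto the Telegram Bot API's sendPhoto endpoint. A minimal sketch, assuming a bot token and a caregiver chat ID (both placeholders here) registered beforehand via Telegram's @BotFather:

```python
import requests

BOT_TOKEN = "123456:ABC-DEF..."  # hypothetical token from @BotFather
CHAT_ID = "987654321"            # hypothetical caregiver chat id

def send_fall_alert(image_path):
    """Push a fall alert, with the triggering frame attached, via
    the Telegram Bot API; pictures are sent only on fall detection."""
    url = f"https://api.telegram.org/bot{BOT_TOKEN}/sendPhoto"
    with open(image_path, "rb") as img:
        resp = requests.post(
            url,
            data={"chat_id": CHAT_ID, "caption": "Fall detected!"},
            files={"photo": img},
            timeout=10,
        )
    resp.raise_for_status()  # surface delivery failures
```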
Subject(s)
Accidental Falls; Humans; Accidental Falls/prevention & control; Deep Learning; Computers; Algorithms
ABSTRACT
This work proposes an intrinsically explainable, straightforward method to decode P300 waveforms from electroencephalography (EEG) signals, overcoming the black-box nature of deep learning techniques. The proposed method lets convolutional neural networks decode information from images, an area where they have achieved astonishing performance. Plotting the EEG signal as an image allows it to be both visually interpreted by physicians and technicians and detected by the network, offering a straightforward way of explaining the decision. The identification of this pattern is used to implement a P300-based speller device, which can serve as an alternative communication channel for persons affected by amyotrophic lateral sclerosis (ALS). The method is validated by performing a brain-computer interface simulation on a public dataset from ALS patients. Letter identification rates from the speller on the dataset show that this method can identify the P300 signature across the set of 8 patients. The proposed approach achieves performance similar to other state-of-the-art proposals while providing clinically relevant explainability (XAI).
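A minimal sketch of the central idea: rendering one post-stimulus EEG epoch as the same picture a clinician would inspect, which a CNN can then classify as P300 or non-P300. The figure size, line style, and normalization here are assumptions, not the paper's settings:

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # headless rendering
import matplotlib.pyplot as plt

def epoch_to_image(epoch, fs=256, size=(64, 64)):
    """Render one EEG epoch (channels x samples) as a grayscale
    image array, so the CNN input stays visually interpretable."""
    fig = plt.figure(figsize=(size[1] / 32, size[0] / 32), dpi=32)
    ax = fig.add_axes([0, 0, 1, 1])
    ax.axis("off")
    t = np.arange(epoch.shape[1]) / fs
    for ch, sig in enumerate(epoch):  # stack channels vertically
        ax.plot(t, sig + ch * np.ptp(epoch), "k", linewidth=0.5)
    fig.canvas.draw()
    img = np.asarray(fig.canvas.buffer_rgba())[..., :3].mean(-1)
    plt.close(fig)
    return img / 255.0  # normalized grayscale array
```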
ABSTRACT
Stroke is a neurological condition that usually results in the loss of voluntary control of body movements, making it difficult for individuals to perform activities of daily living (ADLs). Brain-computer interfaces (BCIs) integrated into robotic systems, such as motorized mini exercise bikes (MMEBs), have been demonstrated to be suitable for restoring gait-related functions. However, kinematic estimation of continuous motion in BCI systems based on electroencephalography (EEG) remains a challenge for the scientific community. This study proposes a comparative analysis to evaluate two artificial neural network (ANN)-based decoders that estimate three lower-limb kinematic parameters: the x- and y-axis position of the ankle and the knee joint angle during pedaling tasks. A long short-term memory (LSTM) network was used as the recurrent neural network (RNN), reaching Pearson correlation coefficient (PCC) scores close to 0.58 by reconstructing the kinematic parameters from delta-band EEG features using a time window of 250 ms. These estimates were evaluated through kinematic variance analysis, where our proposed algorithm showed promising results for identifying pedaling and rest periods, which could increase the usability of classification tasks. Additionally, negative linear correlations were found between pedaling speed and decoder performance, indicating that kinematic parameters may be easier to estimate at slower speeds. The results support the conclusion that deep learning (DL)-based methods are feasible for estimating lower-limb kinematic parameters during pedaling tasks from EEG signals. This study opens new possibilities for implementing more robust controllers for MMEBs and BCIs based on continuous decoding, which may allow for maximizing the degrees of freedom and personalizing rehabilitation.
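A hedged Keras sketch of such an LSTM decoder; the 250 ms window is taken from the abstract, while the feature count, implied sampling rate, and layer widths are assumptions:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

# Assumed shapes: 250 ms windows of delta-band EEG features
# (e.g. 32 channels sampled at 100 Hz -> 25 time steps) mapped to
# three kinematic outputs: ankle x, ankle y, knee joint angle.
def build_kinematic_decoder(n_steps=25, n_feats=32):
    model = models.Sequential([
        layers.Input(shape=(n_steps, n_feats)),
        layers.LSTM(64),
        layers.Dense(32, activation="relu"),
        layers.Dense(3, activation="linear"),  # regression outputs
    ])
    model.compile(optimizer="adam", loss="mse")
    return model
```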
ABSTRACT
In the present investigation, we employ a novel, meticulously structured database assembled by experts, encompassing macrofungi field-collected in Brazil and featuring 13,894 photographs of 505 distinct species. The purpose of this database is twofold: first, to provide training and validation data for convolutional neural networks (CNNs) capable of autonomously identifying macrofungal species; second, to develop a sophisticated mobile application with an advanced user interface. This interface is specifically crafted to acquire images and, using the image recognition capabilities of the trained CNN, offer candidate identifications for the macrofungal species depicted. Such technological advancements democratize access to the Brazilian Funga, enhancing public engagement and knowledge dissemination, and facilitating contributions from the populace to the expanding body of knowledge concerning the conservation of the macrofungal species of Brazil.
Subject(s)
Deep Learning; Fungi; Brazil; Fungi/classification; Fungi/isolation & purification; Biodiversity; Neural Networks, Computer; Databases, Factual
ABSTRACT
This work addresses the challenge of classifying multiclass visual EEG signals into 40 classes for brain-computer interface (BCI) applications using deep learning architectures. The visual multiclass classification approach offers BCI applications a significant advantage, since it allows more than one BCI interaction to be supervised, with each class label supervising a BCI task. However, because of the nonlinearity and nonstationarity of EEG signals, multiclass classification based on EEG features remains a significant challenge for BCI systems. In the present work, mutual information-based discriminant channel selection and minimum-norm estimate algorithms were implemented to select discriminant channels and enhance the EEG data. Deep EEGNet and convolutional recurrent neural networks were then separately implemented to classify the EEG data for image visualization into 40 labels. Using the k-fold cross-validation approach, average classification accuracies of 94.8% and 89.8% were obtained with the two network architectures, respectively. These satisfactory results open a new implementation opportunity for multitask embedded BCI applications utilizing a reduced number of both channels (<50%) and network parameters (<110 K).
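One plausible reading of the mutual information-based channel selection step; the per-channel feature used for the MI ranking is not specified in the abstract, so within-trial variance serves here as an assumed stand-in:

```python
import numpy as np
from sklearn.feature_selection import mutual_info_classif

def select_channels(X, y, keep_ratio=0.5):
    """Rank EEG channels by the mutual information between a
    per-channel summary feature and the 40 class labels, then
    keep the top fraction (<50% of channels, as in the paper).
    X: trials x channels x samples; y: trial labels."""
    feats = X.var(axis=2)  # assumed feature: within-trial variance
    mi = mutual_info_classif(feats, y, random_state=0)
    n_keep = max(1, int(keep_ratio * X.shape[1]))
    keep = np.argsort(mi)[::-1][:n_keep]  # most informative first
    return np.sort(keep)
```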
Subject(s)
Algorithms; Brain-Computer Interfaces; Deep Learning; Electroencephalography; Neural Networks, Computer; Electroencephalography/methods; Humans; Signal Processing, Computer-Assisted
ABSTRACT
Around 70 million people worldwide are affected by epilepsy, a neurological disorder characterized by non-induced seizures that occur at irregular and unpredictable intervals. During an epileptic seizure, transient symptoms emerge as a result of extreme abnormal neural activity. Epilepsy imposes limitations on individuals and has a significant impact on the lives of their families. Therefore, the development of reliable diagnostic tools for the early detection of this condition is considered beneficial to alleviate the social and emotional distress experienced by patients. While the Bonn University dataset contains five collections of EEG data, not many studies specifically focus on subsets D and E, which correspond to EEG recordings from the epileptogenic zone during ictal and interictal events. In this work, the parallel ictal-net (PIN) neural network architecture is introduced, which utilizes scalograms obtained through a continuous wavelet transform to achieve high-accuracy classification of EEG signals into ictal or interictal states. The results demonstrate the effectiveness of the proposed PIN model in distinguishing between ictal and interictal events with a high degree of confidence: the computed accuracy, precision, recall, and F1 scores all consistently reach around 99%, surpassing previous approaches in the related literature.
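The scalogram front end can be sketched with PyWavelets' continuous wavelet transform; the Morlet wavelet and 64 scales are assumptions, while 173.61 Hz is the sampling rate of the Bonn University recordings:

```python
import numpy as np
import pywt

def eeg_to_scalogram(signal, fs=173.61, wavelet="morl", n_scales=64):
    """Continuous wavelet transform of a Bonn EEG segment into a
    scalogram (scales x time) suitable as 2-D CNN input."""
    scales = np.arange(1, n_scales + 1)
    coefs, _ = pywt.cwt(signal, scales, wavelet,
                        sampling_period=1.0 / fs)
    scalogram = np.abs(coefs)
    # per-image min-max normalization before feeding the network
    rng = np.ptp(scalogram)
    return (scalogram - scalogram.min()) / (rng + 1e-12)
```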
Subject(s)
Electroencephalography; Epilepsy; Humans; Electroencephalography/methods; Seizures/diagnosis; Epilepsy/diagnosis; Neural Networks, Computer; Wavelet Analysis
ABSTRACT
Multivariate time series forecasting plays an important role in many real-world domains, and price prediction in particular has long been a focus of researchers. Yet it is a challenging task that requires capturing intra-series and inter-series correlations. Most models in the literature focus only on correlations in the temporal domain. In this paper, we curated a new dataset from the official website of the Turkish Ministry of Commerce. The dataset consists of daily prices and trade volumes of vegetables and covers 1,791 days between January 1, 2018 and November 26, 2022. A Spectral Temporal Graph Neural Network (StemGNN) is employed on the curated dataset, and the results are compared with convolutional neural network (CNN), long short-term memory (LSTM), and Random Forest models. The GNN architecture achieved state-of-the-art results (mean absolute error (MAE): 1.37; root mean squared error (RMSE): 1.94). To our knowledge, this is one of the few studies that investigates GNNs for time series analysis and the first such study in this field.
Subject(s)
Time Factors; Commerce; Agriculture/economics
ABSTRACT
Detecting people in images and videos captured from an aerial platform over wooded areas for search and rescue operations remains an open problem. Detection is difficult due to the relatively small size of the person captured by the sensor in relation to the environment, and the environment itself can generate occlusion, complicating timely detection. Numerous RGB image datasets are currently available for person detection tasks in urban and wooded areas; they consider general characteristics of a person, such as size, shape, and height, but not the occlusion of the object of interest. The present work focuses on developing a thermal image dataset that accounts for occlusion, and on using it to train convolutional neural network (CNN) deep learning models that perform real-time detection from an aerial perspective with altitude control on a quadcopter prototype. Extended models that account for person occlusion are proposed, in conjunction with a thermal sensor, which highlights the desired characteristics of the occluded person.
ABSTRACT
In recent years, the application of artificial intelligence (AI) in the automotive industry has led to intelligent systems focused on road safety, aiming to improve protection for drivers and pedestrians worldwide and reduce the number of accidents each year. One of the most critical functions of these systems is pedestrian detection, as it is crucial for the safety of everyone involved in road traffic. However, pedestrian detection goes beyond the front of the vehicle; the vehicle's rear is also essential to consider, since pedestrian collisions also occur when the car is in reverse. To contribute to a solution to this problem, this research proposes a model based on convolutional neural networks (CNNs), using a proposed one-dimensional architecture together with the Inception V3 architecture, that fuses information from the backup camera with distances measured by ultrasonic sensors to detect pedestrians while the vehicle is reversing. In addition, dedicated data collection was performed to build a database for the research. The proposed model showed outstanding results, with 99.85% accuracy and 99.86% correct classification performance, demonstrating that pedestrian detection with a CNN that fuses two types of data is achievable.
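A hedged Keras sketch of the kind of late fusion described, concatenating Inception V3 image features with ultrasonic distance readings before the pedestrian decision; the number of sensors and the dense-layer sizes are assumptions:

```python
import tensorflow as tf
from tensorflow.keras import layers, Model

def build_fusion_model(img_shape=(224, 224, 3), n_sensors=4):
    # Image branch: backup-camera frame through Inception V3.
    img_in = layers.Input(shape=img_shape, name="backup_camera")
    base = tf.keras.applications.InceptionV3(
        include_top=False, weights="imagenet", pooling="avg")
    img_feat = base(img_in)

    # Distance branch: ultrasonic readings (assumed 4 sensors).
    dist_in = layers.Input(shape=(n_sensors,), name="ultrasonic_cm")
    dist_feat = layers.Dense(16, activation="relu")(dist_in)

    # Late fusion: concatenate both feature vectors, then decide.
    x = layers.concatenate([img_feat, dist_feat])
    x = layers.Dense(64, activation="relu")(x)
    out = layers.Dense(1, activation="sigmoid", name="pedestrian")(x)

    model = Model([img_in, dist_in], out)
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model
```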
ABSTRACT
Even with over 80% of the population vaccinated against COVID-19, the disease continues to claim victims. It is therefore crucial to have a reliable computer-aided diagnostic system that can assist in identifying COVID-19 and determining the necessary level of care, especially in the intensive care unit, where monitoring disease progression or regression matters in the fight against this epidemic. To accomplish this, we merged public datasets from the literature to train lung and lesion segmentation models with five different distributions. We then trained eight CNN models for COVID-19 and community-acquired pneumonia classification. If an examination was classified as COVID-19, we quantified the lesions and assessed the severity of the full CT scan. To validate the system, we used ResNeXt-101 U-Net++ and MobileNet U-Net for lung and lesion segmentation, respectively, achieving an accuracy of 98.05%, an F1-score of 98.70%, a precision of 98.7%, a recall of 98.7%, and a specificity of 96.05%, in just 19.70 s per full CT scan, with external validation on the SPGC dataset. Finally, when classifying the detected lesions, we used DenseNet-201 and achieved an accuracy of 90.47%, an F1-score of 93.85%, a precision of 88.42%, a recall of 100.0%, and a specificity of 65.07%. The results demonstrate that our pipeline can correctly detect and segment lesions due to COVID-19 and community-acquired pneumonia in CT scans and can differentiate these two classes from normal exams, indicating that the system is efficient and effective at identifying the disease and assessing the severity of the condition.
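The severity-quantification step can be illustrated with a simple proxy, the fraction of segmented lung volume occupied by lesions; the paper's exact severity score is not given in the abstract, so this is an assumed formulation:

```python
import numpy as np

def lesion_severity(lung_mask, lesion_mask):
    """Percentage of segmented lung volume occupied by lesions,
    a simple severity proxy over the full CT scan.
    Both masks are boolean arrays of shape (slices, H, W)."""
    lung_vox = np.count_nonzero(lung_mask)
    lesion_vox = np.count_nonzero(lesion_mask & lung_mask)
    return 100.0 * lesion_vox / max(lung_vox, 1)
```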
ABSTRACT
Monkeypox is a rare disease caused by the monkeypox virus. The disease was considered eradicated in 1980 and was believed to affect rodents rather than humans. However, recent years have seen a massive outbreak of monkeypox in humans, setting off worldwide alerts from health agencies. As of September 2022, the number of confirmed cases in Peru had reached 1,964. Although most monkeypox patients have been discharged, monitoring the population's response to the monkeypox virus cannot be neglected. Lately, people have expressed their feelings and opinions through social media, specifically Twitter, which is the most used social medium and an ideal space to gather what people think about the monkeypox virus. The information shared through this medium can come in different formats, such as text, videos, images, and audio. The objective of this work is to analyze the positive, negative, and neutral sentiments of people who publish their opinions on Twitter with the hashtag #Monkeypox. To find out what people think about this disease, a hybrid CNN-LSTM model architecture was used, and its prediction accuracy was evaluated. The model achieved 83% accuracy on the full monkeypox dataset. Other performance metrics were also used to evaluate the model: specificity, recall, and F1 score reached 99%, 85%, and 88%, respectively. The results also showed the polarity of sentiments through the CNN-LSTM confusion matrix, where 45.42% of people expressed neither positive nor negative opinions, while 19.45% expressed negative and fearful feelings about this infectious disease. The results of this work contribute to raising public awareness about the monkeypox virus.
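A hedged Keras sketch of a hybrid CNN-LSTM tweet classifier of the kind described; the vocabulary size, sequence length, and layer widths are assumptions:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_cnn_lstm(vocab_size=20000, seq_len=60):
    # Convolutions pick up local n-gram features, the LSTM models
    # their order, and the softmax outputs the three polarities
    # (positive / neutral / negative).
    model = models.Sequential([
        layers.Input(shape=(seq_len,)),
        layers.Embedding(vocab_size, 128),
        layers.Conv1D(64, 5, activation="relu"),
        layers.MaxPooling1D(2),
        layers.LSTM(64),
        layers.Dense(3, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model
```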
ABSTRACT
Falling events are a global health concern with short- and long-term physical and psychological implications, especially for the elderly population. This work aims to monitor human activity in an indoor environment and recognize falling events without requiring users to carry a device or sensor on their bodies. A sensing platform based on the transmission of a continuous wave (CW) radio-frequency (RF) probe signal was developed using general-purpose equipment. The CW probe signal is similar to the pilot subcarriers transmitted by commercial off-the-shelf WiFi devices; as a result, our methodology can easily be integrated into a joint radio sensing and communication scheme. The sensing process is carried out by analyzing the changes in phase, amplitude, and frequency that the probe signal undergoes when it is reflected or scattered by static and moving bodies. These features are commonly extracted from the channel state information (CSI) of WiFi signals; however, CSI relies on complex data acquisition and channel estimation processes. Doppler radars have also been used to monitor human activity, but while effective, a radar-based fall detection system requires dedicated hardware. In this paper, we follow an alternative method that characterizes falling events on the basis of the Doppler signatures imprinted on the CW probe signal by a falling person. A multi-class deep learning framework for classification was conceived to differentiate falling events from other activities that can be performed in indoor environments. Two neural network models were implemented: the first is based on a long short-term memory (LSTM) network and the second on a convolutional neural network (CNN). A series of experiments comprising 11 subjects was conducted to collect empirical data and test the system's performance. Falls were detected with an accuracy of 92.1% for the LSTM model, and the CNN likewise reached an accuracy of 92.1%. The results demonstrate the viability of human fall detection based on a radio sensing system such as the one described in this paper.
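The Doppler signatures can be made visible with a standard time-frequency analysis of the received probe signal; a SciPy sketch, with the sampling rate and window length as assumptions:

```python
import numpy as np
from scipy.signal import spectrogram

def doppler_signature(iq, fs=1000, nperseg=256):
    """Time-frequency map of the received CW probe signal.
    A falling body imprints a characteristic short, broadband
    Doppler sweep that the LSTM/CNN classifiers learn to spot.
    iq: complex baseband samples; fs: sampling rate (assumed)."""
    f, t, sxx = spectrogram(iq, fs=fs, nperseg=nperseg,
                            noverlap=nperseg // 2,
                            return_onesided=False)
    sxx = np.fft.fftshift(sxx, axes=0)  # center 0 Hz Doppler
    f = np.fft.fftshift(f)
    return f, t, 10 * np.log10(sxx + 1e-12)  # dB scale
```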
Subject(s)
Deep Learning; Humans; Aged; Neural Networks, Computer; Radar; Human Activities
ABSTRACT
Background: Glaucoma and diabetic retinopathy (DR) are the leading causes of irreversible retinal damage leading to blindness. Early detection of these diseases through regular screening is especially important to prevent progression. Retinal fundus imaging serves as the principal method for diagnosing glaucoma and DR. Consequently, automated detection of eye diseases represents a significant application of retinal image analysis. Compared with classical diagnostic techniques, image classification by convolutional neural networks (CNNs) exhibits potential for effective eye disease detection. Methods: This paper proposes the use of a MATLAB-retrained AlexNet CNN for computerized eye disease identification, particularly glaucoma and diabetic retinopathy, employing retinal fundus images. The database was assembled from freely accessible sources and sources available upon request. A transfer learning technique was employed to retrain the AlexNet CNN for non-disease (Non_D), glaucoma (Sus_G), and diabetic retinopathy (Sus_R) classification. Moreover, model benchmarking was conducted using the ResNet50 and GoogLeNet architectures, and a Grad-CAM analysis is incorporated for each eye condition examined. Results: Metrics for validation accuracy, false positives, false negatives, precision, and recall were reported. Validation accuracies for the NetTransfer (I-V) and netAlexNet models ranged from 89.7% to 94.3%, demonstrating varied effectiveness in identifying the Non_D, Sus_G, and Sus_R categories, with netAlexNet achieving 93.2% accuracy in the benchmark against netResNet50 at 93.8% and netGoogLeNet at 90.4%. Conclusions: This study demonstrates the efficacy of a MATLAB-retrained AlexNet CNN for detecting glaucoma and diabetic retinopathy. It emphasizes the need for automated early detection tools, proposing CNNs as accessible solutions without replacing existing technologies.
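The paper retrains AlexNet in MATLAB; as an equivalent illustration in Python, a torchvision transfer-learning sketch that swaps the final layer for the three classes (Non_D, Sus_G, Sus_R) might look like this:

```python
import torch.nn as nn
from torchvision import models

def build_retrained_alexnet(n_classes=3, freeze_features=True):
    """Transfer learning on AlexNet: keep the ImageNet-pretrained
    convolutional filters and retrain only the final classifier
    layer for the three fundus-image categories."""
    net = models.alexnet(weights=models.AlexNet_Weights.DEFAULT)
    if freeze_features:
        for p in net.features.parameters():
            p.requires_grad = False  # keep pretrained filters fixed
    # Replace the last fully connected layer (4096 -> 3 classes).
    net.classifier[6] = nn.Linear(4096, n_classes)
    return net
```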
Subject(s)
Diabetic Retinopathy; Glaucoma; Neural Networks, Computer; Humans; Diabetic Retinopathy/diagnostic imaging; Diabetic Retinopathy/diagnosis; Glaucoma/diagnosis; Glaucoma/diagnostic imaging; Artificial Intelligence
ABSTRACT
Glaucoma is an eye disease that gradually deteriorates vision. Much research focuses on extracting information from the optic disc and optic cup, the structures used for measuring the cup-to-disc ratio. These structures are commonly segmented with deep learning techniques, primarily Encoder-Decoder models, which are hard to train and time-consuming. Object detection models using convolutional neural networks can extract features from retinal fundus images with good precision; however, the superiority of one model over another for a specific task is still being determined. The main goal of our approach is to compare the performance of object detection models in automatically segmenting cups and discs in fundus images. This study brings the novelty of examining the behavior of different object detection models in detecting and segmenting the optic disc and optic cup (Mask R-CNN, MS R-CNN, CARAFE, Cascade Mask R-CNN, GCNet, SOLO, Point_Rend), evaluated on the Retinal Fundus Images for Glaucoma Analysis (REFUGE) and G1020 datasets. The reported metrics were Average Precision (AP), F1-score, IoU, and AUCPR. Several models achieved the highest AP with a perfect 1.000 when the IoU threshold was set at 0.50 on REFUGE, with the lowest being Cascade Mask R-CNN at an AP of 0.997. On the G1020 dataset, the best model was Point_Rend with an AP of 0.956, and the worst was SOLO with 0.906. We conclude that the methods reviewed achieve excellent performance, with high precision and recall values demonstrating their efficiency and effectiveness. The question of how many images are needed was addressed with an initial value of 100, with excellent results. Data augmentation, multi-scale handling, and anchor box size brought improvements, and the capability to transfer knowledge from one database to another also shows promising results.
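The AP figures above hinge on an IoU threshold of 0.50; for reference, mask IoU between a predicted and a ground-truth segmentation reduces to:

```python
import numpy as np

def mask_iou(pred, target):
    """Intersection over Union between two boolean masks, the
    threshold metric (IoU >= 0.50) behind the reported AP values."""
    pred, target = pred.astype(bool), target.astype(bool)
    inter = np.logical_and(pred, target).sum()
    union = np.logical_or(pred, target).sum()
    return inter / union if union else 0.0
```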
ABSTRACT
Osteoarthritis (OA) affects nearly 240 million people worldwide. Knee OA is the most common type of arthritis, especially in older adults. Physicians measure the severity of knee OA according to the Kellgren and Lawrence (KL) scale through visual inspection of X-ray or MR images. We propose a semi-automatic CADx model based on Deep Siamese convolutional neural networks and a fine-tuned ResNet-34 to simultaneously detect OA lesions in both knees according to the KL scale. Training was done using a public dataset, whereas validation was performed with a private dataset. Problems arising from the imbalanced dataset were mitigated using transfer learning. The model's average multi-class accuracy is 61%, with better performance in classifying classes KL-0, KL-3, and KL-4 than KL-1 and KL-2. The classification results were compared with and validated against the classifications of experienced radiologists.
ABSTRACT
We conducted a systematic review and meta-analysis of the diagnostic performance of current deep learning (DL) algorithms for the diagnosis of lung cancer. We searched major databases up to June 2022 for studies that used artificial intelligence to diagnose lung cancer, using the histopathological analysis of true positive cases as a reference. The quality of the included studies was assessed independently by two authors based on the revised Quality Assessment of Diagnostic Accuracy Studies. Six studies were included in the analysis. The pooled sensitivity and specificity were 0.93 (95% CI 0.85–0.98) and 0.68 (95% CI 0.49–0.84), respectively. Despite the significantly high heterogeneity for sensitivity (I2 = 94%, p < 0.01) and specificity (I2 = 99%, p < 0.01), most of it was attributed to the threshold effect. The pooled summary receiver operating characteristic (SROC) curve with a bivariate approach yielded an area under the curve (AUC) of 0.90 (95% CI 0.86–0.92). The diagnostic odds ratio (DOR) for the studies was 26.7 (95% CI 19.7–36.2), with a heterogeneity of 3% (p = 0.40). In this systematic review and meta-analysis, we found that, when using the summary point from the SROC curve, the pooled sensitivity and specificity of DL algorithms for the diagnosis of lung cancer were 93% and 68%, respectively.
ABSTRACT
COVID-19, the illness caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), a virus of the family Coronaviridae with a single-stranded positive-sense RNA genome, has been spreading around the world and has been declared a pandemic by the World Health Organization. As of 17 January 2022, there were more than 329 million cases and more than 5.5 million deaths. Although COVID-19 has a low mortality rate, its high capacity for contagion, spread, and mutation worries the authorities, especially after the emergence of the Omicron variant, which transmits readily and can more easily infect even vaccinated people. Such outbreaks require elucidating the taxonomic classification and origin of the virus (SARS-CoV-2) from its genomic sequence for strategic planning, containment, and treatment of the disease. Thus, this work proposes a high-accuracy technique to classify viruses and other organisms from a genome sequence using a deep learning convolutional neural network (CNN). Unlike other work in the literature, the proposed approach does not limit the length of the genome sequence. The results show that the proposal accurately distinguishes SARS-CoV-2 from the sequences of other viruses. The results were obtained from 1,557 instances of SARS-CoV-2 from the National Center for Biotechnology Information (NCBI) and 14,684 different viruses from the Virus-Host DB. Since a CNN has several tunable parameters, the tests were performed with forty-eight different architectures; the best of these achieved an accuracy of 91.94 ± 2.62% in classifying viruses into their correct realms, in addition to 100% accuracy in classifying SARS-CoV-2 into its respective realm, Riboviria. For the subsequent classification levels (family, genus, and subgenus), this accuracy increased, which shows that the proposed architecture may be viable for classifying the virus that causes COVID-19.
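A length-agnostic CNN input can be produced by one-hot encoding the genome sequence, in line with the paper's unrestricted sequence length; the handling of ambiguous IUPAC symbols here is an assumption:

```python
import numpy as np

BASES = {"A": 0, "C": 1, "G": 2, "T": 3}

def one_hot_genome(seq):
    """One-hot encode a genome sequence of any length into a
    (len, 4) array; ambiguous IUPAC symbols such as N become
    all-zero rows (assumed convention)."""
    x = np.zeros((len(seq), 4), dtype=np.float32)
    for i, base in enumerate(seq.upper()):
        j = BASES.get(base)
        if j is not None:
            x[i, j] = 1.0
    return x
```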
Subject(s)
COVID-19; SARS-CoV-2; Humans; Neural Networks, Computer; Pandemics; SARS-CoV-2/genetics
ABSTRACT
Sweetener type can influence the sensory properties and consumers' acceptance and preference for low-calorie products. An ideal sweetener does not exist, and each sweetener must be used in the situations to which it is best suited. Aspartame and sucralose can be good substitutes for sucrose in passion fruit juice. Despite the interest in artificial sweeteners, little is known about how they are processed in the human brain. Here, we applied a convolutional neural network (CNN) to evaluate the brain signals of 11 healthy subjects while they tasted passion fruit juice equivalently sweetened with sucrose (9.4 g/100 g), sucralose (0.01593 g/100 g), or aspartame (0.05477 g/100 g). Electroencephalograms were recorded at two sites over the gustatory cortex (i.e., C3 and C4). Data with artifacts were disregarded, and the artifact-free data were used to feed a deep neural network with three branches that applied convolutions and pooling for different feature filtering and selection. The CNN received the raw signal as input for multiclass classification and, with supervised training, was able to extract underlying features and patterns from the signal with better performance than handcrafted filters such as the FFT. Our results indicate that CNNs are a useful tool for electroencephalography (EEG) analysis and the classification of perceptually similar tastes.
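A hedged Keras sketch of a raw-signal, multi-branch 1-D CNN echoing the described convolution-and-pooling branches; the number of branches, kernel sizes, and window length are assumptions:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_taste_cnn(n_samples=512, n_channels=2):
    # Two EEG channels (C3 and C4); three parallel branches with
    # different kernel sizes act as different temporal filters.
    inp = layers.Input(shape=(n_samples, n_channels))
    branches = []
    for k in (8, 16, 32):
        b = layers.Conv1D(16, k, padding="same",
                          activation="relu")(inp)
        b = layers.MaxPooling1D(4)(b)
        b = layers.GlobalAveragePooling1D()(b)
        branches.append(b)
    x = layers.concatenate(branches)
    # Softmax over the three tastes: sucrose/sucralose/aspartame.
    out = layers.Dense(3, activation="softmax")(x)
    model = models.Model(inp, out)
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model
```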