RESUMO
Hope is a vital coping mechanism, enabling individuals to effectively confront life's challenges. This study proposes a technique employing Natural Language Processing (NLP) tools like Linguistic Inquiry and Word Count (LIWC), NRC-emotion-lexicon, and vaderSentiment to analyze social media posts, extracting psycholinguistic, emotional, and sentimental features from a hope speech dataset. The findings of this study reveal distinct cognitive, emotional, and communicative characteristics and psycholinguistic dimensions, emotions, and sentiments associated with different types of hope shared in social media. Furthermore, the study investigates the potential of leveraging this data to classify different types of hope using machine learning algorithms. Notably, models such as LightGBM and CatBoost demonstrate impressive performance, surpassing traditional methods and competing effectively with deep learning techniques. We employed hyperparameter tuning to optimize the models' parameters and compared their performance using both default and tuned settings. The results highlight the enhanced efficiency achieved through hyperparameter tuning for these models.
Assuntos
Emoções , Processamento de Linguagem Natural , Psicolinguística , Mídias Sociais , Fala , Humanos , Emoções/fisiologia , Psicolinguística/métodos , Esperança , Aprendizado de Máquina , Algoritmos , Aprendizado ProfundoRESUMO
Protamines play a critical role in DNA compaction and stabilization in sperm cells, significantly influencing male fertility and various biotechnological applications. Traditionally, identifying these proteins is a challenging and time-consuming process due to their species-specific variability and complexity. Leveraging advancements in computational biology, we present PROTA, a novel tool that combines machine learning (ML) and deep learning (DL) techniques to predict protamines with high accuracy. For the first time, we integrate Generative Adversarial Networks (GANs) with supervised learning methods to enhance the accuracy and generalizability of protamine prediction. Our methodology evaluated multiple ML models, including Light Gradient-Boosting Machine (LIGHTGBM), Multilayer Perceptron (MLP), Random Forest (RF), eXtreme Gradient Boosting (XGBOOST), k-Nearest Neighbors (KNN), Logistic Regression (LR), Naive Bayes (NB), and Radial Basis Function-Support Vector Machine (RBF-SVM). During ten-fold cross-validation on our training dataset, the MLP model with GAN-augmented data demonstrated superior performance metrics: 0.997 accuracy, 0.997 F1 score, 0.998 precision, 0.997 sensitivity, and 1.0 AUC. In the independent testing phase, this model achieved 0.999 accuracy, 0.999 F1 score, 1.0 precision, 0.999 sensitivity, and 1.0 AUC. These results establish PROTA, accessible via a user-friendly web application. We anticipate that PROTA will be a crucial resource for researchers, enabling the rapid and reliable prediction of protamines, thereby advancing our understanding of their roles in reproductive biology, biotechnology, and medicine.
Assuntos
Aprendizado Profundo , Aprendizado de Máquina , Protaminas , Protaminas/metabolismo , Biologia Computacional/métodos , Máquina de Vetores de Suporte , Humanos , SoftwareRESUMO
Deep learning methods, trained on the increasing set of available protein 3D structures and sequences, have substantially impacted the protein modeling and design field. These advancements have facilitated the creation of novel proteins, or the optimization of existing ones designed for specific functions, such as binding a target protein. Despite the demonstrated potential of such approaches in designing general protein binders, their application in designing immunotherapeutics remains relatively underexplored. A relevant application is the design of T cell receptors (TCRs). Given the crucial role of T cells in mediating immune responses, redirecting these cells to tumor or infected target cells through the engineering of TCRs has shown promising results in treating diseases, especially cancer. However, the computational design of TCR interactions presents challenges for current physics-based methods, particularly due to the unique natural characteristics of these interfaces, such as low affinity and cross-reactivity. For this reason, in this study, we explored the potential of two structure-based deep learning protein design methods, ProteinMPNN and ESM-IF1, in designing fixed-backbone TCRs for binding target antigenic peptides presented by the MHC through different design scenarios. To evaluate TCR designs, we employed a comprehensive set of sequence- and structure-based metrics, highlighting the benefits of these methods in comparison to classical physics-based design methods and identifying deficiencies for improvement.
Assuntos
Biologia Computacional , Aprendizado Profundo , Receptores de Antígenos de Linfócitos T , Receptores de Antígenos de Linfócitos T/imunologia , Receptores de Antígenos de Linfócitos T/química , Receptores de Antígenos de Linfócitos T/metabolismo , Biologia Computacional/métodos , Humanos , Engenharia de Proteínas/métodos , Modelos Moleculares , Conformação Proteica , Ligação ProteicaRESUMO
Molecular dynamics (MD) simulations produce a substantial volume of high-dimensional data, and traditional methods for analyzing these data pose significant computational demands. Advances in MD simulation analysis combined with deep learning-based approaches have led to the understanding of specific structural changes observed in MD trajectories, including those induced by mutations. In this study, we model the trajectories resulting from MD simulations of the SARS-CoV-2 spike protein-ACE2, specifically the receptor-binding domain (RBD), as interresidue distance maps, and use deep convolutional neural networks to predict the functional impact of point mutations, related to the virus's infectivity and immunogenicity. Our model was successful in predicting mutant types that increase the affinity of the S protein for human receptors and reduce its immunogenicity, both based on MD trajectories (precision = 0.718; recall = 0.800; [Formula: see text] = 0.757; MCC = 0.488; AUC = 0.800) and their centroids. In an additional analysis, we also obtained a strong positive Pearson's correlation coefficient equal to 0.776, indicating a significant relationship between the average sigmoid probability for the MD trajectories and binding free energy (BFE) changes. Furthermore, we obtained a coefficient of determination of 0.602. Our 2D-RMSD analysis also corroborated predictions for more infectious and immune-evading mutants and revealed fluctuating regions within the receptor-binding motif (RBM), especially in the [Formula: see text] loop. This region presented a significant standard deviation for mutations that enable SARS-CoV-2 to evade the immune response, with RMSD values of 5Å in the simulation. This methodology offers an efficient alternative to identify potential strains of SARS-CoV-2, which may be potentially linked to more infectious and immune-evading mutations. Using clustering and deep learning techniques, our approach leverages information from the ensemble of MD trajectories to recognize a broad spectrum of multiple conformational patterns characteristic of mutant types. This represents a strategic advantage in identifying emerging variants, bypassing the need for long MD simulations. Furthermore, the present work tends to contribute substantially to the field of computational biology and virology, particularly to accelerate the design and optimization of new therapeutic agents and vaccines, offering a proactive stance against the constantly evolving threat of COVID-19 and potential future pandemics.
Assuntos
Enzima de Conversão de Angiotensina 2 , Aprendizado Profundo , Simulação de Dinâmica Molecular , SARS-CoV-2 , Glicoproteína da Espícula de Coronavírus , Glicoproteína da Espícula de Coronavírus/química , Glicoproteína da Espícula de Coronavírus/genética , Glicoproteína da Espícula de Coronavírus/metabolismo , Humanos , SARS-CoV-2/genética , SARS-CoV-2/química , SARS-CoV-2/metabolismo , Enzima de Conversão de Angiotensina 2/química , Enzima de Conversão de Angiotensina 2/metabolismo , COVID-19/virologia , Ligação Proteica , Conformação Proteica , Mutação , Sítios de Ligação , Domínios ProteicosRESUMO
Over the last ten years, social media has become a crucial data source for businesses and researchers, providing a space where people can express their opinions and emotions. To analyze this data and classify emotions and their polarity in texts, natural language processing (NLP) techniques such as emotion analysis (EA) and sentiment analysis (SA) are employed. However, the effectiveness of these tasks using machine learning (ML) and deep learning (DL) methods depends on large labeled datasets, which are scarce in languages like Spanish. To address this challenge, researchers use data augmentation (DA) techniques to artificially expand small datasets. This study aims to investigate whether DA techniques can improve classification results using ML and DL algorithms for sentiment and emotion analysis of Spanish texts. Various text manipulation techniques were applied, including transformations, paraphrasing (back-translation), and text generation using generative adversarial networks, to small datasets such as song lyrics, social media comments, headlines from national newspapers in Chile, and survey responses from higher education students. The findings show that the Convolutional Neural Network (CNN) classifier achieved the most significant improvement, with an 18% increase using the Generative Adversarial Networks for Sentiment Text (SentiGan) on the Aggressiveness (Seriousness) dataset. Additionally, the same classifier model showed an 11% improvement using the Easy Data Augmentation (EDA) on the Gender-Based Violence dataset. The performance of the Bidirectional Encoder Representations from Transformers (BETO) also improved by 10% on the back-translation augmented version of the October 18 dataset, and by 4% on the EDA augmented version of the Teaching survey dataset. These results suggest that data augmentation techniques enhance performance by transforming text and adapting it to the specific characteristics of the dataset. Through experimentation with various augmentation techniques, this research provides valuable insights into the analysis of subjectivity in Spanish texts and offers guidance for selecting algorithms and techniques based on dataset features.
Assuntos
Emoções , Processamento de Linguagem Natural , Mídias Sociais , Humanos , Algoritmos , Redes Neurais de Computação , Aprendizado de Máquina , Idioma , Aprendizado ProfundoRESUMO
Smart indoor tourist attractions, such as smart museums and aquariums, require a significant investment in indoor localization devices. The use of Global Positioning Systems on smartphones is unsuitable for scenarios where dense materials such as concrete and metal blocks weaken GPS signals, which is most often the case in indoor tourist attractions. With the help of deep learning, indoor localization can be done region by region using smartphone images. This approach requires no investment in infrastructure and reduces the cost and time needed to turn museums and aquariums into smart museums or smart aquariums. In this paper, we propose using deep learning algorithms to classify locations based on smartphone camera images for indoor tourist attractions. We evaluate our proposal in a real-world scenario in Brazil. We extensively collect images from ten different smartphones to classify biome-themed fish tanks in the Pantanal Biopark, creating a new dataset of 3654 images. We tested seven state-of-the-art neural networks, three of them based on transformers. On average, we achieved a precision of about 90% and a recall and f-score of about 89%. The results show that the proposal is suitable for most indoor tourist attractions.
Assuntos
Aprendizado Profundo , Smartphone , Turismo , Humanos , Algoritmos , Processamento de Imagem Assistida por Computador/métodos , Redes Neurais de Computação , Sistemas de Informação Geográfica , BrasilRESUMO
OBJECTIVE: The primary objective was to assess the diagnostic accuracy of a deep learning-based artificial intelligence model for the detection of acute appendicular fractures in pediatric patients presenting with a recent history of trauma to the emergency department. The secondary goal was to examine the effect of assistive support on the emergency doctor's ability to detect fractures. METHODS: The dataset was 5,150 radiographs of which 850 showed fractures, while 4,300 radiographs did not show any fractures. The process utilized 4,532 (88%) radiographs, inclusive of both fractured and non-fractured radiographs, in the training phase. Subsequently, 412 (8%) radiographs were appraised during validation, and 206 (4%) were set apart for the testing phase. With and without artificial intelligence assistance, the emergency doctor reviewed another set of 2,000 radiographs (400 fractures and 600 non-fractures each) for labeling in the second test. RESULTS: The artificial intelligence model showed a mean average precision 50 of 89%, a specificity of 92%, a sensitivity of 90%, and an F1 score of 90%. The confusion matrix revealed that the model trained with artificial intelligence achieved accuracies of 93 and 95% in detecting fractures, respectively. Artificial intelligence assistance improved the reading sensitivity from 93.7% (without assistance) to 97.0% (with assistance) and the reading accuracy from 88% (without assistance) to 94.9% (with assistance). CONCLUSION: A deep learning-based artificial intelligence model has proven to be highly effective in detecting fractures in pediatric patients, enhancing the diagnostic capabilities of emergency doctors through assistive support.
Assuntos
Inteligência Artificial , Fraturas Ósseas , Humanos , Fraturas Ósseas/diagnóstico por imagem , Criança , Pré-Escolar , Sensibilidade e Especificidade , Feminino , Aprendizado Profundo , Serviço Hospitalar de Emergência , Masculino , Reprodutibilidade dos Testes , Radiografia/métodos , Adolescente , LactenteRESUMO
Stroke, the second leading cause of mortality globally, predominantly results from ischemic conditions. Immediate attention and diagnosis, related to the characterization of brain lesions, play a crucial role in patient prognosis. Standard stroke protocols include an initial evaluation from a non-contrast CT to discriminate between hemorrhage and ischemia. However, non-contrast CTs lack sensitivity in detecting subtle ischemic changes in this phase. Alternatively, diffusion-weighted MRI studies provide enhanced capabilities, yet are constrained by limited availability and higher costs. Hence, we idealize new approaches that integrate ADC stroke lesion findings into CT, to enhance the analysis and accelerate stroke patient management. This study details a public challenge where scientists applied top computational strategies to delineate stroke lesions on CT scans, utilizing paired ADC information. Also, it constitutes the first effort to build a paired dataset with NCCT and ADC studies of acute ischemic stroke patients. Submitted algorithms were validated with respect to the references of two expert radiologists. The best achieved Dice score was 0.2 over a test study with 36 patient studies. Despite all the teams employing specialized deep learning tools, results reveal limitations of computational approaches to support the segmentation of small lesions with heterogeneous density.
Assuntos
AVC Isquêmico , Tomografia Computadorizada por Raios X , Humanos , AVC Isquêmico/diagnóstico por imagem , Tomografia Computadorizada por Raios X/métodos , Imageamento por Ressonância Magnética/métodos , Algoritmos , Imagem de Difusão por Ressonância Magnética/métodos , Isquemia Encefálica/diagnóstico por imagem , Masculino , Feminino , Idoso , Processamento de Imagem Assistida por Computador/métodos , Aprendizado Profundo , Acidente Vascular Cerebral/diagnóstico por imagem , Encéfalo/diagnóstico por imagem , Encéfalo/patologiaRESUMO
The use of artificial intelligence algorithms (AI) has gained importance for dental applications in recent years. Analyzing AI information from different sensor data such as images or panoramic radiographs (panoramic X-rays) can help to improve medical decisions and achieve early diagnosis of different dental pathologies. In particular, the use of deep learning (DL) techniques based on convolutional neural networks (CNNs) has obtained promising results in dental applications based on images, in which approaches based on classification, detection, and segmentation are being studied with growing interest. However, there are still several challenges to be tackled, such as the data quality and quantity, the variability among categories, and the analysis of the possible bias and variance associated with each dataset distribution. This study aims to compare the performance of three deep learning object detection models-Faster R-CNN, YOLO V2, and SSD-using different ResNet architectures (ResNet-18, ResNet-50, and ResNet-101) as feature extractors for detecting and classifying third molar angles in panoramic X-rays according to Winter's classification criterion. Each object detection architecture was trained, calibrated, validated, and tested with three different feature extraction CNNs which are ResNet-18, ResNet-50, and ResNet-101, which were the networks that best fit our dataset distribution. Based on such detection networks, we detect four different categories of angles in third molars using panoramic X-rays by using Winter's classification criterion. This criterion characterizes the third molar's position relative to the second molar's longitudinal axis. The detected categories for the third molars are distoangular, vertical, mesioangular, and horizontal. For training, we used a total of 644 panoramic X-rays. The results obtained in the testing dataset reached up to 99% mean average accuracy performance, demonstrating the YOLOV2 obtained higher effectiveness in solving the third molar angle detection problem. These results demonstrate that the use of CNNs for object detection in panoramic radiographs represents a promising solution in dental applications.
Assuntos
Aprendizado Profundo , Dente Serotino , Redes Neurais de Computação , Radiografia Panorâmica , Radiografia Panorâmica/métodos , Humanos , Dente Serotino/diagnóstico por imagem , Algoritmos , Inteligência Artificial , Processamento de Imagem Assistida por Computador/métodosRESUMO
The massive arrival of pelagic Sargassum on the coasts of several countries of the Atlantic Ocean began in 2011 and to date continues to generate social and environmental challenges for the region. Therefore, knowing the distribution and quantity of Sargassum in the ocean, coasts, and beaches is necessary to understand the phenomenon and develop protocols for its management, use, and final disposal. In this context, the present study proposes a methodology to calculate the area Sargassum occupies on beaches in square meters, based on the semantic segmentation of aerial images using the pix2pix architecture. For training and testing the algorithm, a unique dataset was built from scratch, consisting of 15,268 aerial images segmented into three classes. The images correspond to beaches in the cities of Mahahual and Puerto Morelos, located in Quintana Roo, Mexico. To analyze the results the fß-score metric was used. The results for the Sargassum class indicate that there is a balance between false positives and false negatives, with a slight bias towards false negatives, which means that the algorithm tends to underestimate the Sargassum pixels in the images. To know the confidence intervals within which the algorithm performs better, the results of the f0.5-score metric were resampled by bootstrapping considering all classes and considering only the Sargassum class. From the above, we found that the algorithm offers better performance when segmenting Sargassum images on the sand. From the results, maps showing the Sargassum coverage area along the beach were designed to complement the previous ones and provide insight into the field of study.
Assuntos
Aprendizado Profundo , Sargassum , México , Algoritmos , Monitoramento Ambiental/métodos , Oceano Atlântico , Humanos , Imagens de Satélites , Conservação dos Recursos Naturais/métodos , PraiasRESUMO
This work aims at proposing an affordable, non-wearable system to detect falls of people in need of care. The proposal uses artificial vision based on deep learning techniques implemented on a Raspberry Pi4 4GB RAM with a High-Definition IR-CUT camera. The CNN architecture classifies detected people into five classes: fallen, crouching, sitting, standing, and lying down. When a fall is detected, the system sends an alert notification to mobile devices through the Telegram instant messaging platform. The system was evaluated considering real daily indoor activities under different conditions: outfit, lightning, and distance from camera. Results show a good trade-off between performance and cost of the system. Obtained performance metrics are: precision of 96.4%, specificity of 96.6%, accuracy of 94.8%, and sensitivity of 93.1%. Regarding privacy concerns, even though this system uses a camera, the video is not recorded or monitored by anyone, and pictures are only sent in case of fall detection. This work can contribute to reducing the fatal consequences of falls in people in need of care by providing them with prompt attention. Such a low-cost solution would be desirable, particularly in developing countries with limited or no medical alert systems and few resources.
Assuntos
Acidentes por Quedas , Humanos , Acidentes por Quedas/prevenção & controle , Aprendizado Profundo , Computadores , AlgoritmosRESUMO
Emotion recognition through speech is a technique employed in various scenarios of Human-Computer Interaction (HCI). Existing approaches have achieved significant results; however, limitations persist, with the quantity and diversity of data being more notable when deep learning techniques are used. The lack of a standard in feature selection leads to continuous development and experimentation. Choosing and designing the appropriate network architecture constitutes another challenge. This study addresses the challenge of recognizing emotions in the human voice using deep learning techniques, proposing a comprehensive approach, and developing preprocessing and feature selection stages while constructing a dataset called EmoDSc as a result of combining several available databases. The synergy between spectral features and spectrogram images is investigated. Independently, the weighted accuracy obtained using only spectral features was 89%, while using only spectrogram images, the weighted accuracy reached 90%. These results, although surpassing previous research, highlight the strengths and limitations when operating in isolation. Based on this exploration, a neural network architecture composed of a CNN1D, a CNN2D, and an MLP that fuses spectral features and spectogram images is proposed. The model, supported by the unified dataset EmoDSc, demonstrates a remarkable accuracy of 96%.
Assuntos
Aprendizado Profundo , Emoções , Redes Neurais de Computação , Humanos , Emoções/fisiologia , Fala/fisiologia , Bases de Dados Factuais , Algoritmos , Reconhecimento Automatizado de Padrão/métodosRESUMO
INTRODUCTION: Undetected high-risk conditions in pregnancy are a leading cause of perinatal mortality in low-income and middle-income countries. A key contributor to adverse perinatal outcomes in these settings is limited access to high-quality screening and timely referral to care. Recently, a low-cost one-dimensional Doppler ultrasound (1-D DUS) device was developed that front-line workers in rural Guatemala used to collect quality maternal and fetal data. Further, we demonstrated with retrospective preliminary data that 1-D DUS signal could be processed using artificial intelligence and deep-learning algorithms to accurately estimate fetal gestational age, intrauterine growth and maternal blood pressure. This protocol describes a prospective observational pregnancy cohort study designed to prospectively evaluate these preliminary findings. METHODS AND ANALYSIS: This is a prospective observational cohort study conducted in rural Guatemala. In this study, we will follow pregnant women (N =700) recruited prior to 18 6/7 weeks gestation until their delivery and early postpartum period. During pregnancy, trained nurses will collect data on prenatal risk factors and obstetrical care. Every 4 weeks, the research team will collect maternal weight, blood pressure and 1-D DUS recordings of fetal heart tones. Additionally, we will conduct three serial obstetric ultrasounds to evaluate for fetal growth restriction (FGR), and one postpartum visit to record maternal blood pressure and neonatal weight and length. We will compare the test characteristics (receiver operator curves) of 1-D DUS algorithms developed by deep-learning methods to two-dimensional fetal ultrasound survey and published clinical pre-eclampsia risk prediction algorithms for predicting FGR and pre-eclampsia, respectively. ETHICS AND DISSEMINATION: Results of this study will be disseminated at scientific conferences and through peer-reviewed articles. Deidentified data sets will be made available through public repositories. The study has been approved by the institutional ethics committees of Maya Health Alliance and Emory University.
Assuntos
Inteligência Artificial , Retardo do Crescimento Fetal , Pré-Eclâmpsia , Ultrassonografia Doppler , Humanos , Gravidez , Feminino , Pré-Eclâmpsia/diagnóstico por imagem , Pré-Eclâmpsia/diagnóstico , Guatemala , Retardo do Crescimento Fetal/diagnóstico por imagem , Retardo do Crescimento Fetal/diagnóstico , Estudos Prospectivos , Ultrassonografia Doppler/métodos , População Rural , Ultrassonografia Pré-Natal/métodos , Adulto , Idade Gestacional , Aprendizado Profundo , HipertensãoRESUMO
Introducción. La formación integral de los residentes excede el conocimiento teórico y la técnica operatoria. Frente a la complejidad de la cirugía moderna, su incertidumbre y dinamismo, es necesario redefinir la comprensión de la educación quirúrgica y promover capacidades adaptativas en los futuros cirujanos para manejar efectivamente el entorno. Estos aspectos se refieren a la experticia adaptativa. Métodos. La presente revisión narrativa propone una definición de la educación quirúrgica con énfasis en la experticia adaptativa, y un enfoque para su adopción en la práctica. Resultados. Con base en la literatura disponible, la educación quirúrgica representa un proceso dinámico que se sitúa en la intersección de la complejidad de la cultura quirúrgica, del aprendizaje en el sitio de trabajo y de la calidad en el cuidado de la salud, dirigido a la formación de capacidades cognitivas, manuales y adaptativas en el futuro cirujano, que le permitan proveer cuidado de alto valor en un sistema de trabajo colectivo, mientras se fortalece su identidad profesional. La experticia adaptativa del residente es una capacidad fundamental para maximizar su desempeño frente a estas características de la educación quirúrgica. En la literatura disponible se encuentran seis estrategias para fortalecer esta capacidad. Conclusión. La experticia adaptativa es una capacidad esperada y necesaria en el médico residente de cirugía, para hacer frente a la complejidad de la educación quirúrgica. Existen estrategias prácticas que pueden ayudar a fortalecerla, las cuales deben ser evaluadas en nuevos estudios.
Introduction. The comprehensive training of residents exceeds theoretical knowledge and operative technique. Faced with the complexity of modern surgery, its uncertainty and dynamism, it is necessary to redefine the understanding of surgical education and promote adaptive capabilities in future surgeons for the effective management of the environment. These aspects refer to adaptive expertise. Methods. The present narrative review proposes a definition of surgical education with an emphasis on adaptive expertise, and an approach for its adoption in practice. Results. Based on the available literature, surgical education represents a dynamic process that is situated at the intersection of the complexity of surgical culture, learning in the workplace, and quality in health care, aimed at training of cognitive, manual, and adaptive capacities in the future surgeon, which allow them to provide high-value care in a collective work system, while strengthening their professional identity. Resident's adaptive expertise is a fundamental capacity to maximize his or her performance in the face of these characteristics of surgical education. In the available literature there are six strategies to strengthen this capacity. Conclusion. Adaptive expertise is an expected and necessary capacity in the surgical resident to deal with the complexity of surgical education. There are practical strategies that can help strengthen it, which must be evaluated in new studies.
Assuntos
Humanos , Educação de Pós-Graduação em Medicina , Aprendizado Profundo , Competência Profissional , Cirurgia Geral , Educação Vocacional , MetacogniçãoRESUMO
Monitoring forest canopies is vital for ecological studies, particularly for assessing epiphytes in rain forest ecosystems. Traditional methods for studying epiphytes, such as climbing trees and building observation structures, are labor, cost intensive and risky. Unmanned Aerial Vehicles (UAVs) have emerged as a valuable tool in this domain, offering botanists a safer and more cost-effective means to collect data. This study leverages AI-assisted techniques to enhance the identification and mapping of epiphytes using UAV imagery. The primary objective of this research is to evaluate the effectiveness of AI-assisted methods compared to traditional approaches in segmenting/identifying epiphytes from UAV images collected in a reserve forest in Costa Rica. Specifically, the study investigates whether Deep Learning (DL) models can accurately identify epiphytes during complex backgrounds, even with a limited dataset of varying image quality. Systematically, this study compares three traditional image segmentation methods Auto Cluster, Watershed, and Level Set with two DL-based segmentation networks: the UNet and the Vision Transformer-based TransUNet. Results obtained from this study indicate that traditional methods struggle with the complexity of vegetation backgrounds and variability in target characteristics. Epiphyte identification results were quantitatively evaluated using the Jaccard score. Among traditional methods, Watershed scored 0.10, Auto Cluster 0.13, and Level Set failed to identify the target. In contrast, AI-assisted models performed better, with UNet scoring 0.60 and TransUNet 0.65. These results highlight the potential of DL approaches to improve the accuracy and efficiency of epiphyte identification and mapping, advancing ecological research and conservation.
Assuntos
Dispositivos Aéreos não Tripulados , Costa Rica , Ecossistema , Monitoramento Ambiental/métodos , Aprendizado Profundo , Inteligência Artificial , Florestas , Plantas , Floresta Úmida , ÁrvoresRESUMO
BACKGROUND: The early diagnosis of Alzheimer's disease (AD) and mild cognitive impairment (MCI) remains a significant challenge in neurology, with conventional methods often limited by subjectivity and variability in interpretation. Integrating deep learning with artificial intelligence (AI) in magnetic resonance imaging (MRI) analysis emerges as a transformative approach, offering the potential for unbiased, highly accurate diagnostic insights. OBJECTIVE: A meta-analysis was designed to analyze the diagnostic accuracy of deep learning of MRI images on AD and MCI models. METHODS: A meta-analysis was performed across PubMed, Embase, and Cochrane library databases following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, focusing on the diagnostic accuracy of deep learning. Subsequently, methodological quality was assessed using the QUADAS-2 checklist. Diagnostic measures, including sensitivity, specificity, likelihood ratios, diagnostic odds ratio, and area under the receiver operating characteristic curve (AUROC) were analyzed, alongside subgroup analyses for T1-weighted and non-T1-weighted MRI. RESULTS: A total of 18 eligible studies were identified. The Spearman correlation coefficient was -0.6506. Meta-analysis showed that the combined sensitivity and specificity, positive likelihood ratio, negative likelihood ratio, and diagnostic odds ratio were 0.84, 0.86, 6.0, 0.19, and 32, respectively. The AUROC was 0.92. The quiescent point of hierarchical summary of receiver operating characteristic (HSROC) was 3.463. Notably, the images of 12 studies were acquired by T1-weighted MRI alone, and those of the other 6 were gathered by non-T1-weighted MRI alone. CONCLUSION: Overall, deep learning of MRI for the diagnosis of AD and MCI showed good sensitivity and specificity and contributed to improving diagnostic accuracy.
ANTECEDENTES: O diagnóstico precoce da doença de Alzheimer (DA) e do comprometimento cognitivo leve (CCL) continua sendo um desafio significativo na neurologia, com métodos convencionais frequentemente limitados pela subjetividade e variabilidade na interpretação. A integração da aprendizagem profunda com a inteligência artificial (IA) na análise de imagens de ressonância magnética surge como uma abordagem transformadora, oferecendo o potencial para insights diagnósticos imparciais e altamente precisos. OBJETIVO: Uma metanálise foi projetada para analisar a precisão diagnóstica do aprendizado profundo de imagens de ressonância magnética em modelos de DA e CCL. MéTODOS: Uma metanálise foi realizada nos bancos de dados das bibliotecas PubMed, Embase e Cochrane seguindo as diretrizes Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA), com foco na precisão diagnóstica do aprendizado profundo. Posteriormente, a qualidade metodológica foi avaliada por meio do checklist QUADAS-2. Medidas diagnósticas, incluindo sensibilidade, especificidade, razões de verossimilhança, razão de chances diagnósticas e área sob a curva característica de operação do receptor (area under the receiver operating characteristic curve [AUROC]) foram analisadas, juntamente com análises de subgrupo para ressonância magnética ponderada em T1 e não ponderada em T1. RESULTADOS: Um total de 18 estudos elegíveis foram identificados. O coeficiente de correlação de Spearman foi de -0,6506. A metanálise mostrou que a sensibilidade e a especificidade combinadas, a razão de verossimilhança positiva, a razão de verossimilhança negativa e a razão de chances de diagnóstico foram 0,84, 0,86, 6,0, 0,19 e 32, respectivamente. A AUROC foi de 0,92. O ponto quiescente do resumo hierárquico da característica de operação do receptor (hierarchical summary of receiver operating characteristic [HSROC]) foi 3,463. Notavelmente, as imagens de 12 estudos foram adquiridas apenas por ressonância magnética ponderada em T1, e as dos outros 6 foram obtidas apenas por ressonância magnética não ponderada em T1. CONCLUSãO: Em geral, a aprendizagem profunda da ressonância magnética para o diagnóstico de DA e CCL mostrou boa sensibilidade e especificidade e contribuiu para melhorar a precisão diagnóstica.
Assuntos
Doença de Alzheimer , Disfunção Cognitiva , Aprendizado Profundo , Imageamento por Ressonância Magnética , Sensibilidade e Especificidade , Humanos , Disfunção Cognitiva/diagnóstico por imagem , Disfunção Cognitiva/diagnóstico , Doença de Alzheimer/diagnóstico por imagem , Doença de Alzheimer/diagnóstico , Imageamento por Ressonância Magnética/métodos , Diagnóstico Precoce , Curva ROCRESUMO
The classification of carambola, also known as starfruit, according to quality parameters is usually conducted by trained human evaluators through visual inspections. This is a costly and subjective method that can generate high variability in results. As an alternative, computer vision systems (CVS) combined with deep learning (DCVS) techniques have been introduced in the industry as a powerful and an innovative tool for the rapid and non-invasive classification of fruits. However, validating the learning capability and trustworthiness of a DL model, aka black box, to obtain insights can be challenging. To reduce this gap, we propose an integrated eXplainable Artificial Intelligence (XAI) method for the classification of carambolas at different maturity stages. We compared two Residual Neural Networks (ResNet) and Visual Transformers (ViT) to identify the image regions that are enhanced by a Random Forest (RF) model, with the aim of providing more detailed information at the feature level for classifying the maturity stage. Changes in fruit colour and physicochemical data throughout the maturity stages were analysed, and the influence of these parameters on the maturity stages was evaluated using the Gradient-weighted Class Activation Mapping (Grad-CAM), the Attention Maps using RF importance. The proposed approach provides a visualization and description of the most important regions that led to the model decision, in wide visualization follows the models an importance features from RF. Our approach has promising potential for standardized and rapid carambolas classification, achieving 91 % accuracy with ResNet and 95 % with ViT, with potential application for other fruits.
Assuntos
Averrhoa , Frutas , Redes Neurais de Computação , Frutas/crescimento & desenvolvimento , Frutas/classificação , Averrhoa/química , Aprendizado Profundo , Inteligência Artificial , CorRESUMO
TB/HIV coinfection poses a complex public health challenge. Accurate forecasting of future trends is essential for efficient resource allocation and intervention strategy development. This study compares classical statistical and machine learning models to predict TB/HIV coinfection cases stratified by gender and the general populations. We analyzed time series data using exponential smoothing and ARIMA to establish the baseline trend and seasonality. Subsequently, machine learning models (SVR, XGBoost, LSTM, CNN, GRU, CNN-GRU, and CNN-LSTM) were employed to capture the complex dynamics and inherent non-linearities of TB/HIV coinfection data. Performance metrics (MSE, MAE, sMAPE) and the Diebold-Mariano test were used to evaluate the model performance. Results revealed that Deep Learning models, particularly Bidirectional LSTM and CNN-LSTM, significantly outperformed classical methods. This demonstrates the effectiveness of Deep Learning for modeling TB/HIV coinfection time series and generating more accurate forecasts.
Assuntos
Coinfecção , Previsões , Infecções por HIV , Aprendizado de Máquina , Tuberculose , Humanos , Infecções por HIV/complicações , Infecções por HIV/epidemiologia , Coinfecção/epidemiologia , Tuberculose/epidemiologia , Tuberculose/complicações , Previsões/métodos , Feminino , Masculino , Aprendizado ProfundoRESUMO
Plasmodium parasites cause Malaria disease, which remains a significant threat to global health, affecting 200 million people and causing 400,000 deaths yearly. Plasmodium falciparum and Plasmodium vivax remain the two main malaria species affecting humans. Identifying the malaria disease in blood smears requires years of expertise, even for highly trained specialists. Literature studies have been coping with the automatic identification and classification of malaria. However, several points must be addressed and investigated so these automatic methods can be used clinically in a Computer-aided Diagnosis (CAD) scenario. In this work, we assess the transfer learning approach by using well-known pre-trained deep learning architectures. We considered a database with 6222 Region of Interest (ROI), of which 6002 are from the Broad Bioimage Benchmark Collection (BBBC), and 220 were acquired locally by us at Fundação Oswaldo Cruz (FIOCRUZ) in Porto Velho Velho, Rondônia-Brazil, which is part of the legal Amazon. We exhaustively cross-validated the dataset using 100 distinct partitions with 80% train and 20% test for each considering circular ROIs (rough segmentation). Our experimental results show that DenseNet201 has a potential to identify Plasmodium parasites in ROIs (infected or uninfected) of microscopic images, achieving 99.41% AUC with a fast processing time. We further validated our results, showing that DenseNet201 was significantly better (99% confidence interval) than the other networks considered in the experiment. Our results support claiming that transfer learning with texture features potentially differentiates subjects with malaria, spotting those with Plasmodium even in Leukocytes images, which is a challenge. In Future work, we intend scale our approach by adding more data and developing a friendly user interface for CAD use. We aim at aiding the worldwide population and our local natives living nearby the legal Amazon's rivers.
Assuntos
Microscopia , Humanos , Microscopia/métodos , Plasmodium falciparum/patogenicidade , Plasmodium vivax , Biologia Computacional/métodos , Malária/parasitologia , Plasmodium , Aprendizado Profundo , Bases de Dados Factuais , Processamento de Imagem Assistida por Computador/métodos , Malária Falciparum/parasitologia , Diagnóstico por Computador/métodosRESUMO
PURPOSE: Amid rising health awareness, natural products which has milder effects than medical drugs are becoming popular. However, only few systems can quantitatively assess their impact on living organisms. Therefore, we developed a deep-learning system to automate the counting of cells in a gerbil model, aiming to assess a natural product's effectiveness against ischemia. METHODS: The image acquired from paraffin blocks containing gerbil brains was analyzed by a deep-learning model (fine-tuned Detectron2). RESULTS: The counting system achieved a 79%-positive predictive value and 85%-sensitivity when visual judgment by an expert was used as ground truth. CONCLUSIONS: Our system evaluated hydrogen water's potential against ischemia and found it potentially useful, which is consistent with expert assessment. Due to natural product's milder effects, large data sets are needed for evaluation, making manual measurement labor-intensive. Hence, our system offers a promising new approach for evaluating natural products.