ABSTRACT
Energy consumption of constructed educational facilities significantly impacts economic, social and environment sustainable development. It contributes to approximately 37% of the carbon dioxide emissions associated with energy use and procedures. This paper aims to introduce a study that investigates several artificial intelligence-based models to predict the energy consumption of the most important educational buildings; schools. These models include decision trees, K-nearest neighbors, gradient boosting, and long-term memory networks. The research also investigates the relationship between the input parameters and the yearly energy usage of educational buildings. It has been discovered that the school sizes and AC capacities are the most impact variable associated with higher energy consumption. While 'Type of School' is less direct or weaker correlation with 'Annual Consumption'. The four developed models were evaluated and compared in training and testing stages. The Decision Tree model demonstrates strong performance on the training data with an average prediction error of about 3.58%. The K-Nearest Neighbors model has significantly higher errors, with RMSE on training data as high as 38,429.4, which may be indicative of overfitting. In contrast, Gradient Boosting can almost perfectly predict the variations within the training dataset. The performance metrics suggest that some models manage this variability better than others, with Gradient Boosting and LSTM standing out in terms of their ability to handle diverse data ranges, from the minimum consumption of approximately 99,274.95 to the maximum of 683,191.8. This research underscores the importance of sustainable educational buildings not only as physical learning spaces but also as dynamic environments that contribute to informal educational processes. Sustainable buildings serve as real-world examples of environmental stewardship, teaching students about energy efficiency and sustainability through their design and operation. By incorporating advanced AI-driven tools to optimize energy consumption, educational facilities can become interactive learning hubs that encourage students to engage with concepts of sustainability in their everyday surroundings.
Subject(s)
Artificial Intelligence , Schools , Humans , Conservation of Energy Resources/methods , Decision Trees , Models, TheoreticalABSTRACT
Investigation of the biological sex of human remains is a crucial aspect of physical anthropology. However, due to varying states of skeletal preservation, multiple approaches and structures of interest need to be explored. This research aims to investigate the potential use of distances between bifrontal breadth (FMB), infraorbital foramina distance (IOD), nasal breadth (NLB), inter-canine width (ICD), and distance between mental foramina (MFD) for combined sex prediction through traditional statistical methods and through open-access machine-learning tools. Ethical approval was obtained from the ethics committee, and out of 100 cone beam computed tomography (CBCT) scans, 54 individuals were selected with all the points visible. Ten extra exams were chosen to test the predictors developed from the learning sample. Descriptive analysis of measurements, standard deviation, and standard error were obtained. T-student and Mann-Whitney tests were utilized to assess the sex differences within the variables. A logistic regression equation was developed and tested for the investigation of the biological sex as well as decision trees, random forest, and artificial neural networks machine-learning models. The results indicate a strong correlation between the measurements and the sex of individuals. When combined, the measurements were able to predict sex using a regression formula or machine learning based models which can be exported and added to software or webpages. Considering the methods, the estimations showed an accuracy rate superior to 80% for males and 82% for females. All skulls in the test sample were accurately predicted by both statistical and machine-learning models. This exploratory study successfully established a correlation between facial measurements and the sex of individuals, validating the prediction potential of machine learning, augmenting the investigative tools available to experts with a high differentiation potential.
Subject(s)
Cephalometry , Cone-Beam Computed Tomography , Machine Learning , Sex Determination by Skeleton , Humans , Male , Female , Sex Determination by Skeleton/methods , Adult , Forensic Anthropology/methods , Logistic Models , Middle Aged , Neural Networks, Computer , Young Adult , Skull/diagnostic imaging , Aged , Decision TreesABSTRACT
Leptospirosis is a global disease that impacts people worldwide, particularly in humid and tropical regions, and is associated with significant socio-economic deficiencies. Its symptoms are often confused with other syndromes, which can compromise clinical diagnosis and the failure to carry out specific laboratory tests. In this respect, this paper presents a study of three algorithms (Decision Tree, Random Forest and Adaboost) for predicting the outcome (cure or death) of individuals with leptospirosis. Using the records contained in the government National System of Aggressions and Notification (SINAN, in portuguese) from 2007 to 2017, for the state of Pará, Brazil, where the temporal attributes of health care, symptoms (headache, vomiting, jaundice, calf pain) and clinical evolution (renal failure and respiratory changes) were used. In the performance evaluation of the selected models, it was observed that the Random Forest exhibited an accuracy of 90.81% for the training dataset, considering the attributes of experiment 8, and the Decision Tree presented an accuracy of 74.29 for the validation database. So, this result considers the best attributes pointed out by experiment 10: time first symptoms medical attention, time first symptoms ELISA sample collection, medical attention hospital admission time, headache, calf pain, vomiting, jaundice, renal insufficiency, and respiratory alterations. The contribution of this article is the confirmation that artificial intelligence, using the Decision Tree model algorithm, depicting the best choice as the final model to be used in future data for the prediction of human leptospirosis cases, helping in the diagnosis and course of the disease, aiming to avoid the evolution to death.
Subject(s)
Leptospirosis , Machine Learning , Leptospirosis/diagnosis , Humans , Algorithms , Decision Trees , Brazil/epidemiology , Outcome Assessment, Health Care/methods , Male , Female , AdultABSTRACT
Dengue causes approximately 10.000 deaths and 100 million symptomatic infections annually worldwide, making it a significant public health concern. To address this, artificial intelligence tools like machine learning can play a crucial role in developing more effective strategies for control, diagnosis, and treatment. This study identifies relevant variables for the screening of dengue cases through machine learning models and evaluates the accuracy of the models. Data from reported dengue cases in the states of Rio de Janeiro and Minas Gerais for the years 2016 and 2019 were obtained through the National Notifiable Diseases Surveillance System (SINAN). The mutual information technique was used to assess which variables were most related to laboratory-confirmed dengue cases. Next, a random selection of 10,000 confirmed cases and 10,000 discarded cases was performed, and the dataset was divided into training (70%) and testing (30%). Machine learning models were then tested to classify the cases. It was found that the logistic regression model with 10 variables (gender, age, fever, myalgia, headache, vomiting, nausea, back pain, rash, retro-orbital pain) and the Decision Tree and Multilayer Perceptron (MLP) models achieved the best results in decision metrics, with an accuracy of 98%. Therefore, a tree-based model would be suitable for building an application and implementing it on smartphones. This resource would be available to healthcare professionals such as doctors and nurses.
Subject(s)
Dengue , Machine Learning , Mass Screening , Dengue/diagnosis , Mass Screening/methods , Mass Screening/standards , Brazil , Decision Trees , HumansABSTRACT
OBJECTIVES: To undertake a cost-effectiveness analysis of restorative treatments for a first permanent molar with severe molar incisor hypomineralization from the perspective of the Brazilian public system. MATERIALS AND METHODS: Two models were constructed: a one-year decision tree and a ten-year Markov model, each based on a hypothetical cohort of one thousand individuals through Monte Carlo simulation. Eight restorative strategies were evaluated: high viscosity glass ionomer cement (HVGIC); encapsulated GIC; etch and rinse adhesive + composite; self-etch adhesive + composite; preformed stainless steel crown; HVGIC + etch and rinse adhesive + composite; HVGIC + self-etch adhesive + composite, and encapsulated GIC + etch and rinse adhesive + composite. Effectiveness data were sourced from the literature. Micro-costing was applied using 2022 USD market averages with a 5% variation. Incremental cost-effectiveness ratio (ICER), net monetary benefit (%NMB), and the budgetary impact were obtained. RESULTS: Cost-effective treatments included HVGIC (%NMB = 0%/ 0%), encapsulated GIC (%NMB = 19.4%/ 19.7%), and encapsulated GIC + etch and rinse adhesive + composite (%NMB = 23.4%/ 24.5%) at 1 year and 10 years, respectively. The benefit gain of encapsulated GIC + etch and rinse adhesive + composite in relation to encapsulated GIC was small when compared to the cost increase at 1 year (gain of 3.28% and increase of USD 24.26) and 10 years (gain of 4% and increase of USD 15.54). CONCLUSION: Within the horizon and perspective analyzed, the most cost-effective treatment was encapsulated GIC restoration. CLINICAL RELEVANCE: This study can provide information for decision-making.
Subject(s)
Dental Enamel Hypoplasia , Dental Restoration, Permanent , Glass Ionomer Cements , Humans , Brazil , Decision Trees , Dental Enamel Hypoplasia/therapy , Dental Restoration, Permanent/methods , Dental Restoration, Permanent/economics , Glass Ionomer Cements/therapeutic use , Markov Chains , Molar , Molar Hypomineralization , Monte Carlo MethodABSTRACT
OBJECTIVES: To evaluate cost-effective pharmacological treatment in adult kidney transplant recipients from the perspective of the Colombian health system. METHODS: A decision tree model for the induction phase and a Markov model for the maintenance phase were built. A review of the clinical literature was conducted to extract probabilities, and the life-years were used as the outcome. Costs were calculated using the administrative databases. The evaluating treatment schemes are organized by groups of evidence with direct comparisons. RESULTS: In the induction phase, anti-thymocyte immunoglobulin+ methylprednisolone is dominant, more effective, and less expensive, compared with basiliximab+methylprednisolone. In the maintenance phase, azathioprine (AZA) is dominant in contrast to mycophenolate mofetil (MFM) both with cyclosporine (CIC)+ corticosteroids (CE); CIC is dominant relative to sirolimus (SIR) and tacrolimus (TAC) (both with MFM+CE or AZA+CE), and TAC is dominant compared with SIR (in addition with MFM+CE or mycophenolate sodium [MFS]+CE); MFM is dominant in relation to MFS and everolimus, and SIR is more effective MFM but it does not exceed the threshold (in sum with TAC+CE); MFS and MFM are dominant relative to everolimus, and SIR is more effective than MFM, but it does not exceed the threshold (in addiction with CIC+CE); MFM is dominant in relation to TAC (in sum with SIR+CE), and CIC+AZA+CE is dominant in relation to TAC+MFM+CE. CONCLUSIONS: The base-case results for all evidence groups are consistent with the different sensitivity analyses.
Subject(s)
Immunosuppressive Agents , Kidney Transplantation , Adult , Humans , Adrenal Cortex Hormones/therapeutic use , Adrenal Cortex Hormones/economics , Azathioprine/therapeutic use , Azathioprine/economics , Colombia , Cost-Effectiveness Analysis , Cyclosporine/therapeutic use , Cyclosporine/economics , Decision Trees , Graft Rejection/prevention & control , Graft Rejection/economics , Immunosuppressive Agents/economics , Immunosuppressive Agents/therapeutic use , Kidney Transplantation/economics , Markov Chains , Mycophenolic Acid/therapeutic use , Mycophenolic Acid/economics , Sirolimus/therapeutic use , Sirolimus/economics , Tacrolimus/economics , Tacrolimus/therapeutic use , Transplant Recipients/statistics & numerical dataABSTRACT
OBJECTIVES: The study aimed to evaluate the cost-effectiveness of the Pare de Fumar Conosco software compared with the standard of care adopted in Brazil for the treatment of smoking cessation. METHODS: In the cohort of smokers with multiple chronic conditions, we developed an decision tree model for the benefit measures of smoking cessation. We adopted the perspectives of the Brazilian Unified Health System and the service provider. Resources and costs were measured by primary and secondary sources and effectiveness by a randomized clinical trial. The incremental cost-effectiveness ratio (ICER) was calculated, followed by deterministic and probabilistic sensitivity analyses and deterministic and probabilistic sensitivity analyses. No willingness to pay threshold was adopted. RESULTS: The software had a lower cost and greater effectiveness than its comparator. The ICER was dominant in all of the benefits examined (-R$2 585 178.29 to -R$325 001.20). The cost of the standard of care followed by that of the electronic tool affected the ICER of the benefit measures. In all probabilistic analyses, the software was superior to the standard of care (53.6%-82.5%). CONCLUSION: The Pare de Fumar Conosco software is a technology that results in cost savings in treating smoking cessation.
Subject(s)
Smoking Cessation , Standard of Care , Adult , Female , Humans , Male , Middle Aged , Brazil , Cost-Effectiveness Analysis , Decision Making , Decision Trees , Smoking Cessation/methods , Smoking Cessation/economics , Software/standards , Standard of Care/economicsABSTRACT
PURPOSE: Machine learning (ML) models presented an excellent performance in the prognosis prediction. However, the black box characteristic of ML models limited the clinical applications. Here, we aimed to establish explainable and visualizable ML models to predict biochemical recurrence (BCR) of prostate cancer (PCa). MATERIALS AND METHODS: A total of 647 PCa patients were retrospectively evaluated. Clinical parameters were identified using LASSO regression. Then, cohort was split into training and validation datasets with a ratio of 0.75:0.25 and BCR-related features were included in Cox regression and five ML algorithm to construct BCR prediction models. The clinical utility of each model was evaluated by concordance index (C-index) values and decision curve analyses (DCA). Besides, Shapley Additive Explanation (SHAP) values were used to explain the features in the models. RESULTS: We identified 11 BCR-related features using LASSO regression, then establishing five ML-based models, including random survival forest (RSF), survival support vector machine (SSVM), survival Tree (sTree), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and a Cox regression model, C-index were 0.846 (95%CI 0.796-0.894), 0.774 (95%CI 0.712-0.834), 0.757 (95%CI 0.694-0.818), 0.820 (95%CI 0.765-0.869), 0.793 (95%CI 0.735-0.852), and 0.807 (95%CI 0.753-0.858), respectively. The DCA showed that RSF model had significant advantages over all models. In interpretability of ML models, the SHAP value demonstrated the tangible contribution of each feature in RSF model. CONCLUSIONS: Our score system provide reference for the identification for BCR, and the crafting of a framework for making therapeutic decisions for PCa on a personalized basis.
Subject(s)
Machine Learning , Neoplasm Recurrence, Local , Prostatic Neoplasms , Humans , Male , Prostatic Neoplasms/blood , Prostatic Neoplasms/pathology , Neoplasm Recurrence, Local/blood , Neoplasm Recurrence, Local/pathology , Retrospective Studies , Aged , Middle Aged , Prognosis , Decision Trees , Proportional Hazards Models , Algorithms , Support Vector Machine , Prostate-Specific Antigen/bloodABSTRACT
BACKGROUND: Occupational accidents in the plumbing activity in the construction sector in developing countries have high rates of work absenteeism. The productivity of enterprises is heavily influenced by it. OBJECTIVE: To propose a model based on the Plan, Do, Check, and Act cycle and data mining for the prevention of occupational accidents in the plumbing activity in the construction sector. METHODS: This cross-sectional study was administered on a total of 200 male technical workers in plumbing. It considers biological, biomechanical, chemical, and, physical risk factors. Three data mining algorithms were compared: Logistic Regression, Naive Bayes, and Decision Trees, classifying the occurrences occupational accident. The model was validated considering 20% of the data collected, maintaining the same proportion between accidents and non-accidents. The model was applied to data collected from the last 17 years of occupational accidents in the plumbing activity in a Colombian construction company. RESULTS: The results showed that, in 90.5% of the cases, the decision tree classifier (J48) correctly identified the possible cases of occupational accidents with the biological, chemical, and, biomechanical, risk factors training variables applied in the model. CONCLUSION: The results of this study are promising in that the model is efficient in predicting the occurrence of an occupational accident in the plumbing activity in the construction sector. For the accidents identified and the associated causes, a plan of measures to mitigate the risk of occupational accidents is proposed.
Subject(s)
Accidents, Occupational , Construction Industry , Data Mining , Humans , Data Mining/methods , Cross-Sectional Studies , Accidents, Occupational/prevention & control , Accidents, Occupational/statistics & numerical data , Male , Adult , Colombia/epidemiology , Risk Factors , Bayes Theorem , Decision Trees , Logistic Models , AlgorithmsABSTRACT
Objetivo: analisar os dados de normatização dos escores da versão brasileira do instrumento eHealth Literacy Scale (eHeals) para avaliação do letramento digital em saúde. Método: estudo transversal com 502 adultos brasileiros, realizado em 2019. Dados coletados pelo instrumento eHeals e questionário sociodemográfico. Foram aplicadas árvores de decisão e análise discriminante. Estudo aprovado pelo Comite de Ética em Pesquisa. Resultados: a análise discriminante determinou as faixas de classificação do eHeals a partir da distribuição dos escores. A árvore de decisão indicou que a escolaridade afetou de forma relevante os resultados da escala. Os indivíduos com escolaridade até o ensino fundamental II incompleto: baixo (até 10), médio (11 a 25), alto (27 a 40), e escolaridade acima: baixo (até 25), médio (25 a 32) e alto LDS (33 a 40). Conclusão: a classificação dos níveis de letramento digital em saúde de adultos pelo eHeals deve ser controlada pelos níveis de escolaridade dos participantes(AU)
Objective: to analyze the normative data of the scores of the Brazilian version of the eHealth Literacy Scale (eHeals) instrument for assessing digital health literacy. Method: cross-sectional study with 502 Brazilian adults in 2019. Data collected using the eHeals instrument and sociodemographic questionnaire. Decision trees and discriminant analysis were applied. Study approved by the Research Ethics Committee. Results: Discriminant analysis determined the eHeals classification ranges based on the distribution of scores. The decision tree indicated that education significantly affected the scale results. Thus, individuals with incomplete elementary school education up to II: low (up to 10), medium (11 to 25), high (27 to 40), and higher education: low (up to 25), medium (25 to 32) and high LDS (33 to 40). Conclusion: the classification of digital health literacy levels using eHeals in adults should be controlled by the participants' education levels(AU)
Objetivo: analizar los datos de estandarización de las puntuaciones de la versión brasileña del instrumento eHealth Literacy Scale (eHeals) para evaluar la alfabetización digital en salud. Método: estudio transversal con 502 adultos brasileños que tuvo lugar en 2019. La recolección de datos se hizo mediante el instrumento eHeals y un cuestionario sociodemográfico. Se aplicaron árboles de decisión y análisis discriminante. El Comité de Ética en Investigación aprobó el estudio. Resultados: El análisis discriminante determinó los rangos de clasificación de eHeals con base en la distribución de puntuaciones. El árbol de decisión indicó que la educación afectó significativamente los resultados de la escala. Individuos con educación primaria incompleta: baja (hasta 10), media (11 a 25), alta (27 a 40), y educación superior a esa mencionada: baja (hasta 25), media (25 a 32) y alto LDS (33 a 40). Conclusión: la clasificación de los niveles de alfabetización en salud digital en adultos con eHeals debe ser controlada por los niveles de educación de los participantes(AU)
Subject(s)
Humans , Male , Female , Adult , Middle Aged , Aged , Surveys and Questionnaires/standards , Health Literacy , Brazil , Decision Trees , Discriminant Analysis , Cross-Sectional Studies , Reproducibility of Results , Validation StudyABSTRACT
Maxillary central incisors are critical to occlusal function, smile esthetics, and even one's self-image. Furthermore, their impaction at an early age could have harmful psychological consequences on the individual. Maxillary central incisors can be impacted due to early dentoalveolar trauma to the upper anterior region that displaces the incisor in formation and, in rare instances, tooth germs are deformed. The aftermath of trauma during primary dentition is seen later during mixed dentition. Other causes are either an impediment in the eruption pathway of the maxillary central incisor due to the presence of odontomas or supernumerary teeth, an insufficient eruption space, or, very rarely, syndromic and/or other general medical conditions. Diagnosis is completed through a detailed medical/dental history, clinical evaluation, and appropriate imaging. Arch width increase, space opening, removal of obstructions if present, suitable soft-tissue management, well-designed orthodontic traction mechanics, and long-term periodontal follow-up are all essential elements in resolving cases of impacted maxillary central incisors.
Subject(s)
Incisor , Tooth, Impacted , Humans , Incisor/surgery , Incisor/injuries , Maxilla/surgery , Esthetics, Dental , Tooth, Impacted/surgery , Decision TreesABSTRACT
OBJECTIVE: Cerebrospinal fluid (CSF) biomarkers add accuracy to the diagnostic workup of cognitive impairment by illustrating Alzheimer's disease (AD) pathology. However, there are no universally accepted cutoff values for the interpretation of AD biomarkers. The aim of this study is to determine the viability of a decision-tree method to analyse CSF biomarkers of AD as a support for clinical diagnosis. METHODS: A decision-tree method (automated classification analysis) was applied to concentrations of AD biomarkers in CSF as a support for clinical diagnosis in older adults with or without cognitive impairment in a Brazilian cohort. In brief, 272 older adults (68 with AD, 122 with mild cognitive impairment [MCI], and 82 healthy controls) were assessed for CSF concentrations of Aß1-42, total-tau, and phosphorylated-tau using multiplexed Luminex assays; biomarker values were used to generate decision-tree algorithms (classification and regression tree) in the R statistical software environment. RESULTS: The best decision tree model had an accuracy of 74.65% to differentiate the three groups. Cluster analysis supported the combination of CSF biomarkers to differentiate AD and MCI vs. controls, suggesting the best cutoff values for each clinical condition. CONCLUSION: Automated analyses of AD biomarkers provide valuable information to support the clinical diagnosis of MCI and AD in research settings.
Subject(s)
Alzheimer Disease , Cognitive Dysfunction , Aged , Alzheimer Disease/diagnosis , Alzheimer Disease/psychology , Amyloid beta-Peptides/cerebrospinal fluid , Biomarkers/cerebrospinal fluid , Cognitive Dysfunction/cerebrospinal fluid , Cognitive Dysfunction/diagnosis , Decision Trees , Humans , tau Proteins/cerebrospinal fluidABSTRACT
Introducción y Objetivo Con el advenimiento de nuevas tecnologías, vienen controversias respecto al espectro de sus aplicaciones. El costo derivado de estas tecnologías juega un papel muy importante en el momento de la toma de decisiones terapéuticas. Es por esto que consideramos relevante estimar la costo-efectividad de la nefrolitotomía percutánea comparada con la nefrolitotomía retrógrada flexible con láser de holmio en pacientes con litiasis renal de 20 mm a 30 mm en Colombia. Materiales y Métodos Por medio de la construcción de un modelo de árbol de decisión usando el programa Treeage (TreeAge Software, LLC, Williamstown, MA, EE.UU.), se realizó una comparación entre la nefrolitotomía percutánea y la nefrolitotomía retrógrada flexible con láser de holmio en pacientes con litiasis renal de 20 mm a 30 mm. La perspectiva fue la del tercer pagador, y se incluyeron los costos directos. Las cifras fueron expresadas en pesos colombianos de 2018. La mejoría clínica, definida como el paciente libre de cálculos, fue la unidad de resultado. Se hizo una extracción de datos de efectividad y seguridad por medio de una revisión sistemática de la literatura. La razón de costo-efectividad incremental fue calculada. Resultados El modelo final indica que la nefrolitotomía percutánea puede ser considerada como la alternativa más costo-efectiva. Los hallazgos fueron sensibles a la probabilidad de mejoría clínica de la nefrolitotomía percutánea. Conclusión Teniendo en cuenta las variables económicas, los supuestos del modelo y desde la perspectiva del tercer pagador, la nefrolitotomía percutánea para el tratamiento de pacientes con cálculos renales de 20 mm a 30 mm es costo-efectiva en nuestro país. Estos hallazgos fueron sensibles a los costos y a la efectividad de los procedimientos quirúrgicos.
Introduction and Objective The advent of new technologies leads to controversies regarding the spectrum of their applications and their cost. The cost of these technologies plays a very important role when making therapeutic decisions. Therefore, we consider it relevant to estimate the cost-effectiveness of percutaneous nephrolithotomy compared with flexible retrograde holmium laser nephrolithotomy in patients with kidney stones of 20 mm to 30 mm in Colombia. Materials and Methods Through the development of a decision tree model using the Treeage (TreeAge Software, LLC, Williamstown, MA, US) software, we compared percutaneous nephrolithotomy with flexible holmium laser retrograde nephrolithotomy in patients with kidney stones of 20 mm to 30 mm. The perspective was that of the third payer, and all direct costs were included. The figures were expressed in terms of 2018 Colombian pesos. Clinical improvement, which was defined as a stone-free patient, was the outcome unit. We extracted data on effectiveness and safety through a systematic review of the literature. The incremental cost-effectiveness ratio was calculated. Results In terms of cost-effectiveness the final model indicates that percutaneous nephrolithotomy may be considered the best alternative. These findings were sensitive to the probability of clinical improvement of the percutaneous nephrolithotomy. Conclusion Taking into account the economic variables, the assumptions of the model, and through the perspective of the third payer, percutaneous nephrolithotomy for the treatment of patients with kidney stones of 20 mm to 30mm is cost-effective in our country. These findings were sensitive to the costs and effectiveness of the surgical procedures.
Subject(s)
Humans , Surgical Procedures, Operative , Costs and Cost Analysis , Nephrolithiasis , Lasers, Solid-State , Nephrolithotomy, Percutaneous , Technology , Effectiveness , Decision Trees , Kidney Calculi , ColombiaABSTRACT
BACKGROUND: We aimed to identify the 2001-2013 incidence trend, and characteristics associated with adolescent pregnancies reported by 20-24-year-old women. METHODS: A retrospective analysis of the Cuatro Santos Northern Nicaragua Health and Demographic Surveillance 2004-2014 data on women aged 15-19 and 20-24. To calculate adolescent birth and pregnancy rates, we used the first live birth at ages 10-14 and 15-19 years reported by women aged 15-19 and 20-24 years, respectively, along with estimates of annual incidence rates reported by women aged 20-24 years. We conducted conditional inference tree analyses using 52 variables to identify characteristics associated with adolescent pregnancies. RESULTS: The number of first live births reported by women aged 20-24 years was 361 during the study period. Adolescent pregnancies and live births decreased from 2004 to 2009 and thereafter increased up to 2014. The adolescent pregnancy incidence (persons-years) trend dropped from 2001 (75.1 per 1000) to 2007 (27.2 per 1000), followed by a steep upward trend from 2007 to 2008 (19.1 per 1000) that increased in 2013 (26.5 per 1000). Associated factors with adolescent pregnancy were living in low-education households, where most adults in the household were working, and high proportion of adolescent pregnancies in the local community. Wealth was not linked to teenage pregnancies. CONCLUSIONS: Interventions to prevent adolescent pregnancy are imperative and must bear into account the context that influences the culture of early motherhood and lead to socioeconomic and health gains in resource-poor settings.
Subject(s)
Pregnancy Rate/trends , Pregnancy in Adolescence/ethnology , Adolescent , Child , Decision Trees , Demography , Family Characteristics/ethnology , Female , Humans , Incidence , Nicaragua/epidemiology , Population Surveillance/methods , Pregnancy , Retrospective Studies , Young AdultABSTRACT
Dementia interferes with the individual's motor, behavioural, and intellectual functions, causing him to be unable to perform instrumental activities of daily living. This study is aimed at identifying the best performing algorithm and the most relevant characteristics to categorise individuals with HIV/AIDS at high risk of dementia from the application of data mining. Principal component analysis (PCA) algorithm was used and tested comparatively between the following machine learning algorithms: logistic regression, decision tree, neural network, KNN, and random forest. The database used for this study was built from the data collection of 270 individuals infected with HIV/AIDS and followed up at the outpatient clinic of a reference hospital for infectious and parasitic diseases in the State of Ceará, Brazil, from January to April 2019. Also, the performance of the algorithms was analysed for the 104 characteristics available in the database; then, with the reduction of dimensionality, there was an improvement in the quality of the machine learning algorithms and identified that during the tests, even losing about 30% of the variation. Besides, when considering only 23 characteristics, the precision of the algorithms was 86% in random forest, 56% logistic regression, 68% decision tree, 60% KNN, and 59% neural network. The random forest algorithm proved to be more effective than the others, obtaining 84% precision and 86% accuracy.
Subject(s)
AIDS Dementia Complex/diagnosis , Acquired Immunodeficiency Syndrome/complications , Algorithms , Dementia/etiology , AIDS Dementia Complex/epidemiology , AIDS Dementia Complex/etiology , Aged , Brazil/epidemiology , Computational Biology , Data Mining/methods , Data Mining/statistics & numerical data , Databases, Factual , Decision Trees , Female , Follow-Up Studies , Humans , Logistic Models , Machine Learning , Male , Middle Aged , Neural Networks, Computer , Risk FactorsABSTRACT
Although decreasing rates of cervical cancer in the U.S. are attributable to health policy, immigrant women, particularly Haitians, experience disproportionate disease burden related to delayed detection and treatment. However, risk prediction and dynamics of access remain largely underexplored and unresolved in this population. This study seeks to assess cervical cancer risk and access of unscreened Haitian women. Extracted and merged from two studies, this sample includes n = 346 at-risk Haitian women in South Florida, the largest U.S. enclave of Haitians (ages 30-65 and unscreened in the previous three years). Three approaches (logistic regression [LR]; classification and regression trees [CART]; and random forest [RF]) were employed to assess the association between screening history and sociodemographic variables. LR results indicated women who reported US citizenship (OR = 3.22, 95% CI = 1.52-6.84), access to routine care (OR = 2.11, 95%CI = 1.04-4.30), and spent more years in the US (OR = 1.01, 95%CI = 1.00-1.03) were significantly more likely to report previous screening. CART results returned an accuracy of 0.75 with a tree initially splitting on women who were not citizens, then on 43 or fewer years in the U.S., and without access to routine care. RF model identified U.S. years, citizenship, and access to routine care as variables of highest importance indicated by greatest mean decreases in Gini index. The model was .79 accurate (95% CI = 0.74-0.84). This multi-pronged analysis identifies previously undocumented barriers to health screening for Haitian women. Recent US immigrants without citizenship or perceived access to routine care may be at higher risk for disease due to barriers in accessing U.S. health-systems.
Subject(s)
Health Services Accessibility , Statistics as Topic , Uterine Cervical Neoplasms/epidemiology , Algorithms , Decision Trees , Female , Florida/epidemiology , Haiti , Humans , Logistic Models , Middle Aged , Multivariate Analysis , Risk FactorsABSTRACT
INTRODUCTION: Several studies demonstrate the therapeutic superiority of thrombolysis plus mechanical thrombectomy versus thrombolysis alone to treat stroke. OBJECTIVE: To analyze the cost-utility of thrombolysis plus mechanical thrombectomy versus thrombolysis in patients with ischemic stroke due to large vessel occlusion. METHODS: Cost-utility analysis. The model used is blended: Decision Tree (first 90 days) and Markov in the long term, of seven health states based on a disease-specific scale, from the Chilean public insurance and societal perspective. Quality-Adjusted Life-Years and costs are evaluated. Deterministic (DSA) and probabilistic (PSA) analyses were carried out. RESULTS: From the public insurance perspective, in the base case, mechanical thrombectomy is associated with lower costs in a lifetime horizon, and with higher benefits (2.63 incremental QALYs, and 1.19 discounted incremental life years), at a Net Monetary Benefit (NMB) of CLP 37,289,874, and an Incremental Cost-Utility Ratio (ICUR) of CLP 3,807,413/QALY. For the scenario that incorporates access to rehabilitation, 2.54 incremental QALYs and 1.13 discounted life years were estimated, resulting in an NMB of CLP 35,670,319 and ICUR of CLP 3,960,624/QALY. In the scenario that incorporates access to long-term care from a societal perspective, the ICUR falls to CLP 951,911/QALY, and the NMB raises to CLP 43,318,072, improving the previous scenarios. In the DSA, health states, starting age, and relative risk of dying were the variables with the greatest influence. The PSA for the base case corroborated the estimates. CONCLUSIONS: Thrombolysis plus mechanical thrombectomy adds quality of life at costs acceptable for decision-makers versus thrombolysis alone. The results are consistent with international studies.
INTRODUCCIÓN: Diversos estudios demuestran la superioridad terapéutica de la trombólisis más trombectomía mecánica, versus trombólisis sola, en el tratamiento del accidente vascular cerebral. OBJETIVOS: Analizar el costo utilidad de la trombólisis más trombectomía versus trombólisis sola en pacientes con accidente vascular cerebral isquémico con oclusión de grandes vasos. MÉTODOS: Evaluación de costo utilidad. Se ha utilizado un modelo mixto: árbol de decisión (primeros 90 días) y Markov en el largo plazo, de siete estados de salud definidos en escala específica de enfermedad, desde la perspectiva del seguro público chileno y societal. Se evalúan costos y años de vida ajustados por calidad. Se realizó análisis de incertidumbre determinístico y probabilístico. RESULTADOS: Bajo la perspectiva de seguro público, en el caso base la trombectomía mecánica se relaciona con menores costos en un horizonte de por vida, con mayores beneficios (2,63 años de vida ajustados por calidad incrementales, y 1,19 años de vida incrementales descontados), a un beneficio monetario neto de $37 289 874 pesos chilenos, y una razón incremental de costo utilidad de $3 807 413 pesos por años de vida ajustados por calidad. Para el escenario que agrega acceso a rehabilitación se estimaron 2,54 años de vida ajustados por calidad incremental y 1,13 años de vida descontados, resultando en un beneficio monetario neto de $35 670 319 pesos y razón incremental de costo utilidad de $3 960 624 pesos por años de vida ajustados por calidad. En el escenario que agrega el efecto de acceso a cuidados de larga duración con perspectiva societal, la razón incremental de costo utilidad cae hasta $951 911 pesos por años de vida ajustados por calidad y el beneficio monetario neto se eleva a $43 318 072 pesos, superando las estimaciones anteriores. En el análisis de incertidumbre determinístico, los estados de salud, edad de inicio de la cohorte y riesgo relativo de morir, fueron las variables con mayor influencia. El análisis de incertidumbre probabilístico para el caso base, corroboró las estimaciones. CONCLUSIONES: La trombólisis más trombectomía mecánica agrega calidad de vida a costos aceptables por el tomador de decisión, versus trombólisis sola. Los resultados son consistentes con los estudios internacionales.
Subject(s)
Ischemic Stroke/therapy , Mechanical Thrombolysis/methods , Thrombectomy/methods , Brain Ischemia/therapy , Cerebrovascular Circulation , Chile , Decision Trees , Health Care Costs , Humans , Ischemic Stroke/etiology , Markov Chains , Mechanical Thrombolysis/economics , Quality of Life , Stroke/therapy , Thrombectomy/economics , Thrombolytic Therapy/economics , Thrombolytic Therapy/methodsABSTRACT
Autism Spectrum Disorder is a mental disorder that afflicts millions of people worldwide. It is estimated that one in 160 children has traces of autism, with five times the higher prevalence in boys. The protocols for detecting symptoms are diverse. However, the following are among the most used: the Diagnostic and Statistical Manual of Mental Disorders, 5th Edition (DSM-5), of the American Psychiatric Association; the Revised Autistic Diagnostic Observation Schedule (ADOS-R); the Autistic Diagnostic Interview (ADI); and the International Classification of Diseases, 10th edition (ICD-10), published by the World Health Organization (WHO) and adopted in Brazil by the Unified Health System (SUS). The application of machine learning models helps make the diagnostic process of Autism Spectrum Disorder more precise, reducing, in many cases, the number of criteria necessary for evaluation, denoting a form of attribute engineering (feature engineering) efficiency. This work proposes a hybrid approach based on machine learning algorithms' composition to discover knowledge and concepts associated with the multicriteria method of decision support based on Verbal Decision Analysis to refine the results. Therefore, the study has the general objective of evaluating how the mentioned hybrid methodology proposal can make the protocol derived from ICD-10 more efficient, providing agility to diagnosing Autism Spectrum Disorder by observing a minor symptom. The study database covers thousands of cases of people who, once diagnosed, obtained government assistance in Brazil.
Subject(s)
Autism Spectrum Disorder/diagnosis , Decision Support Techniques , Diagnosis, Computer-Assisted/methods , Machine Learning , Algorithms , Brazil , Child, Preschool , Computational Biology , Decision Trees , Diagnosis, Computer-Assisted/statistics & numerical data , Diagnostic and Statistical Manual of Mental Disorders , Expert Systems , Female , Humans , Infant , Infant, Newborn , MaleABSTRACT
The sudden outbreak of coronavirus disease 2019 (COVID-19) revealed the need for fast and reliable automatic tools to help health teams. This paper aims to present understandable solutions based on Machine Learning (ML) techniques to deal with COVID-19 screening in routine blood tests. We tested different ML classifiers in a public dataset from the Hospital Albert Einstein, São Paulo, Brazil. After cleaning and pre-processing the data has 608 patients, of which 84 are positive for COVID-19 confirmed by RT-PCR. To understand the model decisions, we introduce (i) a local Decision Tree Explainer (DTX) for local explanation and (ii) a Criteria Graph to aggregate these explanations and portrait a global picture of the results. Random Forest (RF) classifier achieved the best results (accuracy 0.88, F1-score 0.76, sensitivity 0.66, specificity 0.91, and AUROC 0.86). By using DTX and Criteria Graph for cases confirmed by the RF, it was possible to find some patterns among the individuals able to aid the clinicians to understand the interconnection among the blood parameters either globally or on a case-by-case basis. The results are in accordance with the literature and the proposed methodology may be embedded in an electronic health record system.