Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 39.823
Filtrar
1.
Cephalalgia ; 44(5): 3331024241251488, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38690640

RESUMO

BACKGROUND: We aimed to develop the first machine learning models to predict citation counts and the translational impact, defined as inclusion in guidelines or policy documents, of headache research, and assess which factors are most predictive. METHODS: Bibliometric data and the titles, abstracts, and keywords from 8600 publications in three headache-oriented journals from their inception to 31 December 2017 were used. A series of machine learning models were implemented to predict three classes of 5-year citation count intervals (0-5, 6-14 and, >14 citations); and the translational impact of a publication. Models were evaluated out-of-sample with area under the receiver operating characteristics curve (AUC). RESULTS: The top performing gradient boosting model predicted correct citation count class with an out-of-sample AUC of 0.81. Bibliometric data such as page count, number of references, first and last author citation counts and h-index were among the most important predictors. Prediction of translational impact worked optimally when including both bibliometric data and information from the title, abstract and keywords, reaching an out-of-sample AUC of 0.71 for the top performing random forest model. CONCLUSION: Citation counts are best predicted by bibliometric data, while models incorporating both bibliometric data and publication content identifies the translational impact of headache research.


Assuntos
Bibliometria , Cefaleia , Aprendizado de Máquina , Humanos , Pesquisa Biomédica/estatística & dados numéricos , Pesquisa Translacional Biomédica , Fator de Impacto de Revistas
2.
J Med Virol ; 96(5): e29657, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38727035

RESUMO

The H1N1pdm09 virus has been a persistent threat to public health since the 2009 pandemic. Particularly, since the relaxation of COVID-19 pandemic mitigation measures, the influenza virus and SARS-CoV-2 have been concurrently prevalent worldwide. To determine the antigenic evolution pattern of H1N1pdm09 and develop preventive countermeasures, we collected influenza sequence data and immunological data to establish a new antigenic evolution analysis framework. A machine learning model (XGBoost, accuracy = 0.86, area under the receiver operating characteristic curve = 0.89) was constructed using epitopes, physicochemical properties, receptor binding sites, and glycosylation sites as features to predict the antigenic similarity relationships between influenza strains. An antigenic correlation network was constructed, and the Markov clustering algorithm was used to identify antigenic clusters. Subsequently, the antigenic evolution pattern of H1N1pdm09 was analyzed at the global and regional scales across three continents. We found that H1N1pdm09 evolved into around five antigenic clusters between 2009 and 2023 and that their antigenic evolution trajectories were characterized by cocirculation of multiple clusters, low-level persistence of former dominant clusters, and local heterogeneity of cluster circulations. Furthermore, compared with the seasonal H1N1 virus, the potential cluster-transition determining sites of H1N1pdm09 were restricted to epitopes Sa and Sb. This study demonstrated the effectiveness of machine learning methods for characterizing antigenic evolution of viruses, developed a specific model to rapidly identify H1N1pdm09 antigenic variants, and elucidated their evolutionary patterns. Our findings may provide valuable support for the implementation of effective surveillance strategies and targeted prevention efforts to mitigate the impact of H1N1pdm09.


Assuntos
Antígenos Virais , Vírus da Influenza A Subtipo H1N1 , Influenza Humana , Vírus da Influenza A Subtipo H1N1/genética , Vírus da Influenza A Subtipo H1N1/imunologia , Humanos , Influenza Humana/epidemiologia , Influenza Humana/prevenção & controle , Influenza Humana/virologia , Influenza Humana/imunologia , Antígenos Virais/genética , Antígenos Virais/imunologia , Aprendizado de Máquina , Evolução Molecular , Epitopos/genética , Epitopos/imunologia , COVID-19/epidemiologia , COVID-19/prevenção & controle , COVID-19/virologia , COVID-19/imunologia , Pandemias/prevenção & controle , Glicoproteínas de Hemaglutininação de Vírus da Influenza/genética , Glicoproteínas de Hemaglutininação de Vírus da Influenza/imunologia , SARS-CoV-2/genética , SARS-CoV-2/imunologia
3.
Sci Rep ; 14(1): 10833, 2024 05 12.
Artigo em Inglês | MEDLINE | ID: mdl-38734835

RESUMO

Our aim was to develop a machine learning-based predictor for early mortality and severe intraventricular hemorrhage (IVH) in very-low birth weight (VLBW) preterm infants in Taiwan. We collected retrospective data from VLBW infants, dividing them into two cohorts: one for model development and internal validation (Cohort 1, 2016-2021), and another for external validation (Cohort 2, 2022). Primary outcomes included early mortality, severe IVH, and early poor outcomes (a combination of both). Data preprocessing involved 23 variables, with the top four predictors identified as gestational age, birth body weight, 5-min Apgar score, and endotracheal tube ventilation. Six machine learning algorithms were employed. Among 7471 infants analyzed, the selected predictors consistently performed well across all outcomes. Logistic regression and neural network models showed the highest predictive performance (AUC 0.81-0.90 in both internal and external validation) and were well-calibrated, confirmed by calibration plots and the lowest two mean Brier scores (0.0685 and 0.0691). We developed a robust machine learning-based outcome predictor using only four accessible variables, offering valuable prognostic information for parents and aiding healthcare providers in decision-making.


Assuntos
Recém-Nascido Prematuro , Recém-Nascido de muito Baixo Peso , Aprendizado de Máquina , Humanos , Recém-Nascido , Feminino , Masculino , Estudos Retrospectivos , Taiwan/epidemiologia , Lactente , Prognóstico , Hemorragia Cerebral/mortalidade , Idade Gestacional , Hemorragia Cerebral Intraventricular/mortalidade , Hemorragia Cerebral Intraventricular/epidemiologia , Mortalidade Infantil , Peso ao Nascer , Doenças do Prematuro/mortalidade
4.
PLoS One ; 19(5): e0303287, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38739586

RESUMO

Globally, stroke is the third-leading cause of mortality and disability combined, and one of the costliest diseases in society. More accurate predictions of stroke outcomes can guide healthcare organizations in allocating appropriate resources to improve care and reduce both the economic and social burden of the disease. We aim to develop and evaluate the performance and explainability of three supervised machine learning models and the traditional multinomial logistic regression (mLR) in predicting functional dependence and death three months after stroke, using routinely-collected data. This prognostic study included adult patients, registered in the Swedish Stroke Registry (Riksstroke) from 2015 to 2020. Riksstroke contains information on stroke care and outcomes among patients treated in hospitals in Sweden. Prognostic factors (features) included demographic characteristics, pre-stroke functional status, cardiovascular risk factors, medications, acute care, stroke type, and severity. The outcome was measured using the modified Rankin Scale at three months after stroke (a scale of 0-2 indicates independent, 3-5 dependent, and 6 dead). Outcome prediction models included support vector machines, artificial neural networks (ANN), eXtreme Gradient Boosting (XGBoost), and mLR. The models were trained and evaluated on 75% and 25% of the dataset, respectively. Model predictions were explained using SHAP values. The study included 102,135 patients (85.8% ischemic stroke, 53.3% male, mean age 75.8 years, and median NIHSS of 3). All models demonstrated similar overall accuracy (69%-70%). The ANN and XGBoost models performed significantly better than the mLR in classifying dependence with F1-scores of 0.603 (95% CI; 0.594-0.611) and 0.577 (95% CI; 0.568-0.586), versus 0.544 (95% CI; 0.545-0.563) for the mLR model. The factors that contributed most to the predictions were expectedly similar in the models, based on clinical knowledge. Our ANN and XGBoost models showed a modest improvement in prediction performance and explainability compared to mLR using routinely-collected data. Their improved ability to predict functional dependence may be of particular importance for the planning and organization of acute stroke care and rehabilitation.


Assuntos
Aprendizado de Máquina , Acidente Vascular Cerebral , Humanos , Suécia/epidemiologia , Masculino , Feminino , Acidente Vascular Cerebral/fisiopatologia , Idoso , Idoso de 80 Anos ou mais , Prognóstico , Pessoa de Meia-Idade , Sistema de Registros , Máquina de Vetores de Suporte , Modelos Logísticos , Redes Neurais de Computação , Fatores de Risco
5.
PLoS One ; 19(5): e0302685, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38739633

RESUMO

BACKGROUND: Primary pancreatic signet ring cell carcinoma (PSRCC), an extremely rare histologic variant of pancreatic cancer, has a poor prognosis. This study aimed to investigate the prognostic value of chemotherapy in PSRCC. METHODS: Patients with PSRCC between 2000 and 2019 were identified using the Surveillance Epidemiology and End Results (SEER) database. The main outcomes in this study were cancer-specific survival (CSS) and overall survival (OS). The baseline characteristics of patients were compared using Pearson's Chi-square test. Kaplan-Meier analysis was used to generate the survival curves. Least absolute shrinkage and selection operator (LASSO), univariate and multivariate Cox regression models, and Random Survival Forest model were used to analyze the prognostic variables for OS and CSS. The variance inflation factors (VIFs) were used to analyze whether there was an overfitting problem. RESULTS: A total of 588 patients were identified. Chemotherapy was an independent prognostic factor for OS and CSS, and significantly associated with OS (HR = 0.33, 95% CI = 0.27-0.40, P <0.001) and CSS (HR = 0.32, 95% CI = 0.26-0.39, P <0.001). CONCLUSIONS: Chemotherapy showed beneficial effects on OS and CSS in patients with PSRCC and should be recommended in clinical practice.


Assuntos
Carcinoma de Células em Anel de Sinete , Aprendizado de Máquina , Neoplasias Pancreáticas , Humanos , Carcinoma de Células em Anel de Sinete/tratamento farmacológico , Carcinoma de Células em Anel de Sinete/patologia , Carcinoma de Células em Anel de Sinete/mortalidade , Feminino , Masculino , Pessoa de Meia-Idade , Prognóstico , Neoplasias Pancreáticas/tratamento farmacológico , Neoplasias Pancreáticas/mortalidade , Neoplasias Pancreáticas/patologia , Idoso , Estimativa de Kaplan-Meier , Programa de SEER , Adulto , Modelos de Riscos Proporcionais
6.
PLoS One ; 19(5): e0303101, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38739642

RESUMO

This research study aims to understand the application of Artificial Neural Networks (ANNs) to forecast the Self-Compacting Recycled Coarse Aggregate Concrete (SCRCAC) compressive strength. From different literature, 602 available data sets from SCRCAC mix designs are collected, and the data are rearranged, reconstructed, trained and tested for the ANN model development. The models were established using seven input variables: the mass of cementitious content, water, natural coarse aggregate content, natural fine aggregate content, recycled coarse aggregate content, chemical admixture and mineral admixture used in the SCRCAC mix designs. Two normalization techniques are used for data normalization to visualize the data distribution. For each normalization technique, three transfer functions are used for modelling. In total, six different types of models were run in MATLAB and used to estimate the 28th day SCRCAC compressive strength. Normalization technique 2 performs better than 1 and TANSING is the best transfer function. The best k-fold cross-validation fold is k = 7. The coefficient of determination for predicted and actual compressive strength is 0.78 for training and 0.86 for testing. The impact of the number of neurons and layers on the model was performed. Inputs from standards are used to forecast the 28th day compressive strength. Apart from ANN, Machine Learning (ML) techniques like random forest, extra trees, extreme boosting and light gradient boosting techniques are adopted to predict the 28th day compressive strength of SCRCAC. Compared to ML, ANN prediction shows better results in terms of sensitive analysis. The study also extended to determine 28th day compressive strength from experimental work and compared it with 28th day compressive strength from ANN best model. Standard and ANN mix designs have similar fresh and hardened properties. The average compressive strength from ANN model and experimental results are 39.067 and 38.36 MPa, respectively with correlation coefficient is 1. It appears that ANN can validly predict the compressive strength of concrete.


Assuntos
Força Compressiva , Materiais de Construção , Aprendizado de Máquina , Redes Neurais de Computação , Materiais de Construção/análise , Reciclagem
7.
PLoS One ; 19(5): e0302639, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38739639

RESUMO

Heart failure (HF) encompasses a diverse clinical spectrum, including instances of transient HF or HF with recovered ejection fraction, alongside persistent cases. This dynamic condition exhibits a growing prevalence and entails substantial healthcare expenditures, with anticipated escalation in the future. It is essential to classify HF patients into three groups based on their ejection fraction: reduced (HFrEF), mid-range (HFmEF), and preserved (HFpEF), such as for diagnosis, risk assessment, treatment choice, and the ongoing monitoring of heart failure. Nevertheless, obtaining a definitive prediction poses challenges, requiring the reliance on echocardiography. On the contrary, an electrocardiogram (ECG) provides a straightforward, quick, continuous assessment of the patient's cardiac rhythm, serving as a cost-effective adjunct to echocardiography. In this research, we evaluate several machine learning (ML)-based classification models, such as K-nearest neighbors (KNN), neural networks (NN), support vector machines (SVM), and decision trees (TREE), to classify left ventricular ejection fraction (LVEF) for three categories of HF patients at hourly intervals, using 24-hour ECG recordings. Information from heterogeneous group of 303 heart failure patients, encompassing HFpEF, HFmEF, or HFrEF classes, was acquired from a multicenter dataset involving both American and Greek populations. Features extracted from ECG data were employed to train the aforementioned ML classification models, with the training occurring in one-hour intervals. To optimize the classification of LVEF levels in coronary artery disease (CAD) patients, a nested cross-validation approach was employed for hyperparameter tuning. HF patients were best classified using TREE and KNN models, with an overall accuracy of 91.2% and 90.9%, and average area under the curve of the receiver operating characteristics (AUROC) of 0.98, and 0.99, respectively. Furthermore, according to the experimental findings, the time periods of midnight-1 am, 8-9 am, and 10-11 pm were the ones that contributed to the highest classification accuracy. The results pave the way for creating an automated screening system tailored for patients with CAD, utilizing optimal measurement timings aligned with their circadian cycles.


Assuntos
Eletrocardiografia , Insuficiência Cardíaca , Aprendizado de Máquina , Volume Sistólico , Função Ventricular Esquerda , Humanos , Insuficiência Cardíaca/fisiopatologia , Insuficiência Cardíaca/diagnóstico , Feminino , Masculino , Eletrocardiografia/métodos , Idoso , Função Ventricular Esquerda/fisiologia , Pessoa de Meia-Idade , Ritmo Circadiano/fisiologia , Máquina de Vetores de Suporte , Redes Neurais de Computação
8.
PLoS One ; 19(5): e0303366, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38739676

RESUMO

This study presents a novel approach to modeling the velocity-time curve in 100m sprinting by integrating machine learning algorithms. It critically addresses the limitations of traditional speed models, which often require extensive and intricate data collection, by proposing a more accessible and accurate method using fewer variables. The research utilized data from various international track events from 1987 to 2019. Two machine learning models, Random Forest (RF) and Neural Network (NN), were employed to predict the velocity-time curve, focusing on the acceleration phase of the sprint. The models were evaluated against the traditional exponential speed model using Mean Squared Error (MSE), with the NN model demonstrating superior performance. Additionally, the study explored the correlation between maximum velocity, the time of maximum velocity occurrence, the duration of the maximum speed phase, and the overall 100m sprint time. The findings indicate a strong negative correlation between maximum velocity and final time, offering new insights into the dynamics of sprinting performance. This research contributes significantly to the field of sports science, particularly in optimizing training and performance analysis in sprinting.


Assuntos
Desempenho Atlético , Aprendizado de Máquina , Corrida , Humanos , Corrida/fisiologia , Desempenho Atlético/fisiologia , Redes Neurais de Computação , Algoritmos , Aceleração
9.
PLoS One ; 19(5): e0303644, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38753740

RESUMO

BACKGROUND: Parkinson's Disease is the second most common neurological disease in over 60s. Cognitive impairment is a major clinical symptom, with risk of severe dysfunction up to 20 years post-diagnosis. Processes for detection and diagnosis of cognitive impairments are not sufficient to predict decline at an early stage for significant impact. Ageing populations, neurologist shortages and subjective interpretations reduce the effectiveness of decisions and diagnoses. Researchers are now utilising machine learning for detection and diagnosis of cognitive impairment based on symptom presentation and clinical investigation. This work aims to provide an overview of published studies applying machine learning to detecting and diagnosing cognitive impairment, evaluate the feasibility of implemented methods, their impacts, and provide suitable recommendations for methods, modalities and outcomes. METHODS: To provide an overview of the machine learning techniques, data sources and modalities used for detection and diagnosis of cognitive impairment in Parkinson's Disease, we conducted a review of studies published on the PubMed, IEEE Xplore, Scopus and ScienceDirect databases. 70 studies were included in this review, with the most relevant information extracted from each. From each study, strategy, modalities, sources, methods and outcomes were extracted. RESULTS: Literatures demonstrate that machine learning techniques have potential to provide considerable insight into investigation of cognitive impairment in Parkinson's Disease. Our review demonstrates the versatility of machine learning in analysing a wide range of different modalities for the detection and diagnosis of cognitive impairment in Parkinson's Disease, including imaging, EEG, speech and more, yielding notable diagnostic accuracy. CONCLUSIONS: Machine learning based interventions have the potential to glean meaningful insight from data, and may offer non-invasive means of enhancing cognitive impairment assessment, providing clear and formidable potential for implementation of machine learning into clinical practice.


Assuntos
Disfunção Cognitiva , Aprendizado de Máquina , Doença de Parkinson , Humanos , Doença de Parkinson/diagnóstico , Doença de Parkinson/complicações , Disfunção Cognitiva/diagnóstico
10.
Sci Rep ; 14(1): 11233, 2024 05 16.
Artigo em Inglês | MEDLINE | ID: mdl-38755269

RESUMO

Automated disease diagnosis and prediction, powered by AI, play a crucial role in enabling medical professionals to deliver effective care to patients. While such predictive tools have been extensively explored in resource-rich languages like English, this manuscript focuses on predicting disease categories automatically from symptoms documented in the Afaan Oromo language, employing various classification algorithms. This study encompasses machine learning techniques such as support vector machines, random forests, logistic regression, and Naïve Bayes, as well as deep learning approaches including LSTM, GRU, and Bi-LSTM. Due to the unavailability of a standard corpus, we prepared three data sets with different numbers of patient symptoms arranged into 10 categories. The two feature representations, TF-IDF and word embedding, were employed. The performance of the proposed methodology has been evaluated using accuracy, recall, precision, and F1 score. The experimental results show that, among machine learning models, the SVM model using TF-IDF had the highest accuracy and F1 score of 94.7%, while the LSTM model using word2vec embedding showed an accuracy rate of 95.7% and F1 score of 96.0% from deep learning models. To enhance the optimal performance of each model, several hyper-parameter tuning settings were used. This study shows that the LSTM model verifies to be the best of all the other models over the entire dataset.


Assuntos
Aprendizado Profundo , Humanos , Etiópia , Máquina de Vetores de Suporte , Idioma , Algoritmos , Aprendizado de Máquina , Teorema de Bayes , Inteligência Artificial
11.
PLoS One ; 19(5): e0303610, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38758931

RESUMO

We have previously shown that polygenic risk scores (PRS) can improve risk stratification of peripheral artery disease (PAD) in a large, retrospective cohort. Here, we evaluate the potential of PRS in improving the detection of PAD and prediction of major adverse cardiovascular and cerebrovascular events (MACCE) and adverse events (AE) in an institutional patient cohort. We created a cohort of 278 patients (52 cases and 226 controls) and fit a PAD-specific PRS based on the weighted sum of risk alleles. We built traditional clinical risk models and machine learning (ML) models using clinical and genetic variables to detect PAD, MACCE, and AE. The models' performances were measured using the area under the curve (AUC), net reclassification index (NRI), integrated discrimination improvement (IDI), and Brier score. We also evaluated the clinical utility of our PAD model using decision curve analysis (DCA). We found a modest, but not statistically significant improvement in the PAD detection model's performance with the inclusion of PRS from 0.902 (95% CI: 0.846-0.957) (clinical variables only) to 0.909 (95% CI: 0.856-0.961) (clinical variables with PRS). The PRS inclusion significantly improved risk re-classification of PAD with an NRI of 0.07 (95% CI: 0.002-0.137), p = 0.04. For our ML model predicting MACCE, the addition of PRS did not significantly improve the AUC, however, NRI analysis demonstrated significant improvement in risk re-classification (p = 2e-05). Decision curve analysis showed higher net benefit of our combined PRS-clinical model across all thresholds of PAD detection. Including PRS to a clinical PAD-risk model was associated with improvement in risk stratification and clinical utility, although we did not see a significant change in AUC. This result underscores the potential clinical utility of incorporating PRS data into clinical risk models for prevalent PAD and the need for use of evaluation metrics that can discern the clinical impact of using new biomarkers in smaller populations.


Assuntos
Doença Arterial Periférica , Humanos , Doença Arterial Periférica/genética , Doença Arterial Periférica/diagnóstico , Feminino , Masculino , Idoso , Pessoa de Meia-Idade , Medição de Risco/métodos , Fatores de Risco , Aprendizado de Máquina , Doenças Cardiovasculares/genética , Doenças Cardiovasculares/diagnóstico , Estudos Retrospectivos , Herança Multifatorial/genética , Estudos de Casos e Controles , Área Sob a Curva , Estratificação de Risco Genético
12.
Zhong Nan Da Xue Xue Bao Yi Xue Ban ; 49(2): 207-219, 2024 Feb 28.
Artigo em Inglês, Chinês | MEDLINE | ID: mdl-38755717

RESUMO

OBJECTIVES: Abnormal immune system activation and inflammation are crucial in causing Parkinson's disease. However, we still don't fully understand how certain immune-related genes contribute to the disease's development and progression. This study aims to screen key immune-related gene in Parkinson's disease based on weighted gene co-expression network analysis (WGCNA) and machine learning. METHODS: This study downloaded the gene chip data from the Gene Expression Omnibus (GEO) database, and used WGCNA to screen out important gene modules related to Parkinson's disease. Genes from important modules were exported and a Venn diagram of important Parkinson's disease-related genes and immune-related genes was drawn to screen out immune related genes of Parkinson's disease. Gene ontology (GO) analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) were used to analyze the the functions of immune-related genes and signaling pathways involved. Immune cell infiltration analysis was performed using the CIBERSORT package of R language. Using bioinformatics method and 3 machine learning methods [least absolute shrinkage and selection operator (LASSO) regression, random forest (RF), and support vector machine (SVM)], the immune-related genes of Parkinson's disease were further screened. A Venn diagram of differentially expressed genes screened using the 4 methods was drawn with the intersection gene being hub nodes (hub) gene. The downstream proteins of the Parkinson's disease hub gene was identified through the STRING database and a protein-protein interaction network diagram was drawn. RESULTS: A total of 218 immune genes related to Parkinson's disease were identified, including 45 upregulated genes and 50 downregulated genes. Enrichment analysis showed that the 218 genes were mainly enriched in immune system response to foreign substances and viral infection pathways. The results of immune infiltration analysis showed that the infiltration percentages of CD4+ T cells, NK cells, CD8+ T cells, and B cells were higher in the samples of Parkinson's disease patients, while resting NK cells and resting CD4+ T cells were significantly infiltrated in the samples of Parkinson's disease patients. ANK1 was screened out as the hub gene. The analysis of the protein-protein interaction network showed that the ANK1 translated and expressed 11 proteins which mainly participated in functions such as signal transduction, iron homeostasis regulation, and immune system activation. CONCLUSIONS: This study identifies the Parkinson's disease immune-related key gene ANK1 via WGCNA and machine learning methods, suggesting its potential as a candidate therapeutic target for Parkinson's disease.


Assuntos
Redes Reguladoras de Genes , Aprendizado de Máquina , Doença de Parkinson , Doença de Parkinson/genética , Doença de Parkinson/imunologia , Humanos , Perfilação da Expressão Gênica , Biologia Computacional/métodos , Ontologia Genética , Bases de Dados Genéticas , Transdução de Sinais/genética , Análise de Sequência com Séries de Oligonucleotídeos
13.
Environ Monit Assess ; 196(6): 515, 2024 May 06.
Artigo em Inglês | MEDLINE | ID: mdl-38709284

RESUMO

Drought events threaten freshwater reservoirs and agricultural productivity, particularly in semi-arid regions characterized by erratic rainfall. This study evaluates a novel technique for assessing the impact of drought on LULC variations in the context of climate change from 2018 to 2022. Various data sources were harnessed, encompassing Sentinel-2 satellite imagery for LULC classification, climate data from the CHIRPS and AgERA5 databases, geomorphological data from JAXA's ALOS satellite, and a drought indicator (Vegetation Health Index (VHI)) derived from MODIS data. Two classifier models, namely gradient tree boost (GTB) and random forest (RF), were trained and assessed for LULC classification, with performance evaluated by overall accuracy (OA) and kappa coefficient (K). Notably, the GTB model exhibited superior performance, with OA > 90% and a K > 0.9. Over the period from 2018 to 2022, Fez experienced LULC changes of 19.92% expansion in built-up areas, a 34.86% increase in bare land, a 17.86% reduction in water bodies, and a 37.30% decrease in agricultural land. Positive correlations of 0.81 and 0.89 were observed between changes in agricultural LULC, rainfall, and VHI. Furthermore, mild drought conditions were identified in the years 2020 and 2022. This study emphasizes the importance of AI and remote sensing techniques in assessing drought and environmental changes, with potential applications for improving existing drought monitoring systems.


Assuntos
Agricultura , Secas , Monitoramento Ambiental , Aprendizado de Máquina , Tecnologia de Sensoriamento Remoto , Agricultura/métodos , Monitoramento Ambiental/métodos , Mudança Climática , Imagens de Satélites
14.
BMC Med Inform Decis Mak ; 24(1): 116, 2024 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-38698395

RESUMO

BACKGROUND: Machine learning (ML) classifiers are increasingly used for predicting cardiovascular disease (CVD) and related risk factors using omics data, although these outcomes often exhibit categorical nature and class imbalances. However, little is known about which ML classifier, omics data, or upstream dimension reduction strategy has the strongest influence on prediction quality in such settings. Our study aimed to illustrate and compare different machine learning strategies to predict CVD risk factors under different scenarios. METHODS: We compared the use of six ML classifiers in predicting CVD risk factors using blood-derived metabolomics, epigenetics and transcriptomics data. Upstream omic dimension reduction was performed using either unsupervised or semi-supervised autoencoders, whose downstream ML classifier performance we compared. CVD risk factors included systolic and diastolic blood pressure measurements and ultrasound-based biomarkers of left ventricular diastolic dysfunction (LVDD; E/e' ratio, E/A ratio, LAVI) collected from 1,249 Finnish participants, of which 80% were used for model fitting. We predicted individuals with low, high or average levels of CVD risk factors, the latter class being the most common. We constructed multi-omic predictions using a meta-learner that weighted single-omic predictions. Model performance comparisons were based on the F1 score. Finally, we investigated whether learned omic representations from pre-trained semi-supervised autoencoders could improve outcome prediction in an external cohort using transfer learning. RESULTS: Depending on the ML classifier or omic used, the quality of single-omic predictions varied. Multi-omics predictions outperformed single-omics predictions in most cases, particularly in the prediction of individuals with high or low CVD risk factor levels. Semi-supervised autoencoders improved downstream predictions compared to the use of unsupervised autoencoders. In addition, median gains in Area Under the Curve by transfer learning compared to modelling from scratch ranged from 0.09 to 0.14 and 0.07 to 0.11 units for transcriptomic and metabolomic data, respectively. CONCLUSIONS: By illustrating the use of different machine learning strategies in different scenarios, our study provides a platform for researchers to evaluate how the choice of omics, ML classifiers, and dimension reduction can influence the quality of CVD risk factor predictions.


Assuntos
Doenças Cardiovasculares , Aprendizado de Máquina , Humanos , Pessoa de Meia-Idade , Masculino , Feminino , Fatores de Risco de Doenças Cardíacas , Adulto , Metabolômica , Idoso , Fatores de Risco , Medição de Risco , Finlândia , Multiômica
15.
BMC Med Inform Decis Mak ; 24(1): 115, 2024 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-38698412

RESUMO

BACKGROUND: Glaucoma, the second leading cause of global blindness, demands timely detection due to its asymptomatic progression. This paper introduces an advanced computerized system, integrates Machine Learning (ML), convolutional neural networks (CNNs), and image processing for accurate glaucoma detection using medical imaging data, surpassing prior research efforts. METHOD: Developing a hybrid glaucoma detection framework using CNNs (ResNet50, VGG-16) and Random Forest. Models analyze pre-processed retinal images independently, and post-processing rules combine predictions for an overall glaucoma impact assessment. RESULT: The hybrid framework achieves a significant 95.41% accuracy, with precision and recall at 99.37% and 88.37%, respectively. The F1 score, balancing precision and recall, reaches a commendable 93.52%. These results highlight the robustness and effectiveness of the hybrid framework in accurate glaucoma diagnosis. CONCLUSION: In summary, our research presents an innovative hybrid framework combining CNNs and traditional ML models for glaucoma detection. Using ResNet50, VGG-16, and Random Forest in an ensemble approach yields remarkable accuracy, precision, recall, and F1 score. These results showcase the methodology's potential to enhance glaucoma diagnosis, emphasizing its promising role in early detection and preventing irreversible vision loss. The integration of ML and DNNs in medical imaging analysis suggests a valuable path for future advancements in ophthalmic healthcare.


Assuntos
Aprendizado Profundo , Glaucoma , Aprendizado de Máquina , Humanos , Glaucoma/diagnóstico por imagem , Glaucoma/diagnóstico , Redes Neurais de Computação
16.
Stud Health Technol Inform ; 314: 42-46, 2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38785001

RESUMO

This study focuses on the complex interplay of healthcare, economic factors, and population dynamics, addressing a research gap in regional-level models that integrate diverse features within a temporal framework. Our primary objective is to develop an advanced temporal model for predicting cardiovascular mortality in Russian regions by integrating global and local healthcare features with economic and population dynamics. Utilizing a dataset from the Almazov Center's Department of Mortality Performance Monitoring, covering 94 regions and 752 records from January 1, 2015, to December 31, 2023, our analysis incorporates key parameters such as angioplasty procedures, population morbidity rates, Ischemic Heart Disease (IHD) and Cardiovascular Diseases (CVD) monitoring, and demographic data. Employing XGBoost and a regression model, our methodology ensures the model's robustness and generalizability.


Assuntos
Doenças Cardiovasculares , Previsões , Aprendizado de Máquina , Humanos , Doenças Cardiovasculares/mortalidade , Federação Russa/epidemiologia
17.
Stud Health Technol Inform ; 314: 108-112, 2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38785013

RESUMO

The growing integration of Internet of Things (IoT) technology within the healthcare sector has revolutionized healthcare delivery, enabling advanced personalized care and precise treatments. However, this raises significant challenges, demanding robust, intelligible, and effective monitoring mechanisms. We propose an interpretable machine-learning approach to the trustworthy and effective detection of behavioral anomalies within the realm of medical IoT. The discovered anomalies serve as indicators of potential system failures and security threats. Essentially, the detection of anomalies is accomplished by learning a classifier from the operational data generated by smart devices. The learning problem is dealt with in predictive association modeling, whose expressiveness and intelligibility enforce trustworthiness to offer a comprehensive, fully interpretable, and effective monitoring solution for the medical IoT ecosystem. Preliminary results show the effectiveness of our approach.


Assuntos
Internet das Coisas , Aprendizado de Máquina , Medicina de Precisão , Humanos , Segurança Computacional
18.
Stud Health Technol Inform ; 314: 93-97, 2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38785010

RESUMO

Inconsistent disease coding standards in medicine create hurdles in data exchange and analysis. This paper proposes a machine learning system to address this challenge. The system automatically matches unstructured medical text (doctor notes, complaints) to ICD-10 codes. It leverages a unique architecture featuring a training layer for model development and a knowledge base that captures relationships between symptoms and diseases. Experiments using data from a large medical research center demonstrated the system's effectiveness in disease classification prediction. Logistic regression emerged as the optimal model due to its superior processing speed, achieving an accuracy of 81.07% with acceptable error rates during high-load testing. This approach offers a promising solution to improve healthcare informatics by overcoming coding standard incompatibility and automating code prediction from unstructured medical text.


Assuntos
Registros Eletrônicos de Saúde , Classificação Internacional de Doenças , Aprendizado de Máquina , Processamento de Linguagem Natural , Humanos , Codificação Clínica
19.
Stud Health Technol Inform ; 314: 127-131, 2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38785018

RESUMO

This study explores endometrial cancer (EC) within the broader context of oncogynecology, focusing on 3,845 EC patients at the Almazov National Research Center. The research analyzes clinical data, employing machine learning techniques like random forest regression and decision tree analysis. Key findings include age-dependent impacts on EC outcomes, unexpected correlations between dietary habits and recurrence risk (e.g., higher risk for vegans), and intriguing associations like soft drink consumption influencing relapse. Despite limitations like a retrospective design and self-reported data, the study's extended eight-year follow-up and robust database enhance its credibility. The nuanced insights into EC risk factors, influenced by factors like physical activity and diet, open avenues for targeted diagnostics and prevention strategies, showcasing the potential of machine learning in predicting outcomes.


Assuntos
Neoplasias do Endométrio , Aprendizado de Máquina , Humanos , Feminino , Neoplasias do Endométrio/mortalidade , Pessoa de Meia-Idade , Fatores de Risco , Idoso , Prognóstico , Análise de Sobrevida , Estudos Retrospectivos
20.
Methods Mol Biol ; 2726: 45-83, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38780727

RESUMO

Several different ways to predict RNA secondary structures have been suggested in the literature. Statistical methods, such as those that utilize stochastic context-free grammars (SCFGs), or approaches based on machine learning aim to predict the best representative structure for the underlying ensemble of possible conformations. Their parameters have therefore been trained on larger subsets of well-curated, known secondary structures. Physics-based methods, on the other hand, usually refrain from using optimized parameters. They model secondary structures from loops as individual building blocks which have been assigned a physical property instead: the free energy of the respective loop. Such free energies are either derived from experiments or from mathematical modeling. This rigorous use of physical properties then allows for the application of statistical mechanics to describe the entire state space of RNA secondary structures in terms of equilibrium probabilities. On that basis, and by using efficient algorithms, many more descriptors of the conformational state space of RNA molecules can be derived to investigate and explain the many functions of RNA molecules. Moreover, compared to other methods, physics-based models allow for a much easier extension with other properties that can be measured experimentally. For instance, small molecules or proteins can bind to an RNA and their binding affinity can be assessed experimentally. Under certain conditions, existing RNA secondary structure prediction tools can be used to model this RNA-ligand binding and to eventually shed light on its impact on structure formation and function.


Assuntos
Conformação de Ácido Nucleico , RNA , Termodinâmica , RNA/química , Algoritmos , Biologia Computacional/métodos , Aprendizado de Máquina , Modelos Moleculares
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...