Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 316.447
Filter
1.
Rev. esp. patol ; 57(2): 77-83, Abr-Jun, 2024. tab, ilus
Article in Spanish | IBECS | ID: ibc-232410

ABSTRACT

Introducción: En un servicio de anatomía patológica se analiza la carga laboral en tiempo médico en función de la complejidad de las muestras recibidas, y se valora su distribución entre los patólogos, presentado un nuevo algoritmo informático que favorece una distribución equitativa. Métodos: Siguiendo las directrices para la «Estimación de la carga de trabajo en citopatología e histopatología (tiempo médico) atendiendo al catálogo de muestras y procedimientos de la SEAP-IAP (2.ª edición)» se determinan las unidades de carga laboral (UCL) por patólogo y UCL global del servicio, la carga media laboral que soporta el servicio (factor MU), el tiempo de dedicación de cada patólogo a la actividad asistencial y el número de patólogos óptimo según la carga laboral del servicio. Resultados: Determinamos 12.197 UCL totales anuales para el patólogo jefe de servicio, así como 14.702 y 13.842 para los patólogos adjuntos, con una UCL global del servicio de 40.742. El factor MU calculado es 4,97. El jefe ha dedicado el 72,25% de su jornada a la asistencia y los adjuntos el 87,09 y 82,01%. El número de patólogos óptimo para el servicio es de 3,55. Conclusiones: Todos los resultados obtenidos demuestran la sobrecarga laboral médica, y la distribución de las UCL entre los patólogos no resulta equitativa. Se propone un algoritmo informático capaz de distribuir la carga laboral de manera equitativa, asociado al sistema de información del laboratorio, y que tenga en cuenta el tipo de muestra, su complejidad y la dedicación asistencial de cada patólogo.(AU)


Introduction: In a pathological anatomy service, the workload in medical time is analyzed based on the complexity of the samples received and its distribution among pathologists is assessed, presenting a new computer algorithm that favors an equitable distribution. Methods: Following the second edition of the Spanish guidelines for the estimation of workload in cytopathology and histopathology (medical time) according to the Spanish Pathology Society-International Academy of Pathology (SEAP-IAP) catalog of samples and procedures, we determined the workload units (UCL) per pathologist and the overall UCL of the service, the average workload of the service (MU factor), the time dedicated by each pathologist to healthcare activity and the optimal number of pathologists according to the workload of the service. Results: We determined 12 197 total annual UCL for the chief pathologist, as well as 14 702 and 13 842 UCL for associate pathologists, with an overall of 40 742 UCL for the whole service. The calculated MU factor is 4.97. The chief pathologist devoted 72.25% of his working day to healthcare activity while associate pathologists dedicated 87.09% and 82.01% of their working hours. The optimal number of pathologists for the service is found to be 3.55. Conclusions: The results demonstrate medical work overload and a non-equitable distribution of UCLs among pathologists. We propose a computer algorithm capable of distributing the workload in an equitable manner. It would be associated with the laboratory information system and take into account the type of specimen, its complexity and the dedication of each pathologist to healthcare activity.(AU)


Subject(s)
Humans , Male , Female , Pathology , Workload , Pathologists , Pathology Department, Hospital , Algorithms
2.
Clin Transl Sci ; 17(5): e13824, 2024 May.
Article in English | MEDLINE | ID: mdl-38752574

ABSTRACT

Accurate prediction of a new compound's pharmacokinetic (PK) profile is pivotal for the success of drug discovery programs. An initial assessment of PK in preclinical species and humans is typically performed through allometric scaling and mathematical modeling. These methods use parameters estimated from in vitro or in vivo experiments, which although helpful for an initial estimation, require extensive animal experiments. Furthermore, mathematical models are limited by the mechanistic underpinning of the drugs' absorption, distribution, metabolism, and elimination (ADME) which are largely unknown in the early stages of drug discovery. In this work, we propose a novel methodology in which concentration versus time profile of small molecules in rats is directly predicted by machine learning (ML) using structure-driven molecular properties as input and thus mitigating the need for animal experimentation. The proposed framework initially predicts ADME properties based on molecular structure and then uses them as input to a ML model to predict the PK profile. For the compounds tested, our results demonstrate that PK profiles can be adequately predicted using the proposed algorithm, especially for compounds with Tanimoto score greater than 0.5, the average mean absolute percentage error between predicted PK profile and observed PK profile data was found to be less than 150%. The suggested framework aims to facilitate PK predictions and thus support molecular screening and design earlier in the drug discovery process.


Subject(s)
Drug Discovery , Machine Learning , Animals , Rats , Drug Discovery/methods , Pharmaceutical Preparations/metabolism , Pharmaceutical Preparations/chemistry , Humans , Models, Biological , Algorithms , Molecular Structure , Pharmacokinetics , Small Molecule Libraries/pharmacokinetics
3.
J Cell Mol Med ; 28(10): e18379, 2024 May.
Article in English | MEDLINE | ID: mdl-38752750

ABSTRACT

Gastric cancer is a prevalent and deadly malignancy, and the response to immunotherapy varies among patients. This study aimed to develop a prognostic model for gastric cancer patients and investigate immune escape mechanisms using deep machine learning and single-cell sequencing analysis. Data from public databases were analysed, and a prediction model was constructed using 101 algorithms. The high-AIDPS group, characterized by increased AIDPS expression, exhibited worse survival, genomic variations and immune cell infiltration. These patients also showed immunotherapy tolerance. Treatment strategies targeting the high-AIDPS group identified three potential drugs. Additionally, distinct cluster groups and upregulated AIDPS-associated genes were observed in gastric adenocarcinoma cell lines. Inhibition of GHRL expression suppressed cancer cell activity, inhibited M2 polarization in macrophages and reduced invasiveness. Overall, AIDPS plays a critical role in gastric cancer prognosis, genomic variations, immune cell infiltration and immunotherapy response, and targeting GHRL expression holds promise for personalized treatment. These findings contribute to improved clinical management in gastric cancer.


Subject(s)
Algorithms , Gene Expression Regulation, Neoplastic , Single-Cell Analysis , Stomach Neoplasms , Humans , Stomach Neoplasms/genetics , Stomach Neoplasms/immunology , Stomach Neoplasms/pathology , Single-Cell Analysis/methods , Prognosis , Tumor Escape/genetics , Cell Line, Tumor , Immunotherapy/methods , Biomarkers, Tumor/genetics , Machine Learning
4.
PLoS One ; 19(5): e0302641, 2024.
Article in English | MEDLINE | ID: mdl-38753596

ABSTRACT

The development of automated tools using advanced technologies like deep learning holds great promise for improving the accuracy of lung nodule classification in computed tomography (CT) imaging, ultimately reducing lung cancer mortality rates. However, lung nodules can be difficult to detect and classify, from CT images since different imaging modalities may provide varying levels of detail and clarity. Besides, the existing convolutional neural network may struggle to detect nodules that are small or located in difficult-to-detect regions of the lung. Therefore, the attention pyramid pooling network (APPN) is proposed to identify and classify lung nodules. First, a strong feature extractor, named vgg16, is used to obtain features from CT images. Then, the attention primary pyramid module is proposed by combining the attention mechanism and pyramid pooling module, which allows for the fusion of features at different scales and focuses on the most important features for nodule classification. Finally, we use the gated spatial memory technique to decode the general features, which is able to extract more accurate features for classifying lung nodules. The experimental results on the LIDC-IDRI dataset show that the APPN can achieve highly accurate and effective for classifying lung nodules, with sensitivity of 87.59%, specificity of 90.46%, accuracy of 88.47%, positive predictive value of 95.41%, negative predictive value of 76.29% and area under receiver operating characteristic curve of 0.914.


Subject(s)
Lung Neoplasms , Neural Networks, Computer , Tomography, X-Ray Computed , Humans , Lung Neoplasms/diagnostic imaging , Lung Neoplasms/diagnosis , Tomography, X-Ray Computed/methods , Deep Learning , Solitary Pulmonary Nodule/diagnostic imaging , Solitary Pulmonary Nodule/diagnosis , Multiple Pulmonary Nodules/diagnostic imaging , Multiple Pulmonary Nodules/diagnosis , Algorithms , Lung/diagnostic imaging , Lung/pathology , Radiographic Image Interpretation, Computer-Assisted/methods
5.
PLoS One ; 19(5): e0295106, 2024.
Article in English | MEDLINE | ID: mdl-38753609

ABSTRACT

Camouflage is a widespread and well-studied anti-predator strategy, yet identifying which patterns provide optimal protection in any given scenario remains challenging. Besides the virtually limitless combinations of colours and patterns available to prey, selection for camouflage strategies will depend on complex interactions between prey appearance, background properties and predator traits, across repeated encounters between co-evolving predators and prey. Experiments in artificial evolution, pairing psychophysics detection tasks with genetic algorithms, offer a promising way to tackle this complexity, but sophisticated genetic algorithms have so far been restricted to screen-based experiments. Here, we present methods to test the evolution of colour patterns on physical prey items, under selection from wild predators in the field. Our techniques expand on a recently-developed open-access pattern generation and genetic algorithm framework, modified to operate alongside artificial predation experiments. In this system, predators freely interact with prey, and the order of attack determines the survival and reproduction of prey patterns into future generations. We demonstrate the feasibility of these methods with a case study, in which free-flying birds feed on artificial prey deployed in semi-natural conditions, against backgrounds differing in three-dimensional complexity. Wild predators reliably participated in this experiment, foraging for 11 to 16 generations of artificial prey and encountering a total of 1,296 evolved prey items. Changes in prey pattern across generations indicated improvements in several metrics of similarity to the background, and greater edge disruption, although effect sizes were relatively small. Computer-based replicates of these trials, with human volunteers, highlighted the importance of starting population parameters for subsequent evolution, a key consideration when applying these methods. Ultimately, these methods provide pathways for integrating complex genetic algorithms into more naturalistic predation trials. Customisable open-access tools should facilitate application of these tools to investigate a wide range of visual pattern types in more ecologically-relevant contexts.


Subject(s)
Algorithms , Biological Evolution , Predatory Behavior , Animals , Predatory Behavior/physiology , Birds/physiology , Selection, Genetic
6.
PLoS One ; 19(5): e0301862, 2024.
Article in English | MEDLINE | ID: mdl-38753628

ABSTRACT

Recognition of the key text of the Chinese seal can speed up the approval of documents, and improve the office efficiency of enterprises or government administrative departments. Due to image blurring and occlusion, the accuracy of Chinese seal recognition is low. In addition, the real dataset is very limited. In order to solve these problems, we improve the differentiable binarization detection algorithm (DBnet) to construct a model DB-ECA for text region detection, and propose a model named LSTR (Lightweight Seal Text Recognition) for text recognition. The efficient channel attention module is added to the differentiable binarization network to solve the feature pyramid conflict, and the convolutional layer network structure is improved to delay downsampling for reducing semantic feature loss. LSTR uses a lightweight CNN more suitable for small-sample generalization, and dynamically fuses positional and visual information through a self-attention-based inference layer to predict the label distribution of feature sequences in parallel. The inference layer not only solves the weak discriminative power of CNN in the shallow layer, but also facilitates CTC (Connectionist Temporal Classification) to accurately align the feature region with the target character. Experiments on the homemade dataset in this paper, DB-ECA compared with the other five commonly used detection models, the precision, recall, F-measure are the best effect of 90.29, 85.17, 87.65, respectively. LSTR compared with the other five kinds of recognition models in the last three years, to achieve the highest effect of accuracy 91.29%, and has the advantages of a small number of parameters and fast inference. The experimental results fully prove the innovation and effectiveness of our model.


Subject(s)
Algorithms , Neural Networks, Computer , Pattern Recognition, Automated/methods
7.
PLoS One ; 19(5): e0301975, 2024.
Article in English | MEDLINE | ID: mdl-38753654

ABSTRACT

In this paper, the Integrated Nested Laplace Algorithm (INLA) is applied to the Epidemic Type Aftershock Sequence (ETAS) model, and the parameters of the ETAS model are obtained for the earthquake sequences active in different regions of Xinjiang. By analyzing the characteristics of the model parameters over time, the changes in each earthquake sequence are studied in more detail. The estimated values of the ETAS model parameters are used as inputs to forecast strong aftershocks in the next period. We find that there are significant differences in the aftershock triggering capacity and aftershock attenuation capacity of earthquake sequences in different seismic regions of Xinjiang. With different cutoff dates set, we observe the characteristics of the earthquake sequence parameters changing with time after the mainshock occurs, and the model parameters of the Ms7.3 earthquake sequence in Hotan region change significantly with time within 15 days after the earthquake. Compared with the MCMC algorithm, the ETAS model fitted with the INLA algorithm can forecast the number of earthquakes in the early period after the occurrence of strong aftershocks more effectively and can forecast the sudden occurrence time of earthquakes more accurately.


Subject(s)
Algorithms , Earthquakes , Forecasting , China , Forecasting/methods , Humans , Models, Theoretical , Spatio-Temporal Analysis
8.
PLoS One ; 19(5): e0300374, 2024.
Article in English | MEDLINE | ID: mdl-38753659

ABSTRACT

Combustible gas concentration detection faces challenges of increasing accuracy, and sensitivity, as well as high reliability in harsh using environments. The special design of the optical path structure of the sensitive element provides an opportunity to improve combustible gas concentration detection. In this study, the optical path structure of the sensitive element was newly designed based on the Pyramidal beam splitter matrix. The infrared light source was modulated by multi-frequency point signal superimposed modulation technology. At the same time, concentration detection results and confidence levels were calculated using the 4-channel combustible gas concentration detection algorithm based on spectral refinement. Through experiment, it is found that the sensor enables full-range measurement of CH4, at the lower explosive limit (LEL, CH4 LEL of 5%), the reliability level is 0.01 parts-per-million (PPM), and the sensor sensitivity is up to 0.5PPM. The sensor is still capable of achieving PPM-level detections, under extreme conditions in which the sensor's optical window is covered by 2/3, and humidity is 85% or dust concentration is 100mg/m3. Those improve the sensitivity, robustness, reliability, and accuracy of the sensor.


Subject(s)
Gases , Gases/analysis , Algorithms , Reproducibility of Results , Environmental Monitoring/instrumentation , Environmental Monitoring/methods , Equipment Design
9.
PLoS One ; 19(5): e0296909, 2024.
Article in English | MEDLINE | ID: mdl-38753667

ABSTRACT

The time fractional Schrödinger equation contributes to our understanding of complex quantum systems, anomalous diffusion processes, and the application of fractional calculus in physics and cubic B-spline is a versatile tool in numerical analysis and computer graphics. This paper introduces a numerical method for solving the time fractional Schrödinger equation using B-spline functions and the Atangana-Baleanu fractional derivative. The proposed method employs a finite difference scheme to discretize the fractional derivative in time, while a θ-weighted scheme is used to discretize the space directions. The efficiency of the method is demonstrated through numerical results, and error norms are examined at various values of the non-integer parameter, temporal directions, and spatial directions.


Subject(s)
Algorithms , Models, Theoretical , Quantum Theory , Computer Simulation
10.
PLoS One ; 19(5): e0303084, 2024.
Article in English | MEDLINE | ID: mdl-38753685

ABSTRACT

The advent of smart grid technologies has brought about a paradigm shift in the management and operation of distribution networks, allowing for intricate system information to be encapsulated within semantic network models. These models, while robust, are not immune to faults within their knowledge entities, which can arise from a myriad of issues, potentially leading to verification failures and operational disruptions. Addressing this critical vulnerability, our research delves into the development of a novel fault detection methodology specifically tailored for the knowledge entity variables of semantic networks in distribution networks. In our approach, we first construct a state space equation that models the behavior of knowledge entity variables in the presence of faults. This foundational framework enables us to apply an unknown input observer strategy to effectively detect anomalies within the system. To bolster the fault identification process, we introduce the innovative use of a siamese network, a neural network architecture which is proficient in differentiating between similar datasets. Through simulation scenarios, we demonstrate the efficacy of our proposed fault detection method.


Subject(s)
Neural Networks, Computer , Semantics , Algorithms , Computer Simulation
11.
PLoS One ; 19(5): e0299386, 2024.
Article in English | MEDLINE | ID: mdl-38753678

ABSTRACT

Malaria is the most common cause of death among the parasitic diseases. Malaria continues to pose a growing threat to the public health and economic growth of nations in the tropical and subtropical parts of the world. This study aims to address this challenge by developing a predictive model for malaria outbreaks in each district of The Gambia, leveraging historical meteorological data. To achieve this objective, we employ and compare the performance of eight machine learning algorithms, including C5.0 decision trees, artificial neural networks, k-nearest neighbors, support vector machines with linear and radial kernels, logistic regression, extreme gradient boosting, and random forests. The models are evaluated using 10-fold cross-validation during the training phase, repeated five times to ensure robust validation. Our findings reveal that extreme gradient boosting and decision trees exhibit the highest prediction accuracy on the testing set, achieving 93.3% accuracy, followed closely by random forests with 91.5% accuracy. In contrast, the support vector machine with a linear kernel performs less favorably, showing a prediction accuracy of 84.8% and underperforming in specificity analysis. Notably, the integration of both climatic and non-climatic features proves to be a crucial factor in accurately predicting malaria outbreaks in The Gambia.


Subject(s)
Disease Outbreaks , Machine Learning , Malaria , Support Vector Machine , Gambia/epidemiology , Humans , Malaria/epidemiology , Neural Networks, Computer , Algorithms
12.
PLoS One ; 19(5): e0301505, 2024.
Article in English | MEDLINE | ID: mdl-38753696

ABSTRACT

In the era of computational advancements, harnessing computer algorithms for approximating solutions to differential equations has become indispensable for its unparalleled productivity. The numerical approximation of partial differential equation (PDE) models holds crucial significance in modelling physical systems, driving the necessity for robust methodologies. In this article, we introduce the Implicit Six-Point Block Scheme (ISBS), employing a collocation approach for second-order numerical approximations of ordinary differential equations (ODEs) derived from one or two-dimensional physical systems. The methodology involves transforming the governing PDEs into a fully-fledged system of algebraic ordinary differential equations by employing ISBS to replace spatial derivatives while utilizing a central difference scheme for temporal or y-derivatives. In this report, the convergence properties of ISBS, aligning with the principles of multi-step methods, are rigorously analyzed. The numerical results obtained through ISBS demonstrate excellent agreement with theoretical solutions. Additionally, we compute absolute errors across various problem instances, showcasing the robustness and efficacy of ISBS in practical applications. Furthermore, we present a comprehensive comparative analysis with existing methodologies from recent literature, highlighting the superior performance of ISBS. Our findings are substantiated through illustrative tables and figures, underscoring the transformative potential of ISBS in advancing the numerical approximation of two-dimensional PDEs in physical systems.


Subject(s)
Algorithms , Models, Theoretical , Computer Simulation
13.
PLoS One ; 19(5): e0302129, 2024.
Article in English | MEDLINE | ID: mdl-38753705

ABSTRACT

Emerging technologies focused on the detection and quantification of circulating tumor DNA (ctDNA) in blood show extensive potential for managing patient treatment decisions, informing risk of recurrence, and predicting response to therapy. Currently available tissue-informed approaches are often limited by the need for additional sequencing of normal tissue or peripheral mononuclear cells to identify non-tumor-derived alterations while tissue-naïve approaches are often limited in sensitivity. Here we present the analytical validation for a novel ctDNA monitoring assay, FoundationOne®Tracker. The assay utilizes somatic alterations from comprehensive genomic profiling (CGP) of tumor tissue. A novel algorithm identifies monitorable alterations with a high probability of being somatic and computationally filters non-tumor-derived alterations such as germline or clonal hematopoiesis variants without the need for sequencing of additional samples. Monitorable alterations identified from tissue CGP are then quantified in blood using a multiplex polymerase chain reaction assay based on the validated SignateraTM assay. The analytical specificity of the plasma workflow is shown to be 99.6% at the sample level. Analytical sensitivity is shown to be >97.3% at ≥5 mean tumor molecules per mL of plasma (MTM/mL) when tested with the most conservative configuration using only two monitorable alterations. The assay also demonstrates high analytical accuracy when compared to liquid biopsy-based CGP as well as high qualitative (measured 100% PPA) and quantitative precision (<11.2% coefficient of variation).


Subject(s)
Circulating Tumor DNA , Neoplasms , Humans , Circulating Tumor DNA/blood , Circulating Tumor DNA/genetics , Neoplasms/genetics , Neoplasms/blood , Neoplasms/diagnosis , Genomics/methods , Biomarkers, Tumor/blood , Biomarkers, Tumor/genetics , Sensitivity and Specificity , Algorithms , Multiplex Polymerase Chain Reaction/methods , Liquid Biopsy/methods
14.
PLoS One ; 19(5): e0302507, 2024.
Article in English | MEDLINE | ID: mdl-38753712

ABSTRACT

Diagnosing lung diseases accurately and promptly is essential for effectively managing this significant public health challenge on a global scale. This paper introduces a new framework called Modified Segnet-based Lung Disease Segmentation and Severity Classification (MSLDSSC). The MSLDSSC model comprises four phases: "preprocessing, segmentation, feature extraction, and classification." Initially, the input image undergoes preprocessing using an improved Wiener filter technique. This technique estimates the power spectral density of the noisy and original images and computes the SNR assisted by PSNR to evaluate image quality. Next, the preprocessed image undergoes Segmentation to identify and separate the RoI from the background objects in the lung image. We employ a Modified Segnet mechanism that utilizes a proposed hard tanh-Softplus activation function for effective Segmentation. Following Segmentation, features such as MLDN, entropy with MRELBP, shape features, and deep features are extracted. Following the feature extraction phase, the retrieved feature set is input into a hybrid severity classification model. This hybrid model comprises two classifiers: SDPA-Squeezenet and DCNN. These classifiers train on the retrieved feature set and effectively classify the severity level of lung diseases.


Subject(s)
Lung Diseases , Tomography, X-Ray Computed , Humans , Lung Diseases/diagnostic imaging , Lung Diseases/classification , Tomography, X-Ray Computed/methods , Neural Networks, Computer , Lung/diagnostic imaging , Lung/pathology , Algorithms , Image Processing, Computer-Assisted/methods
15.
PeerJ ; 12: e17340, 2024.
Article in English | MEDLINE | ID: mdl-38756444

ABSTRACT

Introduction: This study aimed to evaluate the prognosis of patients with COVID-19 and hypertension who were treated with angiotensin-converting enzyme inhibitor (ACEI)/angiotensin receptor B (ARB) drugs and to identify key features affecting patient prognosis using an unsupervised learning method. Methods: A large-scale clinical dataset, including patient information, medical history, and laboratory test results, was collected. Two hundred patients with COVID-19 and hypertension were included. After cluster analysis, patients were divided into good and poor prognosis groups. The unsupervised learning method was used to evaluate clinical characteristics and prognosis, and patients were divided into different prognosis groups. The improved wild dog optimization algorithm (IDOA) was used for feature selection and cluster analysis, followed by the IDOA-k-means algorithm. The impact of ACEI/ARB drugs on patient prognosis and key characteristics affecting patient prognosis were also analysed. Results: Key features related to prognosis included baseline information and laboratory test results, while clinical symptoms and imaging results had low predictive power. The top six important features were age, hypertension grade, MuLBSTA, ACEI/ARB, NT-proBNP, and high-sensitivity troponin I. These features were consistent with the results of the unsupervised prediction model. A visualization system was developed based on these key features. Conclusion: Using unsupervised learning and the improved k-means algorithm, this study accurately analysed the prognosis of patients with COVID-19 and hypertension. The use of ACEI/ARB drugs was found to be a protective factor for poor clinical prognosis. Unsupervised learning methods can be used to differentiate patient populations and assess treatment effects. This study identified important features affecting patient prognosis and developed a visualization system with clinical significance for prognosis assessment and treatment decision-making.


Subject(s)
Angiotensin Receptor Antagonists , Angiotensin-Converting Enzyme Inhibitors , COVID-19 , Hypertension , SARS-CoV-2 , Unsupervised Machine Learning , Humans , Hypertension/drug therapy , Angiotensin-Converting Enzyme Inhibitors/therapeutic use , Male , Prognosis , Retrospective Studies , Female , Middle Aged , Angiotensin Receptor Antagonists/therapeutic use , Aged , COVID-19 Drug Treatment , Algorithms , Cluster Analysis
16.
J Med Internet Res ; 26: e55913, 2024 May 17.
Article in English | MEDLINE | ID: mdl-38758578

ABSTRACT

BACKGROUND: Suicide is the second-leading cause of death among adolescents and is associated with clusters of suicides. Despite numerous studies on this preventable cause of death, the focus has primarily been on single nations and traditional statistical methods. OBJECTIVE: This study aims to develop a predictive model for adolescent suicidal thinking using multinational data sets and machine learning (ML). METHODS: We used data from the Korea Youth Risk Behavior Web-based Survey with 566,875 adolescents aged between 13 and 18 years and conducted external validation using the Youth Risk Behavior Survey with 103,874 adolescents and Norway's University National General Survey with 19,574 adolescents. Several tree-based ML models were developed, and feature importance and Shapley additive explanations values were analyzed to identify risk factors for adolescent suicidal thinking. RESULTS: When trained on the Korea Youth Risk Behavior Web-based Survey data from South Korea with a 95% CI, the XGBoost model reported an area under the receiver operating characteristic (AUROC) curve of 90.06% (95% CI 89.97-90.16), displaying superior performance compared to other models. For external validation using the Youth Risk Behavior Survey data from the United States and the University National General Survey from Norway, the XGBoost model achieved AUROCs of 83.09% and 81.27%, respectively. Across all data sets, XGBoost consistently outperformed the other models with the highest AUROC score, and was selected as the optimal model. In terms of predictors of suicidal thinking, feelings of sadness and despair were the most influential, accounting for 57.4% of the impact, followed by stress status at 19.8%. This was followed by age (5.7%), household income (4%), academic achievement (3.4%), sex (2.1%), and others, which contributed less than 2% each. CONCLUSIONS: This study used ML by integrating diverse data sets from 3 countries to address adolescent suicide. The findings highlight the important role of emotional health indicators in predicting suicidal thinking among adolescents. Specifically, sadness and despair were identified as the most significant predictors, followed by stressful conditions and age. These findings emphasize the critical need for early diagnosis and prevention of mental health issues during adolescence.


Subject(s)
Machine Learning , Suicidal Ideation , Humans , Adolescent , Female , Male , Republic of Korea , Algorithms , Cohort Studies , Adolescent Behavior/psychology , Suicide/statistics & numerical data , Suicide/psychology , Norway , Surveys and Questionnaires , Risk Factors , Risk-Taking
17.
Genome Biol ; 25(1): 130, 2024 May 21.
Article in English | MEDLINE | ID: mdl-38773520

ABSTRACT

Bulk DNA sequencing of multiple samples from the same tumor is becoming common, yet most methods to infer copy-number aberrations (CNAs) from this data analyze individual samples independently. We introduce HATCHet2, an algorithm to identify haplotype- and clone-specific CNAs simultaneously from multiple bulk samples. HATCHet2 extends the earlier HATCHet method by improving identification of focal CNAs and introducing a novel statistic, the minor haplotype B-allele frequency (mhBAF), that enables identification of mirrored-subclonal CNAs. We demonstrate HATCHet2's improved accuracy using simulations and a single-cell sequencing dataset. HATCHet2 analysis of 10 prostate cancer patients reveals previously unreported mirrored-subclonal CNAs affecting cancer genes.


Subject(s)
Algorithms , DNA Copy Number Variations , Haplotypes , Prostatic Neoplasms , Humans , Prostatic Neoplasms/genetics , Male , Sequence Analysis, DNA/methods , Neoplasms/genetics , Gene Frequency , Single-Cell Analysis
18.
Lipids Health Dis ; 23(1): 152, 2024 May 21.
Article in English | MEDLINE | ID: mdl-38773573

ABSTRACT

BACKGROUND: Alzheimer's disease (AD) is a chronic neurodegenerative disorder that poses a substantial economic burden. The Random forest algorithm is effective in predicting AD; however, the key factors influencing AD onset remain unclear. This study aimed to analyze the key lipoprotein and metabolite factors influencing AD onset using machine-learning methods. It provides new insights for researchers and medical personnel to understand AD and provides a reference for the early diagnosis, treatment, and early prevention of AD. METHODS: A total of 603 participants, including controls and patients with AD with complete lipoprotein and metabolite data from the Alzheimer's disease Neuroimaging Initiative (ADNI) database between 2005 and 2016, were enrolled. Random forest, Lasso regression, and CatBoost algorithms were employed to rank and filter 213 lipoprotein and metabolite variables. Variables with consistently high importance rankings from any two methods were incorporated into the models. Finally, the variables selected from the three methods, with the participants' age, sex, and marital status, were used to construct a random forest predictive model. RESULTS: Fourteen lipoprotein and metabolite variables were screened using the three methods, and 17 variables were included in the AD prediction model based on age, sex, and marital status of the participants. The optimal random forest modeling was constructed with "mtry" set to 3 and "ntree" set to 300. The model exhibited an accuracy of 71.01%, a sensitivity of 79.59%, a specificity of 65.28%, and an AUC (95%CI) of 0.724 (0.645-0.804). When Mean Decrease Accuracy and Gini were used to rank the proteins, age, phospholipids to total lipids ratio in intermediate-density lipoproteins (IDL_PL_PCT), and creatinine were among the top five variables. CONCLUSIONS: Age, IDL_PL_PCT, and creatinine levels play crucial roles in AD onset. Regular monitoring of lipoproteins and their metabolites in older individuals is significant for early AD diagnosis and prevention.


Subject(s)
Alzheimer Disease , Lipoproteins , Machine Learning , Humans , Alzheimer Disease/diagnosis , Alzheimer Disease/blood , Alzheimer Disease/metabolism , Female , Male , Aged , Lipoproteins/blood , Aged, 80 and over , Algorithms , Biomarkers/blood
19.
J R Soc Interface ; 21(214): 20230732, 2024 May.
Article in English | MEDLINE | ID: mdl-38774958

ABSTRACT

The concept of an autocatalytic network of reactions that can form and persist, starting from just an available food source, has been formalized by the notion of a reflexively autocatalytic and food-generated (RAF) set. The theory and algorithmic results concerning RAFs have been applied to a range of settings, from metabolic questions arising at the origin of life, to ecological networks, and cognitive models in cultural evolution. In this article, we present new structural and algorithmic results concerning RAF sets, by studying more complex modes of catalysis that allow certain reactions to require multiple catalysts (or to not require catalysis at all), and discuss the differing ways catalysis has been viewed in the literature. We also focus on the structure and analysis of minimal RAFs and derive structural results and polynomial-time algorithms. We then apply these new methods to a large metabolic network to gain insights into possible biochemical scenarios near the origin of life.


Subject(s)
Algorithms , Catalysis , Models, Biological , Biochemistry , Origin of Life
20.
Technol Cancer Res Treat ; 23: 15330338241250324, 2024.
Article in English | MEDLINE | ID: mdl-38775067

ABSTRACT

Advancements in AI have notably changed cancer research, improving patient care by enhancing detection, survival prediction, and treatment efficacy. This review covers the role of Machine Learning, Soft Computing, and Deep Learning in oncology, explaining key concepts and algorithms (like SVM, Naïve Bayes, and CNN) in a clear, accessible manner. It aims to make AI advancements understandable to a broad audience, focusing on their application in diagnosing, classifying, and predicting various cancer types, thereby underlining AI's potential to better patient outcomes. Moreover, we present a tabular summary of the most significant advances from the literature, offering a time-saving resource for readers to grasp each study's main contributions. The remarkable benefits of AI-powered algorithms in cancer care underscore their potential for advancing cancer research and clinical practice. This review is a valuable resource for researchers and clinicians interested in the transformative implications of AI in cancer care.


Subject(s)
Algorithms , Artificial Intelligence , Neoplasms , Humans , Neoplasms/diagnosis , Neoplasms/therapy , Biomedical Research , Machine Learning
SELECTION OF CITATIONS
SEARCH DETAIL
...