Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 5.293
Filter
1.
medRxiv ; 2024 Jun 20.
Article in English | MEDLINE | ID: mdl-38946964

ABSTRACT

Background: The use of big data and large language models in healthcare can play a key role in improving patient treatment and healthcare management, especially when applied to large-scale administrative data. A major challenge to achieving this is ensuring that patient confidentiality and personal information is protected. One way to overcome this is by augmenting clinical data with administrative laboratory dataset linkages in order to avoid the use of demographic information. Methods: We explored an alternative method to examine patient files from a large administrative dataset in South Africa (the National Health Laboratory Services, or NHLS), by linking external data to the NHLS database using specimen barcodes associated with laboratory tests. This offers us with a deterministic way of performing data linkages without accessing demographic information. In this paper, we quantify the performance metrics of this approach. Results: The linkage of the large NHLS data to external hospital data using specimen barcodes achieved a 95% success. Out of the 1200 records in the validation sample, 87% were exact matches and 9% were matches with typographic correction. The remaining 5% were either complete mismatches or were due to duplicates in the administrative data. Conclusions: The high success rate indicates the reliability of using barcodes for linking data without demographic identifiers. Specimen barcodes are an effective tool for deterministic linking in health data, and may provide a method of creating large, linked data sets without compromising patient confidentiality.

2.
Front Public Health ; 12: 1389641, 2024.
Article in English | MEDLINE | ID: mdl-38952731

ABSTRACT

Aims: To assess the impact of the COVID-19 pandemic on the health condition of people ≥75 years of age and on their family caregivers in Spain. Design: Multicentric, mixed method concurrent study. Methods: This work, which will be conducted within the primary care setting in 11 administrative regions of Spain, will include three coordinated studies with different methodologies. The first is a population-based cohort study that will use real-life data to analyze the rates and evolution of health needs, care provision, and services utilization before, during, and after the pandemic. The second is a prospective cohort study with 18 months of follow-up that will evaluate the impact of COVID-19 disease on mortality, frailty, functional and cognitive capacity, and quality of life of the participants. Finally, the third will be a qualitative study with a critical social approach to understand and interpret the social, political, and economic dimensions associated with the use of health services during the pandemic. We have followed the SPIRIT Checklist to address trial protocol and related documents. This research is being funded by the Instituto de Salud Carlos III since 2021 and was approved by its ethics committee (June 2022). Discussion: The study findings will reveal the long-term impact of the COVID-19 pandemic on the older adults and their caregivers. This information will serve policymakers to adapt health policies to the needs of this population in situations of maximum stress, such as that produced by the COVID-19 pandemic. Trial Registration: Identifier: NCT05249868 [ClinicalTrials.gov].


Subject(s)
COVID-19 , Self Care , Humans , COVID-19/epidemiology , Spain/epidemiology , Aged , Prospective Studies , Caregivers/statistics & numerical data , Caregivers/psychology , Female , Aged, 80 and over , Quality of Life , Male , Health Status , SARS-CoV-2 , Pandemics , Primary Health Care/statistics & numerical data
3.
Pediatr Blood Cancer ; : e31140, 2024 Jul 02.
Article in English | MEDLINE | ID: mdl-38956808

ABSTRACT

BACKGROUND: Direct oral anticoagulants (DOACs) have had significant impact on the management of venous thromboembolism (VTE) in adults, but these agents were not approved for use in pediatric patients until 2021. Our objective was to analyze the characteristics of pediatric patients treated with DOACs prior to and following U.S. Food and Drug Administration (FDA) approval for children and evaluate their impact on hospital outcomes. PROCEDURE: We utilized the Epic Cosmos dataset (Cosmos), a de-identified dataset of over 220 million patients, to identify patients aged 1-18 years admitted with a first-occurrence diagnosis of VTE between January 1, 2017 and June 30, 2023. Patients were grouped by anticoagulation received (unfractionated heparin, low molecular weight heparin, and/or DOACs). RESULTS: Among 5138 eligible patients, 18.1% received DOACs as all or part of their anticoagulation treatment, while 81.9% received heparin therapies alone. Patients treated with DOACs were older than patients treated with heparin monotherapy at 17.4 and 13.0 years, respectively. Non-DOAC patients were more likely to have chronic conditions and were less likely to have pulmonary embolism. Patients treated with DOACs demonstrated shorter overall length of stay and duration of intensive care unit (ICU) admission. CONCLUSIONS: DOACs remain infrequently utilized in pediatric patients, especially in those under 13 years old. Initiation on heparin therapy and transition to DOACs remains common, with 80.6% of DOAC patients receiving heparin during their hospitalization. While DOAC monotherapy is not currently endorsed as first-line therapy for DVT or PE in children, it is being used clinically. Further research is needed to clarify the impact of DOAC use on patient adherence, VTE recurrence, and healthcare cost.

4.
Heliyon ; 10(12): e33191, 2024 Jun 30.
Article in English | MEDLINE | ID: mdl-39022026

ABSTRACT

In modern society, people's pace of life is fast, and the pressure is enormous, leading to increasingly prominent issues such as obesity and sub-health. Traditional fitness methods cannot meet people's needs to a certain extent. Therefore, this work aims to use technology to change people's lifestyles and compensate for traditional fitness methods' shortcomings. Firstly, this work overviews neurorobotics, providing neural perception and control functions for aerobics intelligent fitness system. Secondly, the connection between big data and machine learning (ML), big data technology products, and the ML process are discussed. The Spark big data platform builds node data for calculation, and the decision tree algorithm is used for data preprocessing. These are important for future intelligent fitness analysis. This work proposes an aerobics intelligent fitness system based on neurorobotics technology and big data analysis and develops a recommendation system for the best fitness exercise. This system utilizes neural perception and control functions, combined with big data and ML technology, to solve the obesity and sub-health problems faced by people in fast-paced and high-pressure lifestyles. By harnessing the computational capabilities of the Spark big data platform and applying the decision tree algorithm for data preprocessing, the system can furnish users with personalized fitness plans and optimization recommendations. This work conducts a model performance study on 35 % aerobic fitness data on intelligent fitness Android v1.0.8 to evaluate the system's data processing ability and training effectiveness. Moreover, the aerobics intelligent fitness system models based on neurorobotics, big data, and ML are evaluated. The results indicate that normalizing the data using the Min-Max method leads to a decrease in the F1 value and a reduction in data set errors. Consequently, the dataset studied by the system model is beneficial to improving the work efficiency of the aerobics intelligent fitness system. After the comprehensive human quality of the system model is evaluated, the actual average score of the comprehensive human quality of the 13 users tested before the aerobics intelligent fitness system test is 91.44, and the average prediction score is 90.88. The results of the two tests are similar. Thus, using the intelligent fitness system can enable the user to obtain system feedback according to the actual training effect, thereby playing a guiding role in the intelligent fitness of aerobics for the user. This work designs and implements the aerobics intelligent fitness system close to the human body's training effect, further enhancing the specialization and individualization of sports and fitness.

5.
Front Public Health ; 12: 1414076, 2024.
Article in English | MEDLINE | ID: mdl-39022418

ABSTRACT

While healthcare big data brings great opportunities and convenience to the healthcare industry, it also inevitably raises the issue of privacy leakage. Nowadays, the whole world is facing the security threat of healthcare big data, for which a sound policy framework can help reduce privacy risks of healthcare big data. In recent years, the Chinese government and industry self-regulatory organizations have issued a series of policy documents to reduce privacy risks of healthcare big data. However, China's policy framework suffers from the drawbacks of the mismatched operational model, the inappropriate operational method, and the poorly actionable operational content. Based on the experiences of the European Union, Australia, the United States, and other extra-territorial regions, strategies are proposed for China to amend the operational model of the policy framework, improve the operational method of the policy framework, and enhance the operability of the operational content of the policy framework. This study enriches the research on China's policy framework to reduce privacy risks of healthcare big data and provides some inspiration for other countries.


Subject(s)
Big Data , Health Policy , China , Humans , Privacy , Confidentiality , Computer Security
6.
Yale J Biol Med ; 97(2): 239-245, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38947107

ABSTRACT

Community-based participatory research (CBPR) using barbershop interventions is an emerging approach to address health disparities and promote health equity. Barbershops serve as trusted community settings for health education, screening services, and referrals. This narrative mini-review provides an overview of the current state of knowledge regarding CBPR employing barbershop interventions and explores the potential for big data involvement to enhance the impact and reach of this approach in combating chronic disease. CBPR using barbershop interventions has shown promising results in reducing blood pressure among Black men and improving diabetes awareness and self-management. By increasing testing rates and promoting preventive behaviors, barbershop interventions have been successful in addressing infectious diseases, including HIV and COVID-19. Barbershops have also played roles in promoting cancer screening and increasing awareness of cancer risks, namely prostate cancer and colorectal cancer. Further, leveraging the trusted relationships between barbers and their clients, mental health promotion and prevention efforts have been successful in barbershops. The potential for big data involvement in barbershop interventions for chronic disease management offers new opportunities for targeted programs, real-time monitoring, and personalized approaches. However, ethical considerations regarding privacy, confidentiality, and data ownership need to be carefully addressed. To maximize the impact of barbershop interventions, challenges such as training and resource provision for barbers, cultural appropriateness of interventions, sustainability, and scalability must be addressed. Further research is needed to evaluate long-term impact, cost-effectiveness, and best practices for implementation. Overall, barbershops have the potential to serve as key partners in addressing chronic health disparities and promoting health equity.


Subject(s)
Big Data , Humans , Chronic Disease/prevention & control , Community-Based Participatory Research , Health Promotion/methods , COVID-19/prevention & control , COVID-19/epidemiology , Barbering , SARS-CoV-2
7.
Ann Lab Med ; 2024 Jul 02.
Article in English | MEDLINE | ID: mdl-38953115

ABSTRACT

Background: Healthcare 4.0. refers to the integration of advanced technologies, such as artificial intelligence (AI) and big data analysis, into the healthcare sector. Recognizing the impact of Healthcare 4.0 technologies in laboratory medicine (LM), we seek to assess the overall awareness and implementation of Healthcare 4.0 among members of the Korean Society for Laboratory Medicine (KSLM). Methods: A web-based survey was conducted using an anonymous questionnaire. The survey comprised 36 questions covering demographic information (seven questions), big data (10 questions), and AI (19 questions). Results: In total, 182 (17.9%) of 1,017 KSLM members participated in the survey. Thirty-two percent of respondents considered AI to be the most important technology in LM in the era of Healthcare 4.0, closely followed by 31% who favored big data. Approximately 80% of respondents were familiar with big data but had not conducted research using it, and 71% were willing to participate in future big data research conducted by the KSLM. Respondents viewed AI as the most valuable tool in molecular genetics within various divisions. More than half of the respondents were open to the notion of using AI as assistance rather than a complete replacement for their roles. Conclusions: This survey highlighted KSLM members' awareness of the potential applications and implications of big data and AI. We emphasize the complexity of AI integration in healthcare, citing technical and ethical challenges leading to diverse opinions on its impact on employment and training. This highlights the need for a holistic approach to adopting new technologies.

8.
BioData Min ; 17(1): 22, 2024 Jul 12.
Article in English | MEDLINE | ID: mdl-38997749

ABSTRACT

BACKGROUND: The use of machine learning in medical diagnosis and treatment has grown significantly in recent years with the development of computer-aided diagnosis systems, often based on annotated medical radiology images. However, the lack of large annotated image datasets remains a major obstacle, as the annotation process is time-consuming and costly. This study aims to overcome this challenge by proposing an automated method for annotating a large database of medical radiology images based on their semantic similarity. RESULTS: An automated, unsupervised approach is used to create a large annotated dataset of medical radiology images originating from the Clinical Hospital Centre Rijeka, Croatia. The pipeline is built by data-mining three different types of medical data: images, DICOM metadata and narrative diagnoses. The optimal feature extractors are then integrated into a multimodal representation, which is then clustered to create an automated pipeline for labelling a precursor dataset of 1,337,926 medical images into 50 clusters of visually similar images. The quality of the clusters is assessed by examining their homogeneity and mutual information, taking into account the anatomical region and modality representation. CONCLUSIONS: The results indicate that fusing the embeddings of all three data sources together provides the best results for the task of unsupervised clustering of large-scale medical data and leads to the most concise clusters. Hence, this work marks the initial step towards building a much larger and more fine-grained annotated dataset of medical radiology images.

9.
Methods Mol Biol ; 2814: 223-245, 2024.
Article in English | MEDLINE | ID: mdl-38954209

ABSTRACT

Dictyostelium represents a stripped-down model for understanding how cells make decisions during development. The complete life cycle takes around a day and the fully differentiated structure is composed of only two major cell types. With this apparent reduction in "complexity," single cell transcriptomics has proven to be a valuable tool in defining the features of developmental transitions and cell fate separation events, even providing causal information on how mechanisms of gene expression can feed into cell decision-making. These scientific outputs have been strongly facilitated by the ease of non-disruptive single cell isolation-allowing access to more physiological measures of transcript levels. In addition, the limited number of cell states during development allows the use of more straightforward analysis tools for handling the ensuing large datasets, which provides enhanced confidence in inferences made from the data. In this chapter, we will outline the approaches we have used for handling Dictyostelium single cell transcriptomic data, illustrating how these approaches have contributed to our understanding of cell decision-making during development.


Subject(s)
Dictyostelium , Gene Expression Profiling , Single-Cell Analysis , Transcriptome , Dictyostelium/genetics , Dictyostelium/growth & development , Single-Cell Analysis/methods , Gene Expression Profiling/methods , Gene Expression Regulation, Developmental , Single-Cell Gene Expression Analysis
10.
Sensors (Basel) ; 24(13)2024 Jun 26.
Article in English | MEDLINE | ID: mdl-39000931

ABSTRACT

Internet of Things (IoT) applications and resources are highly vulnerable to flood attacks, including Distributed Denial of Service (DDoS) attacks. These attacks overwhelm the targeted device with numerous network packets, making its resources inaccessible to authorized users. Such attacks may comprise attack references, attack types, sub-categories, host information, malicious scripts, etc. These details assist security professionals in identifying weaknesses, tailoring defense measures, and responding rapidly to possible threats, thereby improving the overall security posture of IoT devices. Developing an intelligent Intrusion Detection System (IDS) is highly complex due to its numerous network features. This study presents an improved IDS for IoT security that employs multimodal big data representation and transfer learning. First, the Packet Capture (PCAP) files are crawled to retrieve the necessary attacks and bytes. Second, Spark-based big data optimization algorithms handle huge volumes of data. Second, a transfer learning approach such as word2vec retrieves semantically-based observed features. Third, an algorithm is developed to convert network bytes into images, and texture features are extracted by configuring an attention-based Residual Network (ResNet). Finally, the trained text and texture features are combined and used as multimodal features to classify various attacks. The proposed method is thoroughly evaluated on three widely used IoT-based datasets: CIC-IoT 2022, CIC-IoT 2023, and Edge-IIoT. The proposed method achieves excellent classification performance, with an accuracy of 98.2%. In addition, we present a game theory-based process to validate the proposed approach formally.

11.
Article in English | MEDLINE | ID: mdl-38981117

ABSTRACT

OBJECTIVES: We describe new curriculum materials for engaging secondary school students in exploring the "big data" in the NIH All of Us Research Program's Public Data Browser and the co-design processes used to collaboratively develop the materials. We also describe the methods used to develop and validate assessment items for studying the efficacy of the materials for student learning as well as preliminary findings from these studies. MATERIALS AND METHODS: Secondary-level biology teachers from across the United States participated in a 2.5-day Co-design Summer Institute. After learning about the All of Us Research Program and its Data Browser, they collaboratively developed learning objectives and initial ideas for learning experiences related to exploring the Data Browser and big data. The Genetic Science Learning Center team at the University of Utah further developed the educators' ideas. Additional teachers and their students participated in classroom pilot studies to validate a 22-item instrument that assesses students' knowledge. Educators completed surveys about the materials and their experiences. RESULTS: The "Exploring Big Data with the All of Us Data Browser" curriculum module includes 3 data exploration guides that engage students in using the Data Browser, 3 related multimedia pieces, and teacher support materials. Pilot testing showed substantial growth in students' understanding of key big data concepts and research applications. DISCUSSION AND CONCLUSION: Our co-design process provides a model for educator engagement. The new curriculum module serves as a model for introducing secondary students to big data and precision medicine research by exploring diverse real-world datasets.

12.
Sci Rep ; 14(1): 15584, 2024 Jul 06.
Article in English | MEDLINE | ID: mdl-38971827

ABSTRACT

To address the shortcomings of traditional reliability theory in characterizing the stability of deep underground structures, the advanced first order second moment of reliability was improved to obtain fuzzy random reliability, which is more consistent with the working conditions. The traditional sensitivity analysis model was optimized using fuzzy random optimization, and an analytical calculation model of the mean and standard deviation of the fuzzy random reliability sensitivity was established. A big data hidden Markov model and expectation-maximization algorithm were used to improve the digital characteristics of fuzzy random variables. The fuzzy random sensitivity optimization model was used to confirm the effect of concrete compressive strength, thick-diameter ratio, reinforcement ratio, uncertainty coefficient of calculation model, and soil depth on the overall structural reliability of a reinforced concrete double-layer wellbore in deep alluvial soil. Through numerical calculations, these characteristics were observed to be the main influencing factors. Furthermore, while the soil depth was negatively correlated, the other influencing factors were all positively correlated with the overall reliability. This study provides an effective reference for the safe construction of deep underground structures in the future.

13.
Brain Inform ; 11(1): 19, 2024 Jul 10.
Article in English | MEDLINE | ID: mdl-38987395

ABSTRACT

Bipolar psychometric scales data are widely used in psychologic healthcare. Adequate psychological profiling benefits patients and saves time and costs. Grant funding depends on the quality of psychotherapeutic measures. Bipolar Likert scales yield compositional data because any order of magnitude of agreement towards an item assertion implies a complementary order of magnitude of disagreement. Using an isometric log-ratio (ilr) transformation the bivariate information can be transformed towards the real valued interval scale yielding unbiased statistical results increasing the statistical power of the Pearson correlation significance test if the Central Limit Theorem (CLT) of statistics is satisfied. In practice, however, the applicability of the CLT depends on the number of summands (i.e., the number of items) and the variance of the data generating process (DGP) of the ilr transformed data. Via simulation we provide evidence that the ilr approach also works satisfactory if the CLT is violated. That is, the ilr approach is robust towards extremely large or infinite variances of the underlying DGP increasing the statistical power of the correlation test. The study generalizes former results pointing out the universality and reliability of the ilr approach in psychometric big data analysis affecting psychometric health economics, patient welfare, grant funding, economic decision making and profits.

14.
Int J Cardiol ; 411: 132329, 2024 Sep 15.
Article in English | MEDLINE | ID: mdl-38964554

ABSTRACT

BACKGROUND: Left ventricular (LV) thrombus is not common but poses significant risks of embolic stroke or systemic embolism. However, the distinction in embolic risk between nonischemic cardiomyopathy (NICM) and ischemic cardiomyopathy (ICM) remains unclear. METHODS AND RESULTS: In total, 2738 LV thrombus patients from the JROAD-DPC (Japanese Registry of All Cardiac and Vascular Diseases Diagnosis Procedure Combination) database were included. Among these patients, 1037 patients were analyzed, with 826 (79.7%) having ICM and 211 with NICM (20.3%). Within the NICM group, the distribution was as follows: dilated cardiomyopathy (DCM; 41.2%), takotsubo cardiomyopathy (27.0%), hypertrophic cardiomyopathy (18.0%), and other causes (13.8%). The primary outcome was a composite of embolic stroke or systemic embolism (SSE) during hospitalization. The ICM and NICM groups showed no significant difference in the primary outcome (5.8% vs. 7.6%, p = 0.34). Among NICM, SSE occurred in 12.6% of patients with DCM, 7.0% with takotsubo cardiomyopathy, and 2.6% with hypertrophic cardiomyopathy. Multivariate logistic regression analysis for SSE revealed an odds ratio of 1.4 (95% confidence interval [CI], 0.7-2.7, p = 0.37) for NICM compared to ICM. However, DCM exhibited a higher adjusted odds ratio for SSE compared to ICM (2.6, 95% CI 1.2-6.0, p = 0.022). CONCLUSIONS: This nationwide shows comparable rates of embolic events between ICM and NICM in LV thrombus patients, with DCM posing a greater risk of SSE than ICM. The findings emphasize the importance of assessing the specific cause of heart disease in NICM, within LV thrombus management strategies.


Subject(s)
Databases, Factual , Myocardial Ischemia , Registries , Thrombosis , Humans , Female , Male , Aged , Middle Aged , Thrombosis/epidemiology , Myocardial Ischemia/epidemiology , Myocardial Ischemia/diagnosis , Japan/epidemiology , Risk Factors , Embolism/epidemiology , Embolism/complications , Heart Ventricles/diagnostic imaging , Cardiomyopathies/epidemiology , Aged, 80 and over
15.
J Anesth Analg Crit Care ; 4(1): 44, 2024 Jul 12.
Article in English | MEDLINE | ID: mdl-38992794

ABSTRACT

We are in the era of Health 4.0 when novel technologies are providing tools capable of improving the quality and safety of the services provided. Our project involves the integration of different technologies (AI, big data, robotics, and telemedicine) to create a unique system for patients admitted to intensive care units suffering from infectious diseases capable of both increasing the personalization of care and ensuring a safer environment for caregivers.

16.
Front Oncol ; 14: 1444543, 2024.
Article in English | MEDLINE | ID: mdl-39015491
17.
J Med Internet Res ; 26: e49570, 2024 Jul 16.
Article in English | MEDLINE | ID: mdl-39012659

ABSTRACT

BACKGROUND: Evidence-based clinical intake tools (EBCITs) are structured assessment tools used to gather information about patients and help health care providers make informed decisions. The growing demand for personalized medicine, along with the big data revolution, has rendered EBCITs a promising solution. EBCITs have the potential to provide comprehensive and individualized assessments of symptoms, enabling accurate diagnosis, while contributing to the grounding of medical care. OBJECTIVE: This work aims to examine whether EBCITs cover data concerning disorders and symptoms to a similar extent as physicians, and thus can reliably address medical conditions in clinical settings. We also explore the potential of EBCITs to discover and ground the real prevalence of symptoms in different disorders thereby expanding medical knowledge and further supporting medical diagnoses made by physicians. METHODS: Between August 1, 2022, and January 15, 2023, patients who used the services of a digital health care (DH) provider in the United States were first assessed by the Kahun EBCIT. Kahun platform gathered and analyzed the information from the sessions. This study estimated the prevalence of patients' symptoms in medical disorders using 2 data sets. The first data set analyzed symptom prevalence, as determined by Kahun's knowledge engine. The second data set analyzed symptom prevalence, relying solely on data from the DH patients gathered by Kahun. The variance difference between these 2 prevalence data sets helped us assess Kahun's ability to incorporate new data while integrating existing knowledge. To analyze the comprehensiveness of Kahun's knowledge engine, we compared how well it covers weighted data for the symptoms and disorders found in the 2019 National Ambulatory Medical Care Survey (NMCAS). To assess Kahun's diagnosis accuracy, physicians independently diagnosed 250 of Kahun-DH's sessions. Their diagnoses were compared with Kahun's diagnoses. RESULTS: In this study, 2550 patients used Kahun to complete a full assessment. Kahun proposed 108,523 suggestions related to symptoms during the intake process. At the end of the intake process, 6496 conditions were presented to the caregiver. Kahun covered 94% (526,157,569/562,150,572) of the weighted symptoms and 91% (1,582,637,476/173,4783,244) of the weighted disorders in the 2019 NMCAS. In 90% (224/250) of the sessions, both physicians and Kahun suggested at least one identical disorder, with a 72% (367/507) total accuracy rate. Kahun's engine yielded 519 prevalences while the Kahun-DH cohort yielded 599; 156 prevalences were unique to the latter and 443 prevalences were shared by both data sets. CONCLUSIONS: ECBITs, such as Kahun, encompass extensive amounts of knowledge and could serve as a reliable database for inferring medical insights and diagnoses. Using this credible database, the potential prevalence of symptoms in different disorders was discovered or grounded. This highlights the ability of ECBITs to refine the understanding of relationships between disorders and symptoms, which further supports physicians in medical diagnosis.


Subject(s)
Evidence-Based Medicine , Humans , Retrospective Studies , Prevalence , Female , Male , Adult , Middle Aged , Cohort Studies , United States/epidemiology , Digital Health
18.
Brief Bioinform ; 25(4)2024 May 23.
Article in English | MEDLINE | ID: mdl-39007597

ABSTRACT

Thyroid cancer incidences endure to increase even though a large number of inspection tools have been developed recently. Since there is no standard and certain procedure to follow for the thyroid cancer diagnoses, clinicians require conducting various tests. This scrutiny process yields multi-dimensional big data and lack of a common approach leads to randomly distributed missing (sparse) data, which are both formidable challenges for the machine learning algorithms. This paper aims to develop an accurate and computationally efficient deep learning algorithm to diagnose the thyroid cancer. In this respect, randomly distributed missing data stemmed singularity in learning problems is treated and dimensionality reduction with inner and target similarity approaches are developed to select the most informative input datasets. In addition, size reduction with the hierarchical clustering algorithm is performed to eliminate the considerably similar data samples. Four machine learning algorithms are trained and also tested with the unseen data to validate their generalization and robustness abilities. The results yield 100% training and 83% testing preciseness for the unseen data. Computational time efficiencies of the algorithms are also examined under the equal conditions.


Subject(s)
Algorithms , Deep Learning , Thyroid Neoplasms , Thyroid Neoplasms/diagnosis , Humans , Machine Learning , Cluster Analysis
19.
Expert Opin Drug Discov ; : 1-27, 2024 Jul 14.
Article in English | MEDLINE | ID: mdl-39004919

ABSTRACT

INTRODUCTION: Small molecules often bind to multiple targets, a behavior termed polypharmacology. Anticipating polypharmacology is essential for drug discovery since unknown off-targets can modulate safety and efficacy - profoundly affecting drug discovery success. Unfortunately, experimental methods to assess selectivity present significant limitations and drugs still fail in the clinic due to unanticipated off-targets. Computational methods are a cost-effective, complementary approach to predict polypharmacology. AREAS COVERED: This review aims to provide a comprehensive overview of the state of polypharmacology prediction and discuss its strengths and limitations, covering both classical cheminformatics methods and bioinformatic approaches. The authors review available data sources, paying close attention to their different coverage. The authors then discuss major algorithms grouped by the types of data that they exploit using selected examples. EXPERT OPINION: Polypharmacology prediction has made impressive progress over the last decades and contributed to identify many off-targets. However, data incompleteness currently limits most approaches to comprehensively predict selectivity. Moreover, our limited agreement on model assessment challenges the identification of the best algorithms - which at present show modest performance in prospective real-world applications. Despite these limitations, the exponential increase of multidisciplinary Big Data and AI hold much potential to better polypharmacology prediction and de-risk drug discovery.

20.
J Clin Med ; 13(13)2024 Jul 01.
Article in English | MEDLINE | ID: mdl-38999448

ABSTRACT

Background: This study investigated the potential link between blood pressure variability (BPV) and the incidence of aortic stenosis (AS) using Korean National Health Insurance Service data from 2002 to 2019. Methods: We collected annual systolic blood pressure variability (SBPV) measurements consisting of three consecutive blood pressure readings each year over three years. The obtained SBPV data was divided into five quantiles, with the highest quintile representing a high fluctuation of blood pressure. Results: Analyzing 9,341,629 individuals with a mean age of 40.7 years, the study found 3981 new AS diagnoses during an average 8.66-year follow-up. Independent predictors for AS included higher blood pressure levels and elevated systolic blood pressure variability (SBPV). The hazard ratios (HR) for different SBPV quintiles compared to the reference (1st quintile) were as follows: 2nd quintile HR 1.09 (p = 0.18), 3rd quintile HR 1.13 (p = 0.04), 4th quintile HR 1.13 (p = 0.04), and 5th quintile HR 1.39 (p < 0.001). Conclusion: Our findings suggest that both hypertension and high fluctuations in SBP during consecutive visits are associated with an increased risk of incident AS. These results emphasize the importance of blood pressure management and stability in the prevention of AS.

SELECTION OF CITATIONS
SEARCH DETAIL
...