Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
J Med Internet Res ; 23(11): e32900, 2021 11 26.
Artigo em Inglês | MEDLINE | ID: mdl-34842542

RESUMO

BACKGROUND: Multimorbidity clinical risk scores allow clinicians to quickly assess their patients' health for decision making, often for recommendation to care management programs. However, these scores are limited by several issues: existing multimorbidity scores (1) are generally limited to one data group (eg, diagnoses, labs) and may be missing vital information, (2) are usually limited to specific demographic groups (eg, age), and (3) do not formally provide any granularity in the form of more nuanced multimorbidity risk scores to direct clinician attention. OBJECTIVE: Using diagnosis, lab, prescription, procedure, and demographic data from electronic health records (EHRs), we developed a physiologically diverse and generalizable set of multimorbidity risk scores. METHODS: Using EHR data from a nationwide cohort of patients, we developed the total health profile, a set of six integrated risk scores reflecting five distinct organ systems and overall health. We selected the occurrence of an inpatient hospital visitation over a 2-year follow-up window, attributable to specific organ systems, as our risk endpoint. Using a physician-curated set of features, we trained six machine learning models on 794,294 patients to predict the calibrated probability of the aforementioned endpoint, producing risk scores for heart, lung, neuro, kidney, and digestive functions and a sixth score for combined risk. We evaluated the scores using a held-out test cohort of 198,574 patients. RESULTS: Study patients closely matched national census averages, with a median age of 41 years, a median income of $66,829, and racial averages by zip code of 73.8% White, 5.9% Asian, and 11.9% African American. All models were well calibrated and demonstrated strong performance with areas under the receiver operating curve (AUROCs) of 0.83 for the total health score (THS), 0.89 for heart, 0.86 for lung, 0.84 for neuro, 0.90 for kidney, and 0.83 for digestive functions. There was consistent performance of this scoring system across sexes, diverse patient ages, and zip code income levels. Each model learned to generate predictions by focusing on appropriate clinically relevant patient features, such as heart-related hospitalizations and chronic hypertension diagnosis for the heart model. The THS outperformed the other commonly used multimorbidity scoring systems, specifically the Charlson Comorbidity Index (CCI) and the Elixhauser Comorbidity Index (ECI) overall (AUROCs: THS=0.823, CCI=0.735, ECI=0.649) as well as for every age, sex, and income bracket. Performance improvements were most pronounced for middle-aged and lower-income subgroups. Ablation tests using only diagnosis, prescription, social determinants of health, and lab feature groups, while retaining procedure-related features, showed that the combination of feature groups has the best predictive performance, though only marginally better than the diagnosis-only model on at-risk groups. CONCLUSIONS: Massive retrospective EHR data sets have made it possible to use machine learning to build practical multimorbidity risk scores that are highly predictive, personalizable, intuitive to explain, and generalizable across diverse patient populations.


Assuntos
Aprendizado de Máquina , Multimorbidade , Adulto , Estudos de Coortes , Registros Eletrônicos de Saúde , Humanos , Pessoa de Meia-Idade , Estudos Retrospectivos , Fatores de Risco
2.
AMIA Jt Summits Transl Sci Proc ; 2021: 238-247, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34457138

RESUMO

We conduct exploratory analysis of a novel algorithm called Model Agnostic Effect Coefficients (MAgEC) for extracting clinical features of importance when assessing an individual patient's healthcare risks, alongside predicting the risk itself. Our approach uses a non-homogeneous consensus-based algorithm to assign importance to features, which differs from similar approaches, which are homogeneous (typically purely based on random forests). Using the MIMIC-III dataset, we apply our method on predicting drivers/causers of unexpected mechanical ventilation in a large cohort patient population. We validate the MAgEC method using two primary metrics: its accuracy in predicting mechanical ventilation and the similarity of the proposed feature importances to a competing algorithm (SHAP). We also more closely discuss MAgEC itself by examining the stability of our proposed feature importances under different perturbations and whether the non-homogeneity of the approach actually leads to feature importance diversity. The code to implement MAgEC is open-sourced on GitHub (https://github.com/gstef80/MAgEC).


Assuntos
Algoritmos , Respiração Artificial , Estudos de Coortes , Consenso , Humanos
3.
Artigo em Inglês | MEDLINE | ID: mdl-32528215

RESUMO

This study demonstrates that a variant of a Siamese neural network architecture is more effective at classifying high-dimensional radiomic features (extracted from T2 MRI images) than traditional models, such as a Support Vector Machine or Discriminant Analysis. Ninety-nine female patients, between the ages of 20 and 48, were imaged with T2 MRI. Using biopsy pathology, the patients were separated into two groups: those with breast cancer (N=55) and those with GLM (N=44). Lesions were segmented by a trained radiologist and the ROIs were used for radiomic feature extraction. The radiomic features include 536 published features from Aerts et al., along with 20 features recurrent quantification analysis features. A Student T-Test was used to select features found to be statistically significant between the two patient groups. These features were then used to train a Siamese neural network. The label given to test features was the label of whichever class the test features with the highest percentile similarity within the training group. Within the two highest-dimensional feature sets, the Siamese network produced an AUC of 0.853 and 0.894, respectively. This is compared to best non-Siamese model, Discriminant Analysis, which produced an AUC of 0.823 and 0.836 for the two respective feature sets. However, when it came to the lower-dimensional recurrent features and the top-20 most significant features from Aerts et al., the Siamese network performed on-par or worse than the competing models. The proposed Siamese neural network architecture can outperform competing other models in high-dimensional, low-sample size spaces with regards to tabular data.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...