1.
medRxiv ; 2024 May 27.
Article in English | MEDLINE | ID: mdl-38854022

ABSTRACT

Importance: Despite the availability of disease-modifying therapies, scalable strategies for heart failure (HF) risk stratification remain elusive. Portable devices capable of recording single-lead electrocardiograms (ECGs) can enable large-scale community-based risk assessment. Objective: To evaluate an artificial intelligence (AI) algorithm to predict HF risk from noisy single-lead ECGs. Design: Multicohort study. Setting: Retrospective cohort of individuals with outpatient ECGs in the integrated Yale New Haven Health System (YNHHS) and prospective population-based cohorts of UK Biobank (UKB) and Brazilian Longitudinal Study of Adult Health (ELSA-Brasil). Participants: Individuals without HF at baseline. Exposures: AI-ECG-defined risk of left ventricular systolic dysfunction (LVSD). Main Outcomes and Measures: Among individuals with ECGs, we isolated lead I ECGs and deployed a noise-adapted AI-ECG model trained to identify LVSD. We evaluated the association of the model probability with new-onset HF, defined as the first HF hospitalization. We compared the discrimination of AI-ECG against the pooled cohort equations to prevent HF (PCP-HF) score for new-onset HF using Harrell's C-statistic, integrated discrimination improvement (IDI), and net reclassification improvement (NRI). Results: There were 194,340 YNHHS patients (age 56 years [IQR, 41-69], 112,082 women [58%]), 42,741 UKB participants (65 years [59-71], 21,795 women [52%]), and 13,454 ELSA-Brasil participants (56 years [41-69], 7,348 women [55%]) with baseline ECGs. A total of 3,929 developed HF in YNHHS over 4.5 years (2.6-6.6), 46 in UKB over 3.1 years (2.1-4.5), and 31 in ELSA-Brasil over 4.2 years (3.7-4.5). A positive AI-ECG screen was associated with a 3- to 7-fold higher risk for HF, and each 0.1 increment in the model probability portended a 27-65% higher hazard across cohorts, independent of age, sex, comorbidities, and competing risk of death.
AI-ECG's discrimination for new-onset HF was 0.725 in YNHHS, 0.792 in UKB, and 0.833 in ELSA-Brasil. Across cohorts, incorporating AI-ECG predictions in addition to PCP-HF resulted in improved Harrell's C-statistic (Δ=0.112-0.114), with an IDI of 0.078-0.238 and an NRI of 20.1%-48.8% for AI-ECG vs. PCP-HF. Conclusions and Relevance: Across multinational cohorts, a noise-adapted AI model with lead I ECGs as the sole input defined HF risk, representing a scalable portable and wearable device-based HF risk-stratification strategy. KEY POINTS: Question: Can single-lead electrocardiogram (ECG) tracings predict heart failure (HF) risk? Findings: We evaluated a noise-adapted artificial intelligence (AI) algorithm for single-lead ECGs as the sole input across multinational cohorts, spanning a diverse integrated US health system and large community-based cohorts in the UK and Brazil. A positive AI-ECG screen was associated with a 3- to 7-fold higher HF risk, independent of age, sex, and comorbidities. The AI model achieved incremental discrimination and improved reclassification for HF over the pooled cohort equations to prevent HF (PCP-HF). Meaning: A noise-adapted AI model for single-lead ECG predicted the risk of new-onset HF, representing a scalable HF risk-stratification strategy for portable and wearable devices.
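
Illustrative note on the discrimination metric: the abstract above reports discrimination as Harrell's C-statistic. The following is a minimal sketch of how that statistic can be computed for right-censored follow-up data. It is not the study's code. The function name `harrell_c` and the toy values are hypothetical, and pairs with tied follow-up times are simply skipped, which is a simplifying assumption.

```python
from itertools import combinations


def harrell_c(times, events, scores):
    """Harrell's C for right-censored data (illustrative sketch).

    times  : follow-up time per individual
    events : 1 if the event was observed, 0 if censored
    scores : model risk score (higher means higher predicted risk)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that `a` has the shorter follow-up time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # A pair is comparable only if the shorter time ends in an observed
        # event; tied times are skipped in this simplified version.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if scores[a] > scores[b]:
            concordant += 1.0   # earlier event had the higher score
        elif scores[a] == scores[b]:
            concordant += 0.5   # tied scores count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy data: four individuals, two observed events.
c_index = harrell_c(
    times=[2, 4, 5, 7],
    events=[1, 0, 1, 0],
    scores=[0.9, 0.3, 0.1, 0.6],
)
print(c_index)
```

A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking of event times by the score. With the toy data, three of the four comparable pairs are concordant.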

2.
Lancet ; 2024 May 29.
Article in English | MEDLINE | ID: mdl-38823406

ABSTRACT

BACKGROUND: Coronary computed tomography angiography (CCTA) is the first line investigation for chest pain, and it is used to guide revascularisation. However, the widespread adoption of CCTA has revealed a large group of individuals without obstructive coronary artery disease (CAD), with unclear prognosis and management. Measurement of coronary inflammation from CCTA using the perivascular fat attenuation index (FAI) Score could enable cardiovascular risk prediction and guide the management of individuals without obstructive CAD. The Oxford Risk Factors And Non-invasive imaging (ORFAN) study aimed to evaluate the risk profile and event rates among patients undergoing CCTA as part of routine clinical care in the UK National Health Service (NHS); to test the hypothesis that coronary arterial inflammation drives cardiac mortality or major adverse cardiac events (MACE) in patients with or without CAD; and to externally validate the performance of the previously trained artificial intelligence (AI)-Risk prognostic algorithm and the related AI-Risk classification system in a UK population. METHODS: This multicentre, longitudinal cohort study included 40 091 consecutive patients undergoing clinically indicated CCTA in eight UK hospitals, who were followed up for MACE (ie, myocardial infarction, new onset heart failure, or cardiac death) for a median of 2·7 years (IQR 1·4-5·3). The prognostic value of FAI Score in the presence and absence of obstructive CAD was evaluated in 3393 consecutive patients from the two hospitals with the longest follow-up (7·7 years [6·4-9·1]). An AI-enhanced cardiac risk prediction algorithm, which integrates FAI Score, coronary plaque metrics, and clinical risk factors, was then evaluated in this population. FINDINGS: In the 2·7 year median follow-up period, patients without obstructive CAD (32 533 [81·1%] of 40 091) accounted for 2857 (66·3%) of the 4307 total MACE and 1118 (63·7%) of the 1754 total cardiac deaths in the whole of Cohort A. 
Increased FAI Score in all the three coronary arteries had an additive impact on the risk for cardiac mortality (hazard ratio [HR] 29·8 [95% CI 13·9-63·9], p<0·001) or MACE (12·6 [8·5-18·6], p<0·001) comparing three vessels with an FAI Score in the top versus bottom quartile for each artery. FAI Score in any coronary artery predicted cardiac mortality and MACE independently from cardiovascular risk factors and the presence or extent of CAD. The AI-Risk classification was positively associated with cardiac mortality (6·75 [5·17-8·82], p<0·001, for very high risk vs low or medium risk) and MACE (4·68 [3·93-5·57], p<0·001 for very high risk vs low or medium risk). Finally, the AI-Risk model was well calibrated against true events. INTERPRETATION: The FAI Score captures inflammatory risk beyond the current clinical risk stratification and CCTA interpretation, particularly among patients without obstructive CAD. The AI-Risk integrates this information in a prognostic algorithm, which could be used as an alternative to traditional risk factor-based risk calculators. FUNDING: British Heart Foundation, NHS-AI award, Innovate UK, National Institute for Health and Care Research, and the Oxford Biomedical Research Centre.

3.
Neurol Ther ; 2024 May 30.
Article in English | MEDLINE | ID: mdl-38814532

ABSTRACT

INTRODUCTION: Traditional methods for assessing movement quality rely on subjective standardized scales and clinical expertise. This limitation creates challenges for assessing patients with spinocerebellar ataxia (SCA), in whom changes in mobility can be subtle and varied. We hypothesized that a machine learning analytic system might complement traditional clinician-rated measures of gait. Our objective was to use a video-based assessment of gait dispersion to compare the effects of troriluzole with placebo on gait quality in adults with SCA. METHODS: Participants with SCA underwent gait assessment in a phase 3, double-blind, placebo-controlled trial of troriluzole (NCT03701399). Videos were processed through a deep learning pose extraction algorithm, followed by the estimation of a novel gait stability measure, the Pose Dispersion Index, quantifying the frame-by-frame symmetry, balance, and stability during natural and tandem walk tasks. The effects of troriluzole treatment were assessed in mixed linear models with participant-level grouping and a treatment group-by-visit week interaction, adjusted for age, sex, baseline modified Functional Scale for the Assessment and Rating of Ataxia (f-SARA), and time since diagnosis. RESULTS: From 218 randomized participants, 67 and 56 participants had interpretable videos of a tandem and natural walk attempt, respectively. At Week 48, individuals assigned to troriluzole exhibited significant (p = 0.010) improvement in tandem walk Pose Dispersion Index versus placebo [adjusted interaction coefficient: 0.584 (95% confidence interval [CI] 0.137 to 1.031)]. A similar, nonsignificant trend was observed in the natural walk assessment [coefficient: 1.198 (95% CI -1.067 to 3.462)]. Further, lower baseline Pose Dispersion Index during the natural walk was significantly (p = 0.041) associated with a higher risk of subsequent falls [adjusted Poisson coefficient: -0.356 (95% CI -0.697 to -0.014)].
CONCLUSION: Using this novel approach, troriluzole-treated subjects demonstrated improvement in gait as compared to placebo for the tandem walk. Machine learning applied to video-captured gait parameters can complement clinician-reported motor assessment in adults with SCA. The Pose Dispersion Index may enhance assessment in future research. TRIAL REGISTRATION: ClinicalTrials.gov identifier: NCT03701399.

4.
Eur Heart J Digit Health ; 5(3): 303-313, 2024 May.
Article in English | MEDLINE | ID: mdl-38774380

ABSTRACT

Aims: An algorithmic strategy for anatomical vs. functional testing in suspected coronary artery disease (CAD) (Anatomical vs. Stress teSting decIsion Support Tool; ASSIST) is associated with better outcomes than random selection. However, in the real world, this decision is rarely random. We explored the agreement between a provider-driven vs. simulated algorithmic approach to cardiac testing and its association with outcomes across multinational cohorts. Methods and results: In two cohorts of functional vs. anatomical testing in a US hospital health system [Yale; 2013-2023; n = 130 196 (97.0%) vs. n = 4020 (3.0%), respectively], and the UK Biobank [n = 3320 (85.1%) vs. n = 581 (14.9%), respectively], we examined outcomes stratified by agreement between the real-world and ASSIST-recommended strategies. Younger age, female sex, Black race, and diabetes history were independently associated with lower odds of ASSIST-aligned testing. Over a median of 4.9 (interquartile range [IQR]: 2.4-7.1) and 5.4 (IQR: 2.6-8.8) years, referral to the ASSIST-recommended strategy was associated with a lower risk of acute myocardial infarction or death (adjusted hazard ratio: 0.81, 95% confidence interval [CI] 0.77-0.85, P < 0.001 and 0.74 [95% CI 0.60-0.90], P = 0.003, respectively), an effect that remained significant across years, test types, and risk profiles. In post hoc analyses of anatomical-first testing in the Prospective Multicentre Imaging Study for Evaluation of Chest Pain (PROMISE) trial, alignment with ASSIST was independently associated with a 17% and 30% higher risk of detecting CAD in any vessel or the left main artery/proximal left anterior descending coronary artery, respectively. Conclusion: In cohorts where historical practices largely favour functional testing, alignment with an algorithmic approach to cardiac testing defined by ASSIST was associated with a lower risk of adverse outcomes.
This highlights the potential utility of a data-driven approach in the diagnostic management of CAD.

5.
medRxiv ; 2024 May 16.
Article in English | MEDLINE | ID: mdl-38798457

ABSTRACT

Importance: Randomized clinical trials (RCTs) are the standard for defining an evidence-based approach to managing disease, but their generalizability to real-world patients remains challenging to quantify. Objective: To develop a multidimensional patient variable mapping algorithm to quantify the similarity and representation of electronic health record (EHR) patients corresponding to an RCT and estimate the putative treatment effects in real-world settings based on individual treatment effects observed in an RCT. Design: A retrospective analysis of the Treatment of Preserved Cardiac Function Heart Failure with an Aldosterone Antagonist Trial (TOPCAT; 2006-2012) and a multi-hospital patient cohort from the electronic health record (EHR) in the Yale New Haven Hospital System (YNHHS; 2015-2023). Setting: A multicenter international RCT (TOPCAT) and multi-hospital patient cohort (YNHHS). Participants: All TOPCAT participants and patients with heart failure with preserved ejection fraction (HFpEF) and ≥1 hospitalization within YNHHS. Exposures: 63 pre-randomization characteristics measured across the TOPCAT and YNHHS cohorts. Main Outcomes and Measures: Real-world generalizability of the RCT TOPCAT using a multidimensional phenotypic distance metric between TOPCAT and YNHHS cohorts. Estimation of the individualized treatment effect of spironolactone use on all-cause mortality within the YNHHS cohort based on phenotypic distance from the TOPCAT cohort. Results: There were 3,445 patients in TOPCAT and 11,712 HFpEF patients across five hospital sites. Across the 63 TOPCAT variables mapped by clinicians to the EHR, there were larger differences between TOPCAT and each of the 5 EHR sites (median SMD 0.200, IQR 0.037-0.410) than between the 5 EHR sites (median SMD 0.062, IQR 0.010-0.130). The synthesis of these differences across covariates using our multidimensional similarity score also suggested substantial phenotypic dissimilarity between the TOPCAT and EHR cohorts.
By phenotypic distance, a majority (55%) of TOPCAT participants were closer to each other than any individual EHR patient. Using a TOPCAT-derived model of individualized treatment benefit from spironolactone, those predicted to derive benefit and receiving spironolactone in the EHR cohorts had substantially better outcomes compared with predicted benefit and not receiving the medication (HR 0.74, 95% CI 0.62-0.89). Conclusions and Relevance: We propose a novel approach to evaluating the real-world representativeness of RCT participants against corresponding patients in the EHR across the full multidimensional spectrum of the represented phenotypes. This enables the evaluation of the implications of RCTs for real-world patients. KEY POINTS: Question: How can we examine the multi-dimensional generalizability of randomized clinical trials (RCT) to real-world patient populations? Findings: We demonstrate a novel phenotypic distance metric comparing an RCT to real-world populations in a large multicenter RCT of heart failure patients and the corresponding patients in multisite electronic health records (EHRs). Across 63 pre-randomization characteristics, pairwise assessments of members of the RCT and EHR cohorts were more discordant from each other than between members of the EHR cohort (median standardized mean difference 0.200 [0.037-0.410] vs 0.062 [0.010-0.130]), with a majority (55%) of RCT participants closer to each other than any individual EHR patient. The approach also enabled the quantification of expected real world outcomes based on effects observed in the RCT. Meaning: A multidimensional phenotypic distance metric quantifies the generalizability of RCTs to a given population while also offering an avenue to examine expected real-world patient outcomes based on treatment effects observed in the RCT.
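
Illustrative note on the standardized mean difference (SMD): the abstract above summarizes cohort differences as a median SMD across covariates. The following is a minimal sketch of one common SMD definition for a continuous covariate, the absolute difference in means divided by the pooled standard deviation. The abstract does not state the exact formula the authors used, so this definition is an assumption. The covariate names and values are hypothetical and are not taken from TOPCAT or the EHR cohort.

```python
from math import sqrt
from statistics import mean, median, variance


def smd(group_a, group_b):
    """Absolute standardized mean difference for one continuous covariate.

    Uses the pooled SD sqrt((var_a + var_b) / 2), one common convention.
    """
    pooled_sd = sqrt((variance(group_a) + variance(group_b)) / 2)
    return abs(mean(group_a) - mean(group_b)) / pooled_sd


# Hypothetical covariates measured in a trial cohort and an EHR cohort.
trial = {"age": [60, 70, 80], "sbp": [120, 130, 140]}
ehr = {"age": [65, 75, 85], "sbp": [122, 132, 142]}

per_covariate = {name: smd(trial[name], ehr[name]) for name in trial}
print(per_covariate)                   # SMD for each covariate
print(median(per_covariate.values()))  # summary across covariates
```

Summarizing with the median and IQR across covariates, as the abstract does, keeps a few highly imbalanced variables from dominating the comparison.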

6.
medRxiv ; 2024 Apr 03.
Article in English | MEDLINE | ID: mdl-38633808

ABSTRACT

Background: Current risk stratification strategies for heart failure (HF) risk require either specific blood-based biomarkers or comprehensive clinical evaluation. In this study, we evaluated the use of artificial intelligence (AI) applied to images of electrocardiograms (ECGs) to predict HF risk. Methods: Across multinational longitudinal cohorts in the integrated Yale New Haven Health System (YNHHS) and in population-based UK Biobank (UKB) and Brazilian Longitudinal Study of Adult Health (ELSA-Brasil), we identified individuals without HF at baseline. Incident HF was defined based on the first occurrence of an HF hospitalization. We evaluated an AI-ECG model that defines the cross-sectional probability of left ventricular dysfunction from a single image of a 12-lead ECG and its association with incident HF. We accounted for the competing risk of death using the Fine-Gray subdistribution model and evaluated the discrimination using Harrell's C-statistic. The pooled cohort equations to prevent HF (PCP-HF) were used as a comparator for estimating incident HF risk. Results: Among 231,285 individuals at YNHHS, 4472 had a primary HF hospitalization over 4.5 years (IQR 2.5-6.6) of follow-up. In UKB and ELSA-Brasil, among 42,741 and 13,454 people, 46 and 31 developed HF over a follow-up of 3.1 (2.1-4.5) and 4.2 (3.7-4.5) years, respectively. A positive AI-ECG screen portended a 4-fold higher risk of incident HF among YNHHS patients (age-, sex-adjusted HR [aHR] 3.88 [95% CI, 3.63-4.14]). In UKB and ELSA-Brasil, a positive-screen ECG portended 13- and 24-fold higher hazard of incident HF, respectively (aHR: UKB, 12.85 [6.87-24.02]; ELSA-Brasil, 23.50 [11.09-49.81]). The association was consistent after accounting for comorbidities and the competing risk of death. Higher model output probabilities were progressively associated with a higher risk for HF. The model's discrimination for incident HF was 0.718 in YNHHS, 0.769 in UKB, and 0.810 in ELSA-Brasil.
Across cohorts, incorporating model probability with PCP-HF yielded a significant improvement in discrimination over PCP-HF alone. Conclusions: An AI model applied to images of 12-lead ECGs can identify those at elevated risk of HF across multinational cohorts. As a digital biomarker of HF risk that requires just an ECG image, this AI-ECG approach can enable scalable and efficient screening for HF risk.

7.
medRxiv ; 2024 Mar 19.
Article in English | MEDLINE | ID: mdl-38562897

ABSTRACT

Background: Risk stratification strategies for cancer therapeutics-related cardiac dysfunction (CTRCD) rely on serial monitoring by specialized imaging, limiting their scalability. Objectives: To examine an artificial intelligence (AI)-enhanced electrocardiographic (AI-ECG) surrogate for imaging risk biomarkers, and its association with CTRCD. Methods: Across a five-hospital U.S.-based health system (2013-2023), we identified patients with breast cancer or non-Hodgkin lymphoma (NHL) who received anthracyclines (AC) and/or trastuzumab (TZM), and a control cohort receiving immune checkpoint inhibitors (ICI). We deployed a validated AI model of left ventricular systolic dysfunction (LVSD) to ECG images (≥0.1, positive screen) and explored its association with i) global longitudinal strain (GLS) measured within 15 days (n=7,271 pairs); ii) future CTRCD (new cardiomyopathy, heart failure, or left ventricular ejection fraction [LVEF]<50%), and LVEF<40%. In the ICI cohort we correlated baseline AI-ECG-LVSD predictions with downstream myocarditis. Results: Higher AI-ECG LVSD predictions were associated with worse GLS (-18% [IQR:-20 to -17%] for predictions<0.1, to -12% [IQR:-15 to -9%] for ≥0.5 (p<0.001)). In 1,308 patients receiving AC/TZM (age 59 [IQR:49-67] years, 999 [76.4%] women, 80 [IQR:42-115] follow-up months) a positive baseline AI-ECG LVSD screen was associated with ~2-fold and ~4.8-fold increase in the incidence of the composite CTRCD endpoint (adj.HR 2.22 [95%CI:1.63-3.02]), and LVEF<40% (adj.HR 4.76 [95%CI:2.62-8.66]), respectively. Among 2,056 patients receiving ICI (age 65 [IQR:57-73] years, 913 [44.4%] women, follow-up 63 [IQR:28-99] months) AI-ECG predictions were not associated with ICI myocarditis (adj.HR 1.36 [95%CI:0.47-3.93]). Conclusion: AI applied to baseline ECG images can stratify the risk of CTRCD associated with anthracycline or trastuzumab exposure.

8.
medRxiv ; 2024 Mar 15.
Article in English | MEDLINE | ID: mdl-38559021

ABSTRACT

Background: Point-of-care ultrasonography (POCUS) enables access to cardiac imaging directly at the bedside but is limited by brief acquisition, variation in acquisition quality, and lack of advanced protocols. Objective: To develop and validate deep learning models for detecting underdiagnosed cardiomyopathies on cardiac POCUS, leveraging a novel acquisition quality-adapted modeling strategy. Methods: To develop the models, we identified transthoracic echocardiograms (TTEs) of patients across five hospitals in a large U.S. health system with transthyretin amyloid cardiomyopathy (ATTR-CM, confirmed by Tc99m-pyrophosphate imaging), hypertrophic cardiomyopathy (HCM, confirmed by cardiac magnetic resonance), and controls enriched for the presence of severe AS. In a sample of 290,245 TTE videos, we used novel augmentation approaches and a customized loss function to weigh image and view quality to train a multi-label, view agnostic video-based convolutional neural network (CNN) to discriminate the presence of ATTR-CM, HCM, and/or AS. Models were tested across 3,758 real-world POCUS videos from 1,879 studies in 1,330 independent emergency department (ED) patients from 2011 through 2023. Results: Our multi-label, view-agnostic classifier demonstrated state-of-the-art performance in discriminating ATTR-CM (AUROC 0.98 [95%CI: 0.96-0.99]) and HCM (AUROC 0.95 [95% CI: 0.94-0.96]) on standard TTE studies. Automated metrics of anatomical view correctness confirmed significantly lower quality in POCUS vs TTE videos (median view classifier confidence of 0.63 [IQR: 0.44-0.88] vs 0.93 [IQR: 0.69-1.00], p<0.001). When deployed to POCUS videos, our algorithm effectively discriminated ATTR-CM and HCM with AUROC of up to 0.94 (parasternal long-axis (PLAX)), and 0.85 (apical 4 chamber), corresponding to positive diagnostic odds ratios of 46.7 and 25.5, respectively. 
In total, 18/35 (51.4%) of ATTR-CM and 32/57 (41.1%) of HCM patients in the POCUS cohort had an AI-positive screen in the year before their eventual confirmatory imaging. Conclusions: We define and validate an AI framework that enables scalable, opportunistic screening of under-diagnosed cardiomyopathies using POCUS.
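
Illustrative note on the diagnostic odds ratio (DOR): the abstract above reports positive diagnostic odds ratios alongside AUROC. The following is a minimal sketch of the standard DOR definition from a 2x2 confusion table, the odds of a positive test among cases divided by the odds of a positive test among non-cases. The counts are hypothetical and are not taken from the study.

```python
def diagnostic_odds_ratio(tp, fp, fn, tn):
    """DOR = (TP / FN) / (FP / TN) = (TP * TN) / (FP * FN)."""
    if fp == 0 or fn == 0:
        # The ratio is undefined with an empty off-diagonal cell; a
        # continuity correction would be needed, which this sketch omits.
        raise ValueError("DOR is undefined when FP or FN is zero")
    return (tp * tn) / (fp * fn)


# Hypothetical screening result: 50 cases and 200 non-cases.
dor = diagnostic_odds_ratio(tp=40, fp=20, fn=10, tn=180)
print(dor)
```

Unlike AUROC, the DOR depends on the chosen positivity threshold, so it describes a single operating point of the classifier.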

9.
JAMA Cardiol ; 9(6): 534-544, 2024 Jun 01.
Article in English | MEDLINE | ID: mdl-38581644

ABSTRACT

Importance: Aortic stenosis (AS) is a major public health challenge with a growing therapeutic landscape, but current biomarkers do not inform personalized screening and follow-up. A video-based artificial intelligence (AI) biomarker (Digital AS Severity index [DASSi]) can detect severe AS using single-view long-axis echocardiography without Doppler characterization. Objective: To deploy DASSi to patients with no AS or with mild or moderate AS at baseline to identify AS development and progression. Design, Setting, and Participants: This is a cohort study that examined 2 cohorts of patients without severe AS undergoing echocardiography in the Yale New Haven Health System (YNHHS; 2015-2021) and Cedars-Sinai Medical Center (CSMC; 2018-2019). A novel computational pipeline for the cross-modal translation of DASSi into cardiac magnetic resonance (CMR) imaging was further developed in the UK Biobank. Analyses were performed between August 2023 and February 2024. Exposure: DASSi (range, 0-1) derived from AI applied to echocardiography and CMR videos. Main Outcomes and Measures: Annualized change in peak aortic valve velocity (AV-Vmax) and late (>6 months) aortic valve replacement (AVR). Results: A total of 12 599 participants were included in the echocardiographic study (YNHHS: n = 8798; median [IQR] age, 71 [60-80] years; 4250 [48.3%] women; median [IQR] follow-up, 4.1 [2.4-5.4] years; and CSMC: n = 3801; median [IQR] age, 67 [54-78] years; 1685 [44.3%] women; median [IQR] follow-up, 3.4 [2.8-3.9] years). 
Higher baseline DASSi was associated with faster progression in AV-Vmax (per 0.1 DASSi increment: YNHHS, 0.033 m/s per year [95% CI, 0.028-0.038] among 5483 participants; CSMC, 0.082 m/s per year [95% CI, 0.053-0.111] among 1292 participants), with values of 0.2 or greater associated with a 4- to 5-fold higher AVR risk than values less than 0.2 (YNHHS: 715 events; adjusted hazard ratio [HR], 4.97 [95% CI, 2.71-5.82]; CSMC: 56 events; adjusted HR, 4.04 [95% CI, 0.92-17.70]), independent of age, sex, race, ethnicity, ejection fraction, and AV-Vmax. This was reproduced across 45 474 participants (median [IQR] age, 65 [59-71] years; 23 559 [51.8%] women; median [IQR] follow-up, 2.5 [1.6-3.9] years) undergoing CMR imaging in the UK Biobank (for participants with DASSi ≥0.2 vs those with DASSi <0.2, adjusted HR, 11.38 [95% CI, 2.56-50.57]). Saliency maps and phenome-wide association studies supported associations with cardiac structure and function and traditional cardiovascular risk factors. Conclusions and Relevance: In this cohort study of patients without severe AS undergoing echocardiography or CMR imaging, a new AI-based video biomarker was independently associated with AS development and progression, enabling opportunistic risk stratification across cardiovascular imaging modalities as well as potential application on handheld devices.


Subject(s)
Aortic Valve Stenosis, Artificial Intelligence, Disease Progression, Echocardiography, Severity of Illness Index, Humans, Aortic Valve Stenosis/diagnostic imaging, Aortic Valve Stenosis/surgery, Aortic Valve Stenosis/physiopathology, Female, Male, Aged, Echocardiography/methods, Middle Aged, Biomarkers, Aged, 80 and over, Cohort Studies, Video Recording, Multimodal Imaging/methods, Magnetic Resonance Imaging/methods

10.
medRxiv ; 2024 Mar 26.
Article in English | MEDLINE | ID: mdl-38585929

ABSTRACT

Randomized clinical trials (RCTs) are essential to guide medical practice; however, their generalizability to a given population is often uncertain. We developed a statistically informed Generative Adversarial Network (GAN) model, RCT-Twin-GAN, that leverages relationships between covariates and outcomes and generates a digital twin of an RCT (RCT-Twin) conditioned on covariate distributions from a second patient population. We used RCT-Twin-GAN to reproduce treatment effect outcomes of the Systolic Blood Pressure Intervention Trial (SPRINT) and the Action to Control Cardiovascular Risk in Diabetes (ACCORD) Blood Pressure Trial, which tested the same intervention but had different treatment effect results. To demonstrate treatment effect estimates of each RCT conditioned on the other RCT patient population, we evaluated the cardiovascular event-free survival of SPRINT digital twins conditioned on the ACCORD cohort and vice versa (SPRINT-conditioned ACCORD twins). The conditioned digital twins were balanced by intervention arm (mean absolute standardized mean difference [MASMD] of covariates between treatment arms 0.019 [SD 0.018]), and the conditioned covariates of the SPRINT-Twin on ACCORD were more similar to ACCORD than to SPRINT (MASMD 0.0082 [SD 0.016] vs. 0.46 [SD 0.20]). Most importantly, across iterations, SPRINT-conditioned ACCORD-Twin datasets reproduced the overall non-significant effect size seen in ACCORD (5-year cardiovascular outcome hazard ratio [95% confidence interval] of 0.88 [0.73-1.06] in ACCORD vs median 0.87 [0.68-1.13] in the SPRINT-conditioned ACCORD-Twin), while the ACCORD-conditioned SPRINT-Twins reproduced the significant effect size seen in SPRINT (0.75 [0.64-0.89] in SPRINT vs median 0.79 [0.72-0.86] in the ACCORD-conditioned SPRINT-Twin). Finally, we describe the translation of this approach to real-world populations by conditioning the trials on an electronic health record population.
Therefore, RCT-Twin-GAN simulates the direct translation of RCT-derived treatment effects across various patient populations with varying covariate distributions.

11.
medRxiv ; 2024 Jun 03.
Article in English | MEDLINE | ID: mdl-38562867

ABSTRACT

Introduction: Portable devices capable of electrocardiogram (ECG) acquisition have the potential to enhance structural heart disease (SHD) management by enabling early detection through artificial intelligence-ECG (AI-ECG) algorithms. However, the performance of these AI algorithms for identifying SHD in a real-world screening setting is unknown. To address this gap, we aim to evaluate the validity of our wearable-adapted AI algorithm, which has been previously developed and validated for detecting SHD from single-lead portable ECGs in patients undergoing routine echocardiograms in the Yale New Haven Hospital (YNHH). Research Methods and Analysis: This is the protocol for a cross-sectional study in the echocardiographic laboratories of YNHH. The study will enroll 585 patients referred for outpatient transthoracic echocardiogram (TTE) as part of their routine clinical care. Patients expressing interest in participating in the study will undergo a screening interview, followed by enrollment upon meeting eligibility criteria and providing informed consent. During their routine visit, patients will undergo a 1-lead ECG with two devices - one with an Apple Watch and the second with another portable 1-lead ECG device. With participant consent, these 1-lead ECG data will be linked to participant demographic and clinical data recorded in the YNHH electronic health records (EHR). The study will assess the performance of the AI-ECG algorithm in identifying SHD, including left ventricular systolic dysfunction (LVSD), valvular disease and severe left ventricular hypertrophy (LVH), by comparing the algorithm's results with data obtained from TTE, which is the established gold standard for diagnosing SHD. Ethics and Dissemination: All patient EHR data required for assessing eligibility and conducting the AI-ECG will be accessed through secure servers approved for protected health information. 
Data will be maintained on secure, encrypted servers for a minimum of five years after the publication of our findings in a peer-reviewed journal, and any unanticipated adverse events or risks will be reported by the principal investigator to the Yale Institutional Review Board, which has reviewed and approved this protocol (Protocol Number: 2000035532).

13.
medRxiv ; 2024 Feb 18.
Article in English | MEDLINE | ID: mdl-38405776

ABSTRACT

Timely and accurate assessment of electrocardiograms (ECGs) is crucial for diagnosing, triaging, and clinically managing patients. Current workflows rely on a computerized ECG interpretation using rule-based tools built into the ECG signal acquisition systems with limited accuracy and flexibility. In low-resource settings, specialists must review every single ECG for such decisions, as these computerized interpretations are not available. Additionally, high-quality interpretations are even more essential in such low-resource settings as there is a higher burden of accuracy for automated reads when access to experts is limited. Artificial Intelligence (AI)-based systems have the prospect of greater accuracy yet are frequently limited to a narrow range of conditions and do not replicate the full diagnostic range. Moreover, these models often require raw signal data, which are unavailable to physicians and necessitate costly technical integrations that are currently limited. To overcome these challenges, we developed and validated a format-independent vision encoder-decoder model - ECG-GPT - that can generate free-text, expert-level diagnosis statements directly from ECG images. The model shows robust performance, validated on 2.6 million ECGs across 6 geographically distinct health settings: (1) 2 large and diverse US health systems- Yale-New Haven and Mount Sinai Health Systems, (2) a consecutive ECG dataset from a central ECG repository from Minas Gerais, Brazil, (3) the prospective cohort study, UK Biobank, (4) a Germany-based, publicly available repository, PTB-XL, and (5) a community hospital in Missouri. The model demonstrated consistently high performance (AUROC≥0.81) across a wide range of rhythm and conduction disorders. 
This can be easily accessed via a web-based application capable of receiving ECG images and represents a scalable and accessible strategy for generating accurate, expert-level reports from images of ECGs, enabling accurate triage of patients globally, especially in low-resource settings.

14.
J Am Med Inform Assoc ; 31(4): 855-865, 2024 Apr 03.
Article in English | MEDLINE | ID: mdl-38269618

ABSTRACT

OBJECTIVE: Artificial intelligence (AI) detects heart disease from images of electrocardiograms (ECGs). However, traditional supervised learning is limited by the need for large amounts of labeled data. We report the development of Biometric Contrastive Learning (BCL), a self-supervised pretraining approach for label-efficient deep learning on ECG images. MATERIALS AND METHODS: Using pairs of ECGs from 78 288 individuals from Yale (2000-2015), we trained a convolutional neural network to identify temporally separated ECG pairs that varied in layouts from the same patient. We fine-tuned BCL-pretrained models to detect atrial fibrillation (AF), gender, and LVEF < 40%, using ECGs from 2015 to 2021. We externally tested the models in cohorts from Germany and the United States. We compared BCL with ImageNet initialization and general-purpose self-supervised contrastive learning for images (simCLR). RESULTS: While with 100% labeled training data, BCL performed similarly to other approaches for detecting AF/Gender/LVEF < 40% with an AUROC of 0.98/0.90/0.90 in the held-out test sets, it consistently outperformed other methods with smaller proportions of labeled data, reaching equivalent performance at 50% of data. With 0.1% data, BCL achieved AUROC of 0.88/0.79/0.75, compared with 0.51/0.52/0.60 (ImageNet) and 0.61/0.53/0.49 (simCLR). In external validation, BCL outperformed other methods even at 100% labeled training data, with an AUROC of 0.88/0.88 for Gender and LVEF < 40% compared with 0.83/0.83 (ImageNet) and 0.84/0.83 (simCLR). DISCUSSION AND CONCLUSION: A pretraining strategy that leverages biometric signatures of different ECGs from the same patient enhances the efficiency of developing AI models for ECG images. This represents a major advance in detecting disorders from ECG images with limited labeled data.
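
The pairing step described in the abstract above (temporally separated ECGs from the same patient that differ in layout) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record fields (`patient`, `ecg`, `date`, `layout`), the `min_gap_days` parameter, and the function name `positive_pairs` are hypothetical, and the abstract does not specify the contrastive loss or how negatives are chosen, so those are left out.

```python
from datetime import date
from itertools import combinations


def positive_pairs(records, min_gap_days=1):
    """Return (ecg_id, ecg_id) pairs taken from the same patient.

    A pair qualifies when the two ECGs were recorded at least
    `min_gap_days` apart and were plotted in different layouts.
    Field names and the gap threshold are illustrative assumptions.
    """
    # Group the records by patient identifier.
    by_patient = {}
    for rec in records:
        by_patient.setdefault(rec["patient"], []).append(rec)

    pairs = []
    for recs in by_patient.values():
        # Sort chronologically so the gap below is never negative.
        recs.sort(key=lambda r: r["date"])
        for a, b in combinations(recs, 2):
            gap = (b["date"] - a["date"]).days
            if gap >= min_gap_days and a["layout"] != b["layout"]:
                pairs.append((a["ecg"], b["ecg"]))
    return pairs


# Toy, made-up records purely for illustration.
records = [
    {"patient": "p1", "ecg": "e1", "date": date(2001, 1, 1), "layout": "standard"},
    {"patient": "p1", "ecg": "e2", "date": date(2003, 6, 1), "layout": "alternate"},
    {"patient": "p1", "ecg": "e3", "date": date(2003, 6, 1), "layout": "standard"},
    {"patient": "p2", "ecg": "e4", "date": date(2005, 1, 1), "layout": "standard"},
]
pairs = positive_pairs(records)
```

In this toy input only `e1` and `e2` qualify: `e1` and `e3` share a layout, `e2` and `e3` were recorded on the same day, and patient `p2` has a single ECG.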


Subject(s)
Atrial Fibrillation , Deep Learning , Humans , Artificial Intelligence , Electrocardiography , Biometry
15.
medRxiv ; 2024 Mar 03.
Article in English | MEDLINE | ID: mdl-38293023

ABSTRACT

Background: Artificial intelligence-enhanced electrocardiography (AI-ECG) can identify hypertrophic cardiomyopathy (HCM) on 12-lead ECGs and offers a novel way to monitor treatment response. While the surgical or percutaneous reduction of the interventricular septum (SRT) represented initial HCM therapies, mavacamten offers an oral alternative. Objective: To evaluate biological response to SRT and mavacamten. Methods: We applied an AI-ECG model for HCM detection to ECG images from patients who underwent SRT across three sites: Yale New Haven Health System (YNHHS), Cleveland Clinic Foundation (CCF), and Atlantic Health System (AHS); and to ECG images from patients receiving mavacamten at YNHHS. Results: A total of 70 patients underwent SRT at YNHHS, 100 at CCF, and 145 at AHS. At YNHHS, there was no significant change in the AI-ECG HCM score before versus after SRT (pre-SRT: median 0.55 [IQR 0.24-0.77] vs post-SRT: 0.59 [0.40-0.75]). The AI-ECG HCM scores also did not improve post SRT at CCF (0.61 [0.32-0.79] vs 0.69 [0.52-0.79]) and AHS (0.52 [0.35-0.69] vs 0.61 [0.49-0.70]). Among 36 YNHHS patients on mavacamten therapy, the median AI-ECG score before starting mavacamten was 0.41 (0.22-0.77), which decreased significantly to 0.28 (0.11-0.50, p <0.001 by Wilcoxon signed-rank test) at the end of a median follow-up period of 237 days. Conclusions: The lack of improvement in AI-based HCM score with SRT, in contrast to a significant decrease with mavacamten, suggests the potential role of AI-ECG for serial monitoring of pathophysiological improvement in HCM at the point-of-care using ECG images.

16.
medRxiv ; 2024 Feb 29.
Article in English | MEDLINE | ID: mdl-37808685

ABSTRACT

Importance: Aortic stenosis (AS) is a major public health challenge with a growing therapeutic landscape, but current biomarkers do not inform personalized screening and follow-up. Objective: A video-based artificial intelligence (AI) biomarker (Digital AS Severity index [DASSi]) can detect severe AS using single-view long-axis echocardiography without Doppler. Here, we deploy DASSi to patients with no or mild/moderate AS at baseline to identify AS development and progression. Design, Setting, and Participants: We defined two cohorts of patients without severe AS undergoing echocardiography in the Yale-New Haven Health System (YNHHS) (2015-2021, 4.1[IQR:2.4-5.4] follow-up years) and Cedars-Sinai Medical Center (CSMC) (2018-2019, 3.4[IQR:2.8-3.9] follow-up years). We further developed a novel computational pipeline for the cross-modality translation of DASSi into cardiac magnetic resonance (CMR) imaging in the UK Biobank (2.5[IQR:1.6-3.9] follow-up years). Analyses were performed between August 2023-February 2024. Exposure: DASSi (range: 0-1) derived from AI applied to echocardiography and CMR videos. Main Outcomes and Measures: Annualized change in peak aortic valve velocity (AV-Vmax) and late (>6 months) aortic valve replacement (AVR). Results: A total of 12,599 participants were included in the echocardiographic study (YNHHS: n=8,798, median age of 71 [IQR (interquartile range):60-80] years, 4250 [48.3%] women, and CSMC: n=3,801, 67 [IQR:54-78] years, 1685 [44.3%] women). Higher baseline DASSi was associated with faster progression in AV-Vmax (per 0.1 DASSi increments: YNHHS: +0.033 m/s/year [95%CI:0.028-0.038], n=5,483, and CSMC: +0.082 m/s/year [0.053-0.111], n=1,292), with levels ≥ vs <0.2 linked to a 4-to-5-fold higher AVR risk (715 events in YNHHS; adj.HR 4.97 [95%CI: 2.71-5.82], 56 events in CSMC: 4.04 [0.92-17.7]), independent of age, sex, ethnicity/race, ejection fraction and AV-Vmax. 
This was reproduced across 45,474 participants (median age 65 [IQR:59-71] years, 23,559 [51.8%] women) undergoing CMR in the UK Biobank (adj.HR 11.4 [95%CI:2.56-50.60] for DASSi ≥vs<0.2). Saliency maps and phenome-wide association studies supported links with traditional cardiovascular risk factors and diastolic dysfunction. Conclusions and Relevance: In this cohort study of patients without severe AS undergoing echocardiography or CMR imaging, a new AI-based video biomarker is independently associated with AS development and progression, enabling opportunistic risk stratification across cardiovascular imaging modalities as well as potential application on handheld devices.

17.
JACC Adv ; 2(7)2023 Sep.
Article in English | MEDLINE | ID: mdl-38094515

ABSTRACT

BACKGROUND: Smartphone-based health applications are increasingly popular, but their real-world use for cardiovascular risk management remains poorly understood. OBJECTIVES: The purpose of this study was to investigate the patterns of tracking health goals using smart devices, including smartphones and/or tablets, in the United States. METHODS: Using the nationally representative Health Information National Trends Survey for 2017 to 2020, we examined self-reported tracking of health-related goals (optimizing body weight, increasing physical activity, and/or quitting smoking) using smart devices among those with cardiovascular disease (CVD) or cardiovascular risk factors of hypertension, diabetes, obesity, and/or smoking. Survey analyses were used to obtain national estimates of use patterns and identify features associated with the use of these devices for tracking health goals. RESULTS: Of 16,092 Health Information National Trends Survey participants, 10,660 had CVD or cardiovascular risk factors, representing 154.2 million (95% CI: 149.2-159.3 million) U.S. adults. Among the general U.S. adult population, 46% (95% CI: 44%-47%) tracked their health goals using their smart devices, compared with 42% (95% CI: 40%-43%) of those with or at risk of CVD. Younger age, female, Black race, higher educational attainment, and greater income were independently associated with tracking of health goals using smart devices. CONCLUSIONS: Two in 5 U.S. adults with or at risk of CVD use their smart devices to track health goals. While representing a potential avenue to improve care, the lower use of smart devices among older and low-income individuals, who are at higher risk of adverse cardiovascular outcomes, requires that digital health interventions are designed so as not to exacerbate existing disparities.

18.
medRxiv ; 2023 Dec 15.
Article in English | MEDLINE | ID: mdl-38106089

ABSTRACT

Background: Randomized clinical trials (RCTs) are designed to produce evidence in selected populations. Assessing their effects in the real world is essential to changing medical practice; however, key populations are historically underrepresented in RCTs. We define an approach to simulate RCT-based effects in real-world settings using RCT digital twins reflecting the covariate patterns in an electronic health record (EHR). Methods: We developed a Generative Adversarial Network (GAN) model, RCT-Twin-GAN, which generates a digital twin of an RCT (RCT-Twin) conditioned on covariate distributions from an EHR cohort. We improved upon a traditional tabular conditional GAN, CTGAN, with a loss function adapted for data distributions and by conditioning on multiple discrete and continuous covariates simultaneously. We assessed the similarity between a Heart Failure with preserved Ejection Fraction (HFpEF) RCT (TOPCAT), a Yale HFpEF EHR cohort, and RCT-Twin. We also evaluated cardiovascular event-free survival stratified by spironolactone (treatment) use. Results: By applying RCT-Twin-GAN to 3445 TOPCAT participants and conditioning on 3445 Yale EHR HFpEF patients, we generated RCT-Twin datasets ranging from 1141 to 3445 patients in size, depending on covariate conditioning and model parameters. RCT-Twin randomly allocated spironolactone (S)/placebo (P) arms like an RCT, was similar to the RCT by a multi-dimensional distance metric, and balanced covariates (median absolute standardized mean difference (MASMD) 0.017, IQR 0.0034-0.030). The 5 EHR-conditioned covariates in RCT-Twin were closer to the EHR compared with the RCT (MASMD 0.008 vs 0.63, IQR 0.005-0.018 vs 0.59-1.11). RCT-Twin reproduced the overall effect size seen in TOPCAT (5-year cardiovascular composite outcome odds ratio (95% confidence interval) of 0.89 (0.75-1.06) in RCT vs 0.85 (0.69-1.04) in RCT-Twin). 
Conclusions: RCT-Twin-GAN simulates RCT-derived effects in real-world patients by translating these effects to the covariate distributions of EHR patients. This key methodological advance may enable the direct translation of RCT-derived effects into real-world patient populations and may enable causal inference in real-world settings.
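
The balance metric reported in the abstract above, the median absolute standardized mean difference (MASMD), can be sketched in Python as follows. The abstract does not give the exact formula, so this sketch assumes the commonly used definition in which the difference in means is divided by the square root of the average of the two sample variances. The function names and the toy values are hypothetical.

```python
from statistics import mean, median, variance


def abs_smd(a, b):
    """Absolute standardized mean difference between two samples.

    Assumes the pooled standard deviation is the square root of the
    average of the two sample variances (an assumption, not a detail
    taken from the abstract).
    """
    pooled_sd = ((variance(a) + variance(b)) / 2) ** 0.5
    if pooled_sd == 0:
        # Degenerate case: both samples are constant.
        return 0.0 if mean(a) == mean(b) else float("inf")
    return abs(mean(a) - mean(b)) / pooled_sd


def masmd(group_a, group_b):
    """Median of per-covariate absolute SMDs.

    Each argument maps a covariate name to a list of values; only
    covariates present in both groups are compared.
    """
    shared = sorted(set(group_a) & set(group_b))
    return median(abs_smd(group_a[name], group_b[name]) for name in shared)


# Toy, made-up values purely for illustration.
twin = {"age": [60, 70, 80], "sbp": [120, 130, 140]}
ehr = {"age": [62, 72, 82], "sbp": [120, 130, 140]}
result = masmd(twin, ehr)
```

With these toy values the per-covariate differences are 0.2 for `age` and 0.0 for `sbp`, so the median is 0.1. Values near zero indicate that the two groups are closely balanced on the compared covariates.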

19.
medRxiv ; 2023 Nov 01.
Article in English | MEDLINE | ID: mdl-37961715

ABSTRACT

Randomized controlled trials (RCT) represent the cornerstone of evidence-based medicine but are resource-intensive. We propose and evaluate a machine learning (ML) strategy of adaptive predictive enrichment through computational trial phenomaps to optimize RCT enrollment. In simulated group sequential analyses of two large cardiovascular outcomes RCTs of (1) a therapeutic drug (pioglitazone versus placebo; Insulin Resistance Intervention after Stroke (IRIS) trial), and (2) a disease management strategy (intensive versus standard systolic blood pressure reduction in the Systolic Blood Pressure Intervention Trial (SPRINT)), we constructed dynamic phenotypic representations to infer response profiles during interim analyses and examined their association with study outcomes. Across three interim timepoints, our strategy learned dynamic phenotypic signatures predictive of individualized cardiovascular benefit. By conditioning a prospective candidate's probability of enrollment on their predicted benefit, we estimate that our approach would have enabled a reduction in the final trial size across ten simulations (IRIS: -14.8% ± 3.1%, one-sample t-test p=0.001; SPRINT: -17.6% ± 3.6%, one-sample t-test p<0.001), while preserving the original average treatment effect (IRIS: hazard ratio of 0.73 ± 0.01 for pioglitazone vs placebo, vs 0.76 in the original trial; SPRINT: hazard ratio of 0.72 ± 0.01 for intensive vs standard systolic blood pressure, vs 0.75 in the original trial; all with one-sample t-test p<0.01). This adaptive framework has the potential to maximize RCT enrollment efficiency.

20.
NPJ Digit Med ; 6(1): 217, 2023 Nov 25.
Article in English | MEDLINE | ID: mdl-38001154

ABSTRACT

Randomized clinical trials (RCT) represent the cornerstone of evidence-based medicine but are resource-intensive. We propose and evaluate a machine learning (ML) strategy of adaptive predictive enrichment through computational trial phenomaps to optimize RCT enrollment. In simulated group sequential analyses of two large cardiovascular outcomes RCTs of (1) a therapeutic drug (pioglitazone versus placebo; Insulin Resistance Intervention after Stroke (IRIS) trial), and (2) a disease management strategy (intensive versus standard systolic blood pressure reduction in the Systolic Blood Pressure Intervention Trial (SPRINT)), we constructed dynamic phenotypic representations to infer response profiles during interim analyses and examined their association with study outcomes. Across three interim timepoints, our strategy learned dynamic phenotypic signatures predictive of individualized cardiovascular benefit. By conditioning a prospective candidate's probability of enrollment on their predicted benefit, we estimate that our approach would have enabled a reduction in the final trial size across ten simulations (IRIS: -14.8% ± 3.1%, one-sample t-test p = 0.001; SPRINT: -17.6% ± 3.6%, one-sample t-test p < 0.001), while preserving the original average treatment effect (IRIS: hazard ratio of 0.73 ± 0.01 for pioglitazone vs placebo, vs 0.76 in the original trial; SPRINT: hazard ratio of 0.72 ± 0.01 for intensive vs standard systolic blood pressure, vs 0.75 in the original trial; all simulations with Cox regression-derived p value of < 0.01 for the effect of the intervention on the respective primary outcome). This adaptive framework has the potential to maximize RCT enrollment efficiency.
