Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 225
Filter
1.
medRxiv ; 2024 May 16.
Article in English | MEDLINE | ID: mdl-38798457

ABSTRACT

Importance: Randomized clinical trials (RCTs) are the standard for defining an evidence-based approach to managing disease, but their generalizability to real-world patients remains challenging to quantify. Objective: To develop a multidimensional patient variable mapping algorithm to quantify the similarity and representation of electronic health record (EHR) patients corresponding to an RCT and estimate the putative treatment effects in real-world settings based on individual treatment effects observed in an RCT. Design: A retrospective analysis of the Treatment of Preserved Cardiac Function Heart Failure with an Aldosterone Antagonist Trial (TOPCAT; 2006-2012) and a multi-hospital patient cohort from the electronic health record (EHR) in the Yale New Haven Hospital System (YNHHS; 2015-2023). Setting: A multicenter international RCT (TOPCAT) and multi-hospital patient cohort (YNHHS). Participants: All TOPCAT participants and patients with heart failure with preserved ejection fraction (HFpEF) and ≥1 hospitalization within YNHHS. Exposures: 63 pre-randomization characteristics measured across the TOPCAT and YNNHS cohorts. Main Outcomes and Measures: Real-world generalizability of the RCT TOPCAT using a multidimensional phenotypic distance metric between TOPCAT and YNHHS cohorts. Estimation of the individualized treatment effect of spironolactone use on all-cause mortality within the YNHHS cohort based on phenotypic distance from the TOPCAT cohort. Results: There were 3,445 patients in TOPCAT and 11,712 HFpEF patients across five hospital sites. Across the 63 TOPCAT variables mapped by clinicians to the EHR, there were larger differences between TOPCAT and each of the 5 EHR sites (median SMD 0.200, IQR 0.037-0.410) than between the 5 EHR sites (median SMD 0.062, IQR 0.010-0.130). The synthesis of these differences across covariates using our multidimensional similarity score also suggested substantial phenotypic dissimilarity between the TOPCAT and EHR cohorts. By phenotypic distance, a majority (55%) of TOPCAT participants were closer to each other than any individual EHR patient. Using a TOPCAT-derived model of individualized treatment benefit from spironolactone, those predicted to derive benefit and receiving spironolactone in the EHR cohorts had substantially better outcomes compared with predicted benefit and not receiving the medication (HR 0.74, 95% CI 0.62-0.89). Conclusions and Relevance: We propose a novel approach to evaluating the real-world representativeness of RCT participants against corresponding patients in the EHR across the full multidimensional spectrum of the represented phenotypes. This enables the evaluation of the implications of RCTs for real-world patients. KEY POINTS: Question: How can we examine the multi-dimensional generalizability of randomized clinical trials (RCT) to real-world patient populations?Findings: We demonstrate a novel phenotypic distance metric comparing an RCT to real-world populations in a large multicenter RCT of heart failure patients and the corresponding patients in multisite electronic health records (EHRs). Across 63 pre-randomization characteristics, pairwise assessments of members of the RCT and EHR cohorts were more discordant from each other than between members of the EHR cohort (median standardized mean difference 0.200 [0.037-0.410] vs 0.062 [0.010-0.130]), with a majority (55%) of RCT participants closer to each other than any individual EHR patient. The approach also enabled the quantification of expected real world outcomes based on effects observed in the RCT.Meaning: A multidimensional phenotypic distance metric quantifies the generalizability of RCTs to a given population while also offering an avenue to examine expected real-world patient outcomes based on treatment effects observed in the RCT.

2.
Eur Heart J Digit Health ; 5(3): 303-313, 2024 May.
Article in English | MEDLINE | ID: mdl-38774380

ABSTRACT

Aims: An algorithmic strategy for anatomical vs. functional testing in suspected coronary artery disease (CAD) (Anatomical vs. Stress teSting decIsion Support Tool; ASSIST) is associated with better outcomes than random selection. However, in the real world, this decision is rarely random. We explored the agreement between a provider-driven vs. simulated algorithmic approach to cardiac testing and its association with outcomes across multinational cohorts. Methods and results: In two cohorts of functional vs. anatomical testing in a US hospital health system [Yale; 2013-2023; n = 130 196 (97.0%) vs. n = 4020 (3.0%), respectively], and the UK Biobank [n = 3320 (85.1%) vs. n = 581 (14.9%), respectively], we examined outcomes stratified by agreement between the real-world and ASSIST-recommended strategies. Younger age, female sex, Black race, and diabetes history were independently associated with lower odds of ASSIST-aligned testing. Over a median of 4.9 (interquartile range [IQR]: 2.4-7.1) and 5.4 (IQR: 2.6-8.8) years, referral to the ASSIST-recommended strategy was associated with a lower risk of acute myocardial infarction or death (hazard ratioadjusted: 0.81, 95% confidence interval [CI] 0.77-0.85, P < 0.001 and 0.74 [95% CI 0.60-0.90], P = 0.003, respectively), an effect that remained significant across years, test types, and risk profiles. In post hoc analyses of anatomical-first testing in the Prospective Multicentre Imaging Study for Evaluation of Chest Pain (PROMISE) trial, alignment with ASSIST was independently associated with a 17% and 30% higher risk of detecting CAD in any vessel or the left main artery/proximal left anterior descending coronary artery, respectively. Conclusion: In cohorts where historical practices largely favour functional testing, alignment with an algorithmic approach to cardiac testing defined by ASSIST was associated with a lower risk of adverse outcomes. This highlights the potential utility of a data-driven approach in the diagnostic management of CAD.

3.
Am J Med ; 2024 May 10.
Article in English | MEDLINE | ID: mdl-38735354

ABSTRACT

INTRODUCTION: Individuals with long COVID lack evidence-based treatments and have difficulty participating in traditional site-based trials. Our digital, decentralized trial investigates the efficacy and safety of nirmatrelvir/ritonavir, targeting viral persistence as a potential cause of long COVID. METHODS: The PAX LC trial (NCT05668091) is a Phase 2, 1:1 randomized, double-blind, superiority, placebo-controlled trial in 100 community-dwelling, highly symptomatic adult participants with long COVID residing in the 48 contiguous US states to determine the efficacy, safety, and tolerability of 15 days of nirmatrelvir/ritonavir compared with placebo/ritonavir. Participants are recruited via patient groups, cultural ambassadors, and social media platforms. Medical records are reviewed through a platform facilitating participant-mediated data acquisition from electronic health records nationwide. During the drug treatment, participants complete daily digital diaries using a web-based application. Blood draws for eligibility and safety assessments are conducted at or near participants' homes. The study drug is shipped directly to participants' homes. The primary endpoint is the PROMIS-29 Physical Health Summary Score difference between baseline and Day 28, evaluated by a mixed model repeated measure analysis. Secondary endpoints include PROMIS-29 (Mental Health Summary Score and all items), Modified GSQ-30 with supplemental symptoms questionnaire, COVID Core Outcome Measures for Recovery, EQ-5D-5L (Utility Score and all items), PGIS 1 and 2, PGIC 1 and 2, and healthcare utilization. The trial incorporates immunophenotyping to identify long COVID biomarkers and treatment responders. CONCLUSION: The PAX LC trial uses a novel decentralized design and a participant-centric approach to test a 15-day regimen of nirmatrelvir/ritonavir for long COVID.

4.
Circ Genom Precis Med ; : e000095, 2024 May 23.
Article in English | MEDLINE | ID: mdl-38779844

ABSTRACT

Wearable devices are increasingly used by a growing portion of the population to track health and illnesses. The data emerging from these devices can potentially transform health care. This requires an interoperability framework that enables the deployment of platforms, sensors, devices, and software applications within diverse health systems, aiming to facilitate innovation in preventing and treating cardiovascular disease. However, the current data ecosystem includes several noninteroperable systems that inhibit such objectives. The design of clinically meaningful systems for accessing and incorporating these data into clinical workflows requires strategies to ensure the quality of data and clinical content and patient and caregiver accessibility. This scientific statement aims to address the best practices, gaps, and challenges pertaining to data interoperability in this area, with considerations for (1) data integration and the scope of measures, (2) application of these data into clinical approaches/strategies, and (3) regulatory/ethical/legal issues.

5.
Article in English | MEDLINE | ID: mdl-38687616

ABSTRACT

OBJECTIVES: The study developed framework that leverages an open-source Large Language Model (LLM) to enable clinicians to ask plain-language questions about a patient's entire echocardiogram report history. This approach is intended to streamline the extraction of clinical insights from multiple echocardiogram reports, particularly in patients with complex cardiac diseases, thereby enhancing both patient care and research efficiency. MATERIALS AND METHODS: Data from over 10 years were collected, comprising echocardiogram reports from patients with more than 10 echocardiograms on file at the Mount Sinai Health System. These reports were converted into a single document per patient for analysis, broken down into snippets and relevant snippets were retrieved using text similarity measures. The LLaMA-2 70B model was employed for analyzing the text using a specially crafted prompt. The model's performance was evaluated against ground-truth answers created by faculty cardiologists. RESULTS: The study analyzed 432 reports from 37 patients for a total of 100 question-answer pairs. The LLM correctly answered 90% questions, with accuracies of 83% for temporality, 93% for severity assessment, 84% for intervention identification, and 100% for diagnosis retrieval. Errors mainly stemmed from the LLM's inherent limitations, such as misinterpreting numbers or hallucinations. CONCLUSION: The study demonstrates the feasibility and effectiveness of using a local, open-source LLM for querying and interpreting echocardiogram report data. This approach offers a significant improvement over traditional keyword-based searches, enabling more contextually relevant and semantically accurate responses; in turn showing promise in enhancing clinical decision-making and research by facilitating more efficient access to complex patient data.

6.
Am J Cardiol ; 222: 39-50, 2024 Apr 26.
Article in English | MEDLINE | ID: mdl-38677666

ABSTRACT

The practice patterns and outcomes of protected left main (PLM) and unprotected left main (ULM) percutaneous coronary intervention (PCI) are not well defined in contemporary US clinical practice. Data were collected from all Veteran Affairs catheterization laboratories participating in the Clinical Assessment Reporting and Tracking Program between 2009 and 2019. The analysis included 4,351 patients who underwent left main PCI, of whom 1,306 pairs of PLM and ULM PCI were included in a propensity-matched cohort. Selected temporal trends were also assessed. The primary outcome was major adverse cardiovascular event (MACE) outcomes at 1 year, which was defined as a composite of all-cause mortality, rehospitalization for myocardial infarction (MI), rehospitalization for stroke, or urgent revascularization. Patients who underwent ULM PCI compared with patients who underwent PLM PCI were older (age 71.5 vs 69.2 years, p <0.001), more clinically complex, and more likely to present with acute coronary syndrome. In the propensity-matched cohort, radial access was used more often for ULM PCI (21% [273] vs 14% [185], p <0.001) and ULM PCI was more likely to involve the left main bifurcation (22% vs 14%, p = 0.003) and require mechanical circulatory support (10% [134] vs 1% [17], p <0.001). The 1-year MACEs occurred more frequently with ULM PCI than PLM PCI (22% [289] vs 16% [215], p ≤0.001) and all-cause mortality was also higher (16% [213] vs 10% [125], p ≤0.001). In the matched cohort, there was a low incidence of rehospitalization for MI (4% [48] ULM vs 4% [48] PLM, p = 1.000) or revascularization (7% [94] ULM vs 6% [84] PLM, p = 0.485). In this real-world experience, patients who underwent PLM PCI had better 1-year outcomes than those who underwent ULM PCI; however, in both groups, there was a high rate of mortality and MACEs at 1 year despite a relatively low rate of MI or revascularization.

8.
medRxiv ; 2024 Apr 01.
Article in English | MEDLINE | ID: mdl-38633789

ABSTRACT

Introduction: Serial functional status assessments are critical to heart failure (HF) management but are often described narratively in documentation, limiting their use in quality improvement or patient selection for clinical trials. We developed and validated a deep learning-based natural language processing (NLP) strategy to extract functional status assessments from unstructured clinical notes. Methods: We identified 26,577 HF patients across outpatient services at Yale New Haven Hospital (YNHH), Greenwich Hospital (GH), and Northeast Medical Group (NMG) (mean age 76.1 years; 52.0% women). We used expert annotated notes from YNHH for model development/internal testing and from GH and NMG for external validation. The primary outcomes were NLP models to detect (a) explicit New York Heart Association (NYHA) classification, (b) HF symptoms during activity or rest, and (c) functional status assessment frequency. Results: Among 3,000 expert-annotated notes, 13.6% mentioned NYHA class, and 26.5% described HF symptoms. The model to detect NYHA classes achieved a class-weighted AUROC of 0.99 (95% CI: 0.98-1.00) at YNHH, 0.98 (0.96-1.00) at NMG, and 0.98 (0.92-1.00) at GH. The activity-related HF symptom model achieved an AUROC of 0.94 (0.89-0.98) at YNHH, 0.94 (0.91-0.97) at NMG, and 0.95 (0.92-0.99) at GH. Deploying the NYHA model among 166,655 unannotated notes from YNHH identified 21,528 (12.9%) with NYHA mentions and 17,642 encounters (10.5%) classifiable into functional status groups based on activity-related symptoms. Conclusions: We developed and validated an NLP approach to extract NYHA classification and activity-related HF symptoms from clinical notes, enhancing the ability to track optimal care and identify trial-eligible patients.

9.
medRxiv ; 2024 Apr 03.
Article in English | MEDLINE | ID: mdl-38633808

ABSTRACT

Background: Current risk stratification strategies for heart failure (HF) risk require either specific blood-based biomarkers or comprehensive clinical evaluation. In this study, we evaluated the use of artificial intelligence (AI) applied to images of electrocardiograms (ECGs) to predict HF risk. Methods: Across multinational longitudinal cohorts in the integrated Yale New Haven Health System (YNHHS) and in population-based UK Biobank (UKB) and Brazilian Longitudinal Study of Adult Health (ELSA-Brasil), we identified individuals without HF at baseline. Incident HF was defined based on the first occurrence of an HF hospitalization. We evaluated an AI-ECG model that defines the cross-sectional probability of left ventricular dysfunction from a single image of a 12-lead ECG and its association with incident HF. We accounted for the competing risk of death using the Fine-Gray subdistribution model and evaluated the discrimination using Harrel's c-statistic. The pooled cohort equations to prevent HF (PCP-HF) were used as a comparator for estimating incident HF risk. Results: Among 231,285 individuals at YNHHS, 4472 had a primary HF hospitalization over 4.5 years (IQR 2.5-6.6) of follow-up. In UKB and ELSA-Brasil, among 42,741 and 13,454 people, 46 and 31 developed HF over a follow-up of 3.1 (2.1-4.5) and 4.2 (3.7-4.5) years, respectively. A positive AI-ECG screen portended a 4-fold higher risk of incident HF among YNHHS patients (age-, sex-adjusted HR [aHR] 3.88 [95% CI, 3.63-4.14]). In UKB and ELSA-Brasil, a positive-screen ECG portended 13- and 24-fold higher hazard of incident HF, respectively (aHR: UKBB, 12.85 [6.87-24.02]; ELSA-Brasil, 23.50 [11.09-49.81]). The association was consistent after accounting for comorbidities and the competing risk of death. Higher model output probabilities were progressively associated with a higher risk for HF. The model's discrimination for incident HF was 0.718 in YNHHS, 0.769 in UKB, and 0.810 in ELSA-Brasil. Across cohorts, incorporating model probability with PCP-HF yielded a significant improvement in discrimination over PCP-HF alone. Conclusions: An AI model applied to images of 12-lead ECGs can identify those at elevated risk of HF across multinational cohorts. As a digital biomarker of HF risk that requires just an ECG image, this AI-ECG approach can enable scalable and efficient screening for HF risk.

10.
medRxiv ; 2024 Mar 15.
Article in English | MEDLINE | ID: mdl-38559021

ABSTRACT

Background: Point-of-care ultrasonography (POCUS) enables access to cardiac imaging directly at the bedside but is limited by brief acquisition, variation in acquisition quality, and lack of advanced protocols. Objective: To develop and validate deep learning models for detecting underdiagnosed cardiomyopathies on cardiac POCUS, leveraging a novel acquisition quality-adapted modeling strategy. Methods: To develop the models, we identified transthoracic echocardiograms (TTEs) of patients across five hospitals in a large U.S. health system with transthyretin amyloid cardiomyopathy (ATTR-CM, confirmed by Tc99m-pyrophosphate imaging), hypertrophic cardiomyopathy (HCM, confirmed by cardiac magnetic resonance), and controls enriched for the presence of severe AS. In a sample of 290,245 TTE videos, we used novel augmentation approaches and a customized loss function to weigh image and view quality to train a multi-label, view agnostic video-based convolutional neural network (CNN) to discriminate the presence of ATTR-CM, HCM, and/or AS. Models were tested across 3,758 real-world POCUS videos from 1,879 studies in 1,330 independent emergency department (ED) patients from 2011 through 2023. Results: Our multi-label, view-agnostic classifier demonstrated state-of-the-art performance in discriminating ATTR-CM (AUROC 0.98 [95%CI: 0.96-0.99]) and HCM (AUROC 0.95 [95% CI: 0.94-0.96]) on standard TTE studies. Automated metrics of anatomical view correctness confirmed significantly lower quality in POCUS vs TTE videos (median view classifier confidence of 0.63 [IQR: 0.44-0.88] vs 0.93 [IQR: 0.69-1.00], p<0.001). When deployed to POCUS videos, our algorithm effectively discriminated ATTR-CM and HCM with AUROC of up to 0.94 (parasternal long-axis (PLAX)), and 0.85 (apical 4 chamber), corresponding to positive diagnostic odds ratios of 46.7 and 25.5, respectively. In total, 18/35 (51.4%) of ATTR-CM and 32/57 (41.1%) of HCM patients in the POCUS cohort had an AI-positive screen in the year before their eventual confirmatory imaging. Conclusions: We define and validate an AI framework that enables scalable, opportunistic screening of under-diagnosed cardiomyopathies using POCUS.

11.
J Am Coll Cardiol ; 2024 Mar 26.
Article in English | MEDLINE | ID: mdl-38593945

ABSTRACT

Recent Artificial Intelligence (AI) advancements in cardiovascular care offer potential enhancements in effective diagnosis, treatment, and outcomes. Over 600 Food and Drug Administration (FDA)-approved clinical AI algorithms now exist, with 10% focusing on cardiovascular applications, highlighting the growing opportunities for AI to augment care. This review discusses the latest advancements in the field of AI, with a particular focus on the utilization of multimodal inputs and the field of generative AI. Further discussions in this review involve an approach to understanding the larger context in which AI-augmented care may exist, and include a discussion of the need for rigorous evaluation, appropriate infrastructure for deployment, ethics and equity assessments, regulatory oversight, and viable business cases for deployment. Embracing this rapidly evolving technology while setting an appropriately high evaluation benchmark with careful and patient-centered implementation will be crucial for cardiology to leverage AI to enhance patient care and the provider experience.

12.
J Am Coll Cardiol ; 2024 Mar 26.
Article in English | MEDLINE | ID: mdl-38593946

ABSTRACT

Recent AI advancements in cardiovascular care offer potential enhancements in diagnosis, treatment, and outcomes. Innovations to date focus on automating measurements, enhancing image quality, and detecting diseases using novel methods. Applications span wearables, electrocardiograms, echocardiography, angiography, genetics, and more. AI models detect diseases from electrocardiograms at accuracy not previously achieved by technology or human experts, including reduced ejection fraction, valvular heart disease, and other cardiomyopathies. However, AI's unique characteristics necessitates rigorous validation by addressing training methods, real-world efficacy, equity concerns, and long-term reliability. Despite an exponentially growing number of studies in cardiovascular AI, trials showing improvement in outcomes remain lacking. A number are currently underway. Embracing this rapidly evolving technology while setting a high evaluation benchmark will be crucial for cardiology to leverage AI to enhance patient care and the provider experience.

13.
JAMA Cardiol ; 2024 Apr 06.
Article in English | MEDLINE | ID: mdl-38581644

ABSTRACT

Importance: Aortic stenosis (AS) is a major public health challenge with a growing therapeutic landscape, but current biomarkers do not inform personalized screening and follow-up. A video-based artificial intelligence (AI) biomarker (Digital AS Severity index [DASSi]) can detect severe AS using single-view long-axis echocardiography without Doppler characterization. Objective: To deploy DASSi to patients with no AS or with mild or moderate AS at baseline to identify AS development and progression. Design, Setting, and Participants: This is a cohort study that examined 2 cohorts of patients without severe AS undergoing echocardiography in the Yale New Haven Health System (YNHHS; 2015-2021) and Cedars-Sinai Medical Center (CSMC; 2018-2019). A novel computational pipeline for the cross-modal translation of DASSi into cardiac magnetic resonance (CMR) imaging was further developed in the UK Biobank. Analyses were performed between August 2023 and February 2024. Exposure: DASSi (range, 0-1) derived from AI applied to echocardiography and CMR videos. Main Outcomes and Measures: Annualized change in peak aortic valve velocity (AV-Vmax) and late (>6 months) aortic valve replacement (AVR). Results: A total of 12 599 participants were included in the echocardiographic study (YNHHS: n = 8798; median [IQR] age, 71 [60-80] years; 4250 [48.3%] women; median [IQR] follow-up, 4.1 [2.4-5.4] years; and CSMC: n = 3801; median [IQR] age, 67 [54-78] years; 1685 [44.3%] women; median [IQR] follow-up, 3.4 [2.8-3.9] years). Higher baseline DASSi was associated with faster progression in AV-Vmax (per 0.1 DASSi increment: YNHHS, 0.033 m/s per year [95% CI, 0.028-0.038] among 5483 participants; CSMC, 0.082 m/s per year [95% CI, 0.053-0.111] among 1292 participants), with values of 0.2 or greater associated with a 4- to 5-fold higher AVR risk than values less than 0.2 (YNHHS: 715 events; adjusted hazard ratio [HR], 4.97 [95% CI, 2.71-5.82]; CSMC: 56 events; adjusted HR, 4.04 [95% CI, 0.92-17.70]), independent of age, sex, race, ethnicity, ejection fraction, and AV-Vmax. This was reproduced across 45 474 participants (median [IQR] age, 65 [59-71] years; 23 559 [51.8%] women; median [IQR] follow-up, 2.5 [1.6-3.9] years) undergoing CMR imaging in the UK Biobank (for participants with DASSi ≥0.2 vs those with DASSi <.02, adjusted HR, 11.38 [95% CI, 2.56-50.57]). Saliency maps and phenome-wide association studies supported associations with cardiac structure and function and traditional cardiovascular risk factors. Conclusions and Relevance: In this cohort study of patients without severe AS undergoing echocardiography or CMR imaging, a new AI-based video biomarker was independently associated with AS development and progression, enabling opportunistic risk stratification across cardiovascular imaging modalities as well as potential application on handheld devices.

14.
Am J Cardiol ; 221: 19-28, 2024 Apr 05.
Article in English | MEDLINE | ID: mdl-38583700

ABSTRACT

Cardiogenic shock after acute myocardial infarction (AMI-CS) carries significant mortality despite advances in revascularization and mechanical circulatory support. We sought to identify the process-based and structural characteristics of centers with lower mortality in AMI-CS. We analyzed 16,337 AMI-CS cases across 440 centers enrolled in the National Cardiovascular Data Registry's Chest Pain-MI Registry, a retrospective cohort database, between January 1, 2015, and December 31, 2018. Centers were stratified across tertiles of risk-adjusted in-hospital mortality rate (RAMR) for comparison. Risk-adjusted multivariable logistic regression was also performed to identify hospital-level characteristics associated with decreased mortality. The median participant age was 66 (interquartile range 57 to 75) years, and 33.0% (n = 5,390) were women. The median RAMR was 33.4% (interquartile range 26.0% to 40.0%) and ranged from 26.9% to 50.2% across tertiles. Even after risk adjustment, lower-RAMR centers saw patients with fewer co-morbidities. Lower-RAMR centers performed more revascularization (92.8% vs 90.6% vs 85.9%, p <0.001) and demonstrated better adherence to associated process measures. Left ventricular assist device capability (odds ratio [OR] 0.78 [0.67 to 0.92], p = 0.002), more frequent revascularization (OR 0.93 [0.88 to 0.98], p = 0.006), and higher AMI-CS volume (OR 0.95 [0.91 to 0.99], p = 0.009) were associated with lower in-hospital mortality. However, several such characteristics were not more frequently observed at low-RAMR centers, despite potentially reflecting greater institutional experience or resources. This may reflect the heterogeneity of AMI-CS even after risk adjustment. In conclusion, low-RAMR centers do not necessarily exhibit factors associated with decreased mortality in AMI-CS, which may reflect the challenges in performing outcomes research in this complex population.

15.
medRxiv ; 2024 Mar 26.
Article in English | MEDLINE | ID: mdl-38585929

ABSTRACT

Randomized clinical trials (RCTs) are essential to guide medical practice; however, their generalizability to a given population is often uncertain. We developed a statistically informed Generative Adversarial Network (GAN) model, RCT-Twin-GAN, that leverages relationships between covariates and outcomes and generates a digital twin of an RCT (RCT-Twin) conditioned on covariate distributions from a second patient population. We used RCT-Twin-GAN to reproduce treatment effect outcomes of the Systolic Blood Pressure Intervention Trial (SPRINT) and the Action to Control Cardiovascular Risk in Diabetes (ACCORD) Blood Pressure Trial, which tested the same intervention but had different treatment effect results. To demonstrate treatment effect estimates of each RCT conditioned on the other RCT patient population, we evaluated the cardiovascular event-free survival of SPRINT digital twins conditioned on the ACCORD cohort and vice versa (SPRINT-conditioned ACCORD twins). The conditioned digital twins were balanced by the intervention arm (mean absolute standardized mean difference (MASMD) of covariates between treatment arms 0.019 (SD 0.018), and the conditioned covariates of the SPRINT-Twin on ACCORD were more similar to ACCORD than a sprint (MASMD 0.0082 SD 0.016 vs. 0.46 SD 0.20). Most importantly, across iterations, SPRINT conditioned ACCORD-Twin datasets reproduced the overall non-significant effect size seen in ACCORD (5-year cardiovascular outcome hazard ratio (95% confidence interval) of 0.88 (0.73-1.06) in ACCORD vs median 0.87 (0.68-1.13) in the SPRINT conditioned ACCORD-Twin), while the ACCORD conditioned SPRINT-Twins reproduced the significant effect size seen in SPRINT (0.75 (0.64-0.89) vs median 0.79 (0.72-0.86)) in ACCORD conditioned SPRINT-Twin). Finally, we describe the translation of this approach to real-world populations by conditioning the trials on an electronic health record population. Therefore, RCT-Twin-GAN simulates the direct translation of RCT-derived treatment effects across various patient populations with varying covariate distributions.

16.
medRxiv ; 2024 Mar 19.
Article in English | MEDLINE | ID: mdl-38562867

ABSTRACT

Introduction: Portable devices capable of electrocardiogram (ECG) acquisition have the potential to enhance structural heart disease (SHD) management by enabling early detection through artificial intelligence-ECG (AI-ECG) algorithms. However, the performance of these AI algorithms for identifying SHD in a real-world screening setting is unknown. To address this gap, we aim to evaluate the validity of our wearable-adapted AI algorithm, which has been previously developed and validated for detecting SHD from single-lead portable ECGs in patients undergoing routine echocardiograms in the Yale New Haven Hospital (YNHH). Research Methods and Analysis: This is the protocol for a cross-sectional study in the echocardiographic laboratories of YNHH. The study will enroll 585 patients referred for outpatient transthoracic echocardiogram (TTE) as part of their routine clinical care. Patients expressing interest in participating in the study will undergo a screening interview, followed by enrollment upon meeting eligibility criteria and providing informed consent. During their routine visit, patients will undergo a 1-lead ECG with two devices - one with an Apple Watch and the second with another portable 1-lead ECG device. With participant consent, these 1-lead ECG data will be linked to participant demographic and clinical data recorded in the YNHH electronic health records (EHR). The study will assess the performance of the AI-ECG algorithm in identifying SHD, including left ventricular systolic dysfunction (LVSD), valvular disease and severe left ventricular hypertrophy (LVH), by comparing the algorithm's results with data obtained from TTE, which is the established gold standard for diagnosing SHD. Ethics and Dissemination: All patient EHR data required for assessing eligibility and conducting the AI-ECG will be accessed through secure servers approved for protected health information. Data will be maintained on secure, encrypted servers for a minimum of five years after the publication of our findings in a peer-reviewed journal, and any unanticipated adverse events or risks will be reported by the principal investigator to the Yale Institutional Review Board, which has reviewed and approved this protocol (Protocol Number: 2000035532).

17.
medRxiv ; 2024 Mar 19.
Article in English | MEDLINE | ID: mdl-38562897

ABSTRACT

Background: Risk stratification strategies for cancer therapeutics-related cardiac dysfunction (CTRCD) rely on serial monitoring by specialized imaging, limiting their scalability. Objectives: To examine an artificial intelligence (AI)-enhanced electrocardiographic (AI-ECG) surrogate for imaging risk biomarkers, and its association with CTRCD. Methods: Across a five-hospital U.S.-based health system (2013-2023), we identified patients with breast cancer or non-Hodgkin lymphoma (NHL) who received anthracyclines (AC) and/or trastuzumab (TZM), and a control cohort receiving immune checkpoint inhibitors (ICI). We deployed a validated AI model of left ventricular systolic dysfunction (LVSD) to ECG images (≥0.1, positive screen) and explored its association with i) global longitudinal strain (GLS) measured within 15 days (n=7,271 pairs); ii) future CTRCD (new cardiomyopathy, heart failure, or left ventricular ejection fraction [LVEF]<50%), and LVEF<40%. In the ICI cohort we correlated baseline AI-ECG-LVSD predictions with downstream myocarditis. Results: Higher AI-ECG LVSD predictions were associated with worse GLS (-18% [IQR:-20 to -17%] for predictions<0.1, to -12% [IQR:-15 to -9%] for ≥0.5 (p<0.001)). In 1,308 patients receiving AC/TZM (age 59 [IQR:49-67] years, 999 [76.4%] women, 80 [IQR:42-115] follow-up months) a positive baseline AI-ECG LVSD screen was associated with ~2-fold and ~4.8-fold increase in the incidence of the composite CTRCD endpoint (adj.HR 2.22 [95%CI:1.63-3.02]), and LVEF<40% (adj.HR 4.76 [95%CI:2.62-8.66]), respectively. Among 2,056 patients receiving ICI (age 65 [IQR:57-73] years, 913 [44.4%] women, follow-up 63 [IQR:28-99] months) AI-ECG predictions were not associated with ICI myocarditis (adj.HR 1.36 [95%CI:0.47-3.93]). Conclusion: AI applied to baseline ECG images can stratify the risk of CTRCD associated with anthracycline or trastuzumab exposure.

19.
Circ Cardiovasc Qual Outcomes ; 17(3): e010404, 2024 03.
Article in English | MEDLINE | ID: mdl-38502717
SELECTION OF CITATIONS
SEARCH DETAIL
...