Search | VHL Regional Portal

Augmented intelligence facilitates concept mapping across different electronic health records.

Dam, Tariq A; Fleuren, Lucas M; Roggeveen, Luca F; Otten, Martijn; Biesheuvel, Laurens; Jagesar, Ameet R; Lalisang, Robbert C A; Kullberg, Robert F J; Hendriks, Tom; Girbes, Armand R J; Hoogendoorn, Mark; Thoral, Patrick J; Elbers, Paul W G.

Int J Med Inform ; 179: 105233, 2023 Nov.

Article in English | MEDLINE | ID: mdl-37748329

ABSTRACT

INTRODUCTION: With the advent of artificial intelligence, the secondary use of routinely collected medical data from electronic healthcare records (EHR) has become increasingly popular. However, different EHR systems typically use different names for the same medical concepts. This obviously hampers scalable model development and subsequent clinical implementation for decision support. Therefore, converting original parameter names to a so-called ontology, a standardized set of predefined concepts, is necessary but time-consuming and labor-intensive. We therefore propose an augmented intelligence approach to facilitate ontology alignment by predicting correct concepts based on parameter names from raw electronic health record data exports. METHODS: We used the manually mapped parameter names from the multicenter "Dutch ICU data warehouse against COVID-19" sourced from three types of EHR systems to train machine learning models for concept mapping. Data from 29 intensive care units on 38,824 parameters mapped to 1,679 relevant and unique concepts and 38,069 parameters labeled as irrelevant were used for model development and validation. We used the Natural Language Toolkit (NLTK) to preprocess the parameter names based on WordNet cognitive synonyms transformed by term-frequency inverse document frequency (TF-IDF), yielding numeric features. We then trained linear classifiers using stochastic gradient descent for multi-class prediction. Finally, we fine-tuned these predictions using information on distributions of the data associated with each parameter name through similarity score and skewness comparisons. RESULTS: The initial model, trained using data from one hospital organization for each of three EHR systems, scored an overall top 1 precision of 0.744, recall of 0.771, and F1-score of 0.737 on a total of 58,804 parameters. Leave-one-hospital-out analysis returned an average top 1 recall of 0.680 for relevant parameters, which increased to 0.905 for the top 5 predictions. When reducing the training dataset to only include relevant parameters, top 1 recall was 0.811 and top 5 recall was 0.914 for relevant parameters. Performance improvement based on similarity score or skewness comparisons affected at most 5.23% of numeric parameters. CONCLUSION: Augmented intelligence is a promising method to improve concept mapping of parameter names from raw electronic health record data exports. We propose a robust method for mapping data across various domains, facilitating the integration of diverse data sources. However, recall is not perfect, and therefore manual validation of mapping remains essential.

Assess and validate predictive performance of models for in-hospital mortality in COVID-19 patients: A retrospective cohort study in the Netherlands comparing the value of registry data with high-granular electronic health records.

Vagliano, Iacopo; Schut, Martijn C; Abu-Hanna, Ameen; Dongelmans, Dave A; de Lange, Dylan W; Gommers, Diederik; Cremer, Olaf L; Bosman, Rob J; Rigter, Sander; Wils, Evert-Jan; Frenzel, Tim; de Jong, Remko; Peters, Marco A A; Kamps, Marlijn J A; Ramnarain, Dharmanand; Nowitzky, Ralph; Nooteboom, Fleur G C A; de Ruijter, Wouter; Urlings-Strop, Louise C; Smit, Ellen G M; Mehagnoul-Schipper, D Jannet; Dormans, Tom; de Jager, Cornelis P C; Hendriks, Stefaan H A; Achterberg, Sefanja; Oostdijk, Evelien; Reidinga, Auke C; Festen-Spanjer, Barbara; Brunnekreef, Gert B; Cornet, Alexander D; van den Tempel, Walter; Boelens, Age D; Koetsier, Peter; Lens, Judith; Faber, Harald J; Karakus, A; Entjes, Robert; de Jong, Paul; Rettig, Thijs C D; Reuland, M C; Arbous, Sesmu; Fleuren, Lucas M; Dam, Tariq A; Thoral, Patrick J; Lalisang, Robbert C A; Tonutti, Michele; de Bruin, Daan P; Elbers, Paul W G; de Keizer, Nicolette F.

Int J Med Inform ; 167: 104863, 2022 11.

Article in English | MEDLINE | ID: mdl-36162166

ABSTRACT

PURPOSE: To assess, validate and compare the predictive performance of models for in-hospital mortality of COVID-19 patients admitted to the intensive care unit (ICU) over two different waves of infections. Our models were built with high-granular Electronic Health Records (EHR) data versus less-granular registry data. METHODS: Observational study of all COVID-19 patients admitted to 19 Dutch ICUs participating in both the national quality registry National Intensive Care Evaluation (NICE) and the EHR-based Dutch Data Warehouse (hereafter EHR). Multiple models were developed on data from the first 24 h of ICU admissions from February to June 2020 (first COVID-19 wave) and validated on prospective patients admitted to the same ICUs between July and December 2020 (second COVID-19 wave). We assessed model discrimination, calibration, and the degree of relatedness between development and validation population. Coefficients were used to identify relevant risk factors. RESULTS: A total of 1533 patients from the EHR and 1563 from the registry were included. With high granular EHR data, the average AUROC was 0.69 (standard deviation of 0.05) for the internal validation, and the AUROC was 0.75 for the temporal validation. The registry model achieved an average AUROC of 0.76 (standard deviation of 0.05) in the internal validation and 0.77 in the temporal validation. In the EHR data, age, and respiratory-system related variables were the most important risk factors identified. In the NICE registry data, age and chronic respiratory insufficiency were the most important risk factors. CONCLUSION: In our study, prognostic models built on less-granular but readily-available registry data had similar performance to models built on high-granular EHR data and showed similar transportability to a prospective COVID-19 population. Future research is needed to verify whether this finding can be confirmed for upcoming waves.

Subject(s)

COVID-19 , COVID-19/epidemiology , Electronic Health Records , Hospital Mortality , Humans , Intensive Care Units , Netherlands/epidemiology , Registries , Retrospective Studies

Rapid Evaluation of Coronavirus Illness Severity (RECOILS) in intensive care: Development and validation of a prognostic tool for in-hospital mortality.

Plecko, Drago; Bennett, Nicolas; Mårtensson, Johan; Dam, Tariq A; Entjes, Robert; Rettig, Thijs C D; Dongelmans, Dave A; Boelens, Age D; Rigter, Sander; Hendriks, Stefaan H A; de Jong, Remko; Kamps, Marlijn J A; Peters, Marco; Karakus, Attila; Gommers, Diederik; Ramnarain, Dharmanand; Wils, Evert-Jan; Achterberg, Sefanja; Nowitzky, Ralph; van den Tempel, Walter; de Jager, Cornelis P C; Nooteboom, Fleur G C A; Oostdijk, Evelien; Koetsier, Peter; Cornet, Alexander D; Reidinga, Auke C; de Ruijter, Wouter; Bosman, Rob J; Frenzel, Tim; Urlings-Strop, Louise C; de Jong, Paul; Smit, Ellen G M; Cremer, Olaf L; Mehagnoul-Schipper, D Jannet; Faber, Harald J; Lens, Judith; Brunnekreef, Gert B; Festen-Spanjer, Barbara; Dormans, Tom; de Bruin, Daan P; Lalisang, Robbert C A; Vonk, Sebastiaan J J; Haan, Martin E; Fleuren, Lucas M; Thoral, Patrick J; Elbers, Paul W G; Bellomo, Rinaldo.

Acta Anaesthesiol Scand ; 66(1): 65-75, 2022 01.

Article in English | MEDLINE | ID: mdl-34622441

ABSTRACT

BACKGROUND: The prediction of in-hospital mortality for ICU patients with COVID-19 is fundamental to treatment and resource allocation. The main purpose was to develop an easily implemented score for such prediction. METHODS: This was an observational, multicenter, development, and validation study on a national critical care dataset of COVID-19 patients. A systematic literature review was performed to determine variables possibly important for COVID-19 mortality prediction. Using a logistic multivariable model with a LASSO penalty, we developed the Rapid Evaluation of Coronavirus Illness Severity (RECOILS) score and compared its performance against published scores. RESULTS: Our development (validation) cohort consisted of 1480 (937) adult patients from 14 (11) Dutch ICUs admitted between March 2020 and April 2021. Median age was 65 (65) years, 31% (26%) died in hospital, 74% (72%) were males, average length of ICU stay was 7.83 (10.25) days and average length of hospital stay was 15.90 (19.92) days. Age, platelets, PaO2/FiO2 ratio, pH, blood urea nitrogen, temperature, PaCO2, Glasgow Coma Scale (GCS) score measured within +/-24 h of ICU admission were used to develop the score. The AUROC of RECOILS score was 0.75 (CI 0.71-0.78) which was higher than that of any previously reported predictive scores (0.68 [CI 0.64-0.71], 0.61 [CI 0.58-0.66], 0.67 [CI 0.63-0.70], 0.70 [CI 0.67-0.74] for ISARIC 4C Mortality Score, SOFA, SAPS-III, and age, respectively). CONCLUSIONS: Using a large dataset from multiple Dutch ICUs, we developed a predictive score for mortality of COVID-19 patients admitted to ICU, which outperformed other predictive scores reported so far.

Subject(s)

COVID-19 , Adult , Aged , Critical Care , Hospital Mortality , Humans , Intensive Care Units , Male , Multicenter Studies as Topic , Observational Studies as Topic , Patient Acuity , Prognosis , Retrospective Studies , SARS-CoV-2

Predictors for extubation failure in COVID-19 patients using a machine learning approach.

Fleuren, Lucas M; Dam, Tariq A; Tonutti, Michele; de Bruin, Daan P; Lalisang, Robbert C A; Gommers, Diederik; Cremer, Olaf L; Bosman, Rob J; Rigter, Sander; Wils, Evert-Jan; Frenzel, Tim; Dongelmans, Dave A; de Jong, Remko; Peters, Marco; Kamps, Marlijn J A; Ramnarain, Dharmanand; Nowitzky, Ralph; Nooteboom, Fleur G C A; de Ruijter, Wouter; Urlings-Strop, Louise C; Smit, Ellen G M; Mehagnoul-Schipper, D Jannet; Dormans, Tom; de Jager, Cornelis P C; Hendriks, Stefaan H A; Achterberg, Sefanja; Oostdijk, Evelien; Reidinga, Auke C; Festen-Spanjer, Barbara; Brunnekreef, Gert B; Cornet, Alexander D; van den Tempel, Walter; Boelens, Age D; Koetsier, Peter; Lens, Judith; Faber, Harald J; Karakus, A; Entjes, Robert; de Jong, Paul; Rettig, Thijs C D; Arbous, Sesmu; Vonk, Sebastiaan J J; Fornasa, Mattia; Machado, Tomas; Houwert, Taco; Hovenkamp, Hidde; Noorduijn Londono, Roberto; Quintarelli, Davide; Scholtemeijer, Martijn G; de Beer, Aletta A.

Crit Care ; 25(1): 448, 2021 12 27.

Article in English | MEDLINE | ID: mdl-34961537

ABSTRACT

INTRODUCTION: Determining the optimal timing for extubation can be challenging in the intensive care. In this study, we aim to identify predictors for extubation failure in critically ill patients with COVID-19. METHODS: We used highly granular data from 3464 adult critically ill COVID patients in the multicenter Dutch Data Warehouse, including demographics, clinical observations, medications, fluid balance, laboratory values, vital signs, and data from life support devices. All intubated patients with at least one extubation attempt were eligible for analysis. Transferred patients, patients admitted for less than 24 h, and patients still admitted at the time of data extraction were excluded. Potential predictors were selected by a team of intensive care physicians. The primary and secondary outcomes were extubation without reintubation or death within the next 7 days and within 48 h, respectively. We trained and validated multiple machine learning algorithms using fivefold nested cross-validation. Predictor importance was estimated using Shapley additive explanations, while cutoff values for the relative probability of failed extubation were estimated through partial dependence plots. RESULTS: A total of 883 patients were included in the model derivation. The reintubation rate was 13.4% within 48 h and 18.9% at day 7, with a mortality rate of 0.6% and 1.0% respectively. The grandient-boost model performed best (area under the curve of 0.70) and was used to calculate predictor importance. Ventilatory characteristics and settings were the most important predictors. More specifically, a controlled mode duration longer than 4 days, a last fraction of inspired oxygen higher than 35%, a mean tidal volume per kg ideal body weight above 8 ml/kg in the day before extubation, and a shorter duration in assisted mode (< 2 days) compared to their median values. Additionally, a higher C-reactive protein and leukocyte count, a lower thrombocyte count, a lower Glasgow coma scale and a lower body mass index compared to their medians were associated with extubation failure. CONCLUSION: The most important predictors for extubation failure in critically ill COVID-19 patients include ventilatory settings, inflammatory parameters, neurological status, and body mass index. These predictors should therefore be routinely captured in electronic health records.

Subject(s)

Airway Extubation , COVID-19 , Treatment Failure , Adult , COVID-19/therapy , Critical Illness , Humans , Machine Learning

The Dutch Data Warehouse, a multicenter and full-admission electronic health records database for critically ill COVID-19 patients.

Fleuren, Lucas M; Dam, Tariq A; Tonutti, Michele; de Bruin, Daan P; Lalisang, Robbert C A; Gommers, Diederik; Cremer, Olaf L; Bosman, Rob J; Rigter, Sander; Wils, Evert-Jan; Frenzel, Tim; Dongelmans, Dave A; de Jong, Remko; Peters, Marco; Kamps, Marlijn J A; Ramnarain, Dharmanand; Nowitzky, Ralph; Nooteboom, Fleur G C A; de Ruijter, Wouter; Urlings-Strop, Louise C; Smit, Ellen G M; Mehagnoul-Schipper, D Jannet; Dormans, Tom; de Jager, Cornelis P C; Hendriks, Stefaan H A; Achterberg, Sefanja; Oostdijk, Evelien; Reidinga, Auke C; Festen-Spanjer, Barbara; Brunnekreef, Gert B; Cornet, Alexander D; van den Tempel, Walter; Boelens, Age D; Koetsier, Peter; Lens, Judith; Faber, Harald J; Karakus, A; Entjes, Robert; de Jong, Paul; Rettig, Thijs C D; Arbous, Sesmu; Vonk, Sebastiaan J J; Fornasa, Mattia; Machado, Tomas; Houwert, Taco; Hovenkamp, Hidde; Noorduijn-Londono, Roberto; Quintarelli, Davide; Scholtemeijer, Martijn G; de Beer, Aletta A.

Crit Care ; 25(1): 304, 2021 08 23.

Article in English | MEDLINE | ID: mdl-34425864

ABSTRACT

BACKGROUND: The Coronavirus disease 2019 (COVID-19) pandemic has underlined the urgent need for reliable, multicenter, and full-admission intensive care data to advance our understanding of the course of the disease and investigate potential treatment strategies. In this study, we present the Dutch Data Warehouse (DDW), the first multicenter electronic health record (EHR) database with full-admission data from critically ill COVID-19 patients. METHODS: A nation-wide data sharing collaboration was launched at the beginning of the pandemic in March 2020. All hospitals in the Netherlands were asked to participate and share pseudonymized EHR data from adult critically ill COVID-19 patients. Data included patient demographics, clinical observations, administered medication, laboratory determinations, and data from vital sign monitors and life support devices. Data sharing agreements were signed with participating hospitals before any data transfers took place. Data were extracted from the local EHRs with prespecified queries and combined into a staging dataset through an extract-transform-load (ETL) pipeline. In the consecutive processing pipeline, data were mapped to a common concept vocabulary and enriched with derived concepts. Data validation was a continuous process throughout the project. All participating hospitals have access to the DDW. Within legal and ethical boundaries, data are available to clinicians and researchers. RESULTS: Out of the 81 intensive care units in the Netherlands, 66 participated in the collaboration, 47 have signed the data sharing agreement, and 35 have shared their data. Data from 25 hospitals have passed through the ETL and processing pipeline. Currently, 3464 patients are included in the DDW, both from wave 1 and wave 2 in the Netherlands. More than 200 million clinical data points are available. Overall ICU mortality was 24.4%. Respiratory and hemodynamic parameters were most frequently measured throughout a patient's stay. For each patient, all administered medication and their daily fluid balance were available. Missing data are reported for each descriptive. CONCLUSIONS: In this study, we show that EHR data from critically ill COVID-19 patients may be lawfully collected and can be combined into a data warehouse. These initiatives are indispensable to advance medical data science in the field of intensive care medicine.

Subject(s)

COVID-19/epidemiology , Critical Illness/epidemiology , Data Warehousing/statistics & numerical data , Electronic Health Records/statistics & numerical data , Hospitalization/statistics & numerical data , Intensive Care Units/statistics & numerical data , Critical Care , Humans , Netherlands

Risk factors for adverse outcomes during mechanical ventilation of 1152 COVID-19 patients: a multicenter machine learning study with highly granular data from the Dutch Data Warehouse.

Fleuren, Lucas M; Tonutti, Michele; de Bruin, Daan P; Lalisang, Robbert C A; Dam, Tariq A; Gommers, Diederik; Cremer, Olaf L; Bosman, Rob J; Vonk, Sebastiaan J J; Fornasa, Mattia; Machado, Tomas; van der Meer, Nardo J M; Rigter, Sander; Wils, Evert-Jan; Frenzel, Tim; Dongelmans, Dave A; de Jong, Remko; Peters, Marco; Kamps, Marlijn J A; Ramnarain, Dharmanand; Nowitzky, Ralph; Nooteboom, Fleur G C A; de Ruijter, Wouter; Urlings-Strop, Louise C; Smit, Ellen G M; Mehagnoul-Schipper, D Jannet; Dormans, Tom; de Jager, Cornelis P C; Hendriks, Stefaan H A; Oostdijk, Evelien; Reidinga, Auke C; Festen-Spanjer, Barbara; Brunnekreef, Gert; Cornet, Alexander D; van den Tempel, Walter; Boelens, Age D; Koetsier, Peter; Lens, Judith; Achterberg, Sefanja; Faber, Harald J; Karakus, A; Beukema, Menno; Entjes, Robert; de Jong, Paul; Houwert, Taco; Hovenkamp, Hidde; Noorduijn Londono, Roberto; Quintarelli, Davide; Scholtemeijer, Martijn G; de Beer, Aletta A.

Intensive Care Med Exp ; 9(1): 32, 2021 Jun 28.

Article in English | MEDLINE | ID: mdl-34180025

ABSTRACT

BACKGROUND: The identification of risk factors for adverse outcomes and prolonged intensive care unit (ICU) stay in COVID-19 patients is essential for prognostication, determining treatment intensity, and resource allocation. Previous studies have determined risk factors on admission only, and included a limited number of predictors. Therefore, using data from the highly granular and multicenter Dutch Data Warehouse, we developed machine learning models to identify risk factors for ICU mortality, ventilator-free days and ICU-free days during the course of invasive mechanical ventilation (IMV) in COVID-19 patients. METHODS: The DDW is a growing electronic health record database of critically ill COVID-19 patients in the Netherlands. All adult ICU patients on IMV were eligible for inclusion. Transfers, patients admitted for less than 24 h, and patients still admitted at time of data extraction were excluded. Predictors were selected based on the literature, and included medication dosage and fluid balance. Multiple algorithms were trained and validated on up to three sets of observations per patient on day 1, 7, and 14 using fivefold nested cross-validation, keeping observations from an individual patient in the same split. RESULTS: A total of 1152 patients were included in the model. XGBoost models performed best for all outcomes and were used to calculate predictor importance. Using Shapley additive explanations (SHAP), age was the most important demographic risk factor for the outcomes upon start of IMV and throughout its course. The relative probability of death across age values is visualized in Partial Dependence Plots (PDPs), with an increase starting at 54 years. Besides age, acidaemia, low P/F-ratios and high driving pressures demonstrated a higher probability of death. The PDP for driving pressure showed a relative probability increase starting at 12 cmH2O. CONCLUSION: Age is the most important demographic risk factor of ICU mortality, ICU-free days and ventilator-free days throughout the course of invasive mechanical ventilation in critically ill COVID-19 patients. pH, P/F ratio, and driving pressure should be monitored closely over the course of mechanical ventilation as risk factors predictive of these outcomes.

Large-scale ICU data sharing for global collaboration: the first 1633 critically ill COVID-19 patients in the Dutch Data Warehouse.

Fleuren, Lucas M; de Bruin, Daan P; Tonutti, Michele; Lalisang, Robbert C A; Elbers, Paul W G.

Intensive Care Med ; 47(4): 478-481, 2021 04.

Article in English | MEDLINE | ID: mdl-33595710

Subject(s)

COVID-19 , Critical Illness , Data Warehousing , Information Dissemination , Intensive Care Units , Humans , Netherlands

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL