Predicting the Evolution of COVID-19 Mortality Risk: a Recurrent Neural Network Approach

Marta Villegas; Aitor Gonzalez-Agirre; Asier Gutiérrez-Fandiño; Jordi Armengol-Estapé; Casimiro Pio Carrino; David Pérez Fernández; Felipe Soares; Pablo Serrano; Miguel Pedrera; Noelia García; Alfonso Valencia

This article is a Preprint

Preprints are preliminary research reports that have not been certified by peer review. They should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Preprints posted online allow authors to receive rapid feedback and the entire scientific community can appraise the work for themselves and respond appropriately. Those comments are posted alongside the preprints for anyone to read them and serve as a post publication assessment.

Predicting the Evolution of COVID-19 Mortality Risk: a Recurrent Neural Network Approach

Marta Villegas; Aitor Gonzalez-Agirre; Asier Gutiérrez-Fandiño; Jordi Armengol-Estapé; Casimiro Pio Carrino; David Pérez Fernández; Felipe Soares; Pablo Serrano; Miguel Pedrera; Noelia García; Alfonso Valencia.

Affiliation

Marta Villegas; Barcelona Supercomputing Center
Aitor Gonzalez-Agirre; Barcelona Supercomputing Center
Asier Gutiérrez-Fandiño; Barcelona Supercomputing Center
Jordi Armengol-Estapé; Barcelona Supercomputing Center
Casimiro Pio Carrino; Barcelona Supercomputing Center
David Pérez Fernández; Ministry of Inclusion, Social Security and Migration
Felipe Soares; Universidade Federal do Rio Grande do Sul
Pablo Serrano; Hospital Universitario 12 de Octubre
Miguel Pedrera; Hospital Universitario 12 de Octubre
Noelia García; Instituto de Investigación Sanitaria del Hospital Universitario 12 de Octubre
Alfonso Valencia; Barcelona Supercomputing Center

Preprint in English | medRxiv | ID: ppmedrxiv-20244061

ABSTRACT

ABSTRACT

AO_SCPLOWBSTRACTC_SCPLOWO_ST_ABSBackgroundC_ST_ABSThe propagation of COVID-19 in Spain prompted the declaration of the state of alarm on March 14, 2020. On 2 December 2020, the infection had been confirmed in 1,665,775 patients and caused 45,784 deaths. This unprecedented health crisis challenged the ingenuity of all professionals involved. Decision support systems in clinical care and health services management were identified as crucial in the fight against the pandemic. MethodsThis study applies Deep Learning techniques for mortality prediction of COVID-19 patients. Two datasets with clinical information (medication, laboratory tests, vital signs etc.) were used. They are comprised of 2,307 and 3,870 COVID-19 infected patients admitted to two Spanish hospital chains. Firstly, we built a sequence of temporal events gathering all the clinical information for each patient. Next, we used the temporal sequences to train a Recurrent Neural Network (RNN) model with an attention mechanism exploring interpretability. We conducted extensive experiments and trained the RNNs in different settings, performing hyperparameter search and cross-validation. We ensembled resulting RNNs to reduce variability and enhance sensitivity. ResultsWe assessed the performance of our models using global metrics, by averaging the performance across all the days in the sequences. We also measured day-by-day metrics starting from the day of hospital admission and the outcome day and evaluated the daily predictions. Regarding sensitivity, when compared to more traditional models, our best two RNN ensemble models outperform a Support Vector Classifier in 6 and 16 percentage points, and Random Forest in 23 and 18 points. For the day-by-day predictions from the outcome date, the models also achieved better results than baselines showing its ability towards early predictions. ConclusionsWe have shown the feasibility of our approach to predict the clinical outcome of patients infected with SARS-CoV-2. The result is a time series model that can support decision-making in healthcare systems and aims at interpretability. The system is robust enough to deal with real world data and it is able to overcome the problems derived from the sparsity and heterogeneity of the data. In addition, the approach was validated using two datasets showing substantial differences. This not only validates the robustness of the proposal but also meets the requirements of a real scenario where the interoperability between hospitals datasets is difficult to achieve.

License

cc_by

Fulltext

Add to My VHL

XML

Search on Google

Full text: Available Collection: Preprints Database: medRxiv Type of study: Experimental_studies / Prognostic study / Rct Language: English Year: 2020 Document type: Preprint

Fulltext

Add to My VHL

XML

Search on Google