Your browser doesn't support javascript.
loading
ImputeCoVNet: 2D ResNet Autoencoder for Imputation of SARS-CoV-2 Sequences
Ahmad Pesaranghader; Justin Pelletier; Jean-Christophe Grenier; Raphaël Poujol; Julie Hussin.
Affiliation
  • Ahmad Pesaranghader; McGill University
  • Justin Pelletier; Montreal Heart Institute
  • Jean-Christophe Grenier; Montreal Heart Institute
  • Raphaël Poujol; Montreal Heart Institute
  • Julie Hussin; Université de Montréal
Preprint in English | bioRxiv | ID: ppbiorxiv-456305
ABSTRACT
We describe a new deep learning approach for the imputation of SARS-CoV-2 variants. Our model, ImputeCoVNet, consists of a 2D ResNet Autoencoder that aims at imputing missing genetic variants in SARS-CoV-2 sequences in an efficient manner. We show that ImputeCoVNet leads to accurate results at minor allele frequencies as low as 0.0001. When compared with an approach based on Hamming distance, ImputeCoVNet achieved comparable results with significantly less computation time. We also present the provision of geographical metadata (e.g., exposed country) to decoder increases the imputation accuracy. Additionally, by visualizing the embedding results of SARS-CoV-2 variants, we show that the trained encoder of ImputeCoVNet, or the embedded results from it, recapitulates viral clades information, which means it could be used for predictive tasks using virus sequence analysis.
License
cc_by
Full text: Available Collection: Preprints Database: bioRxiv Type of study: Prognostic study Language: English Year: 2021 Document type: Preprint
Full text: Available Collection: Preprints Database: bioRxiv Type of study: Prognostic study Language: English Year: 2021 Document type: Preprint
...