This article is a Preprint
Preprints are preliminary research reports that have not been certified by peer review. They should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Preprints posted online allow authors to receive rapid feedback and the entire scientific community can appraise the work for themselves and respond appropriately. Those comments are posted alongside the preprints for anyone to read them and serve as a post publication assessment.
Learning the language of viral evolution and escape (preprint)
biorxiv; 2020.
Preprint
in English
| bioRxiv | ID: ppzbmed-10.1101.2020.07.08.193946
ABSTRACT
Viral mutation that escapes from human immunity remains a major obstacle to antiviral and vaccine development. While anticipating escape could aid rational therapeutic design, the complex rules governing viral escape are challenging to model. Here, we demonstrate an unprecedented ability to predict viral escape by using machine learning algorithms originally developed to model the complexity of human natural language. Our key conceptual advance is that predicting escape requires identifying mutations that preserve viral fitness, or “grammaticality,” and also induce high antigenic change, or “semantic change.” We develop viral language models for influenza hemagglutinin, HIV Env, and SARS-CoV-2 Spike that we use to construct antigenically meaningful semantic landscapes, perform completely unsupervised prediction of escape mutants, and learn structural escape patterns from sequence alone. More profoundly, we lay a promising conceptual bridge between natural language and viral evolution. One sentence summary Neural language models of semantic change and grammaticality enable unprecedented prediction of viral escape mutations.
Full text:
Available
Collection:
Preprints
Database:
bioRxiv
Main subject:
HIV Infections
Language:
English
Year:
2020
Document Type:
Preprint
Similar
MEDLINE
...
LILACS
LIS