Your browser doesn't support javascript.
A dataset comprised of binding interactions for 104,972 antibodies against a SARS-CoV-2 peptide.
Engelhart, Emily; Emerson, Ryan; Shing, Leslie; Lennartz, Chelsea; Guion, Daniel; Kelley, Mary; Lin, Charles; Lopez, Randolph; Younger, David; Walsh, Matthew E.
  • Engelhart E; A-Alpha Bio, Inc., Seattle, WA, USA.
  • Emerson R; A-Alpha Bio, Inc., Seattle, WA, USA.
  • Shing L; Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA, USA.
  • Lennartz C; Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA, USA.
  • Guion D; A-Alpha Bio, Inc., Seattle, WA, USA.
  • Kelley M; A-Alpha Bio, Inc., Seattle, WA, USA.
  • Lin C; A-Alpha Bio, Inc., Seattle, WA, USA.
  • Lopez R; A-Alpha Bio, Inc., Seattle, WA, USA.
  • Younger D; A-Alpha Bio, Inc., Seattle, WA, USA.
  • Walsh ME; Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA, USA. mwalsh52@jhu.edu.
Sci Data ; 9(1): 653, 2022 10 26.
Article in English | MEDLINE | ID: covidwho-2087256
ABSTRACT
The dataset presented here contains quantitative binding scores of scFv-format antibodies against a SARS-CoV-2 target peptide collected via an AlphaSeq assay that can be used in the development and benchmarking of machine learning models. Starting from three seed sequences identified from a phage display campaign using a human naïve library, four sets of 29,900 antibodies were designed in silico by creating all k = 1 mutations and random k = 2 and k = 3 mutations throughout the complementary-determining regions (CDRs). Of the 119,600 designs, 104,972 were successfully built in to the AlphaSeq library and target binding was subsequently measured with 71,384 designs resulting in a predicted affinity value for at least one of the triplicate measurements. Data include antibodies with predicted affinity measurements ranging from 37 pM to 22 mM. To our knowledge, this dataset is the largest, publicly available dataset that contains antibody sequences, antigen sequence and quantitative measurements of binding scores and provides an opportunity to serve as a benchmark to evaluate antibody-specific representation models for machine learning.
Subject(s)

Full text: Available Collection: International databases Database: MEDLINE Main subject: Single-Chain Antibodies / COVID-19 Type of study: Experimental Studies / Prognostic study / Randomized controlled trials Limits: Humans Language: English Journal: Sci Data Year: 2022 Document Type: Article Affiliation country: S41597-022-01779-4

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Single-Chain Antibodies / COVID-19 Type of study: Experimental Studies / Prognostic study / Randomized controlled trials Limits: Humans Language: English Journal: Sci Data Year: 2022 Document Type: Article Affiliation country: S41597-022-01779-4