Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 13 de 13
Filter
Add more filters










Publication year range
1.
Elife ; 122023 03 27.
Article in English | MEDLINE | ID: mdl-36971345

ABSTRACT

Immunoglobulin loci-transgenic animals are widely used in antibody discovery and increasingly in vaccine response modelling. In this study, we phenotypically characterised B-cell populations from the Intelliselect Transgenic mouse (Kymouse) demonstrating full B-cell development competence. Comparison of the naïve B-cell receptor (BCR) repertoires of Kymice BCRs, naïve human, and murine BCR repertoires revealed key differences in germline gene usage and junctional diversification. These differences result in Kymice having CDRH3 length and diversity intermediate between mice and humans. To compare the structural space explored by CDRH3s in each species' repertoire, we used computational structure prediction to show that Kymouse naïve BCR repertoires are more human-like than mouse-like in their predicted distribution of CDRH3 shape. Our combined sequence and structural analysis indicates that the naïve Kymouse BCR repertoire is diverse with key similarities to human repertoires, while immunophenotyping confirms that selected naïve B cells are able to go through complete development.


Subject(s)
Antibodies , B-Lymphocytes , Animals , Humans , Mice , Mice, Transgenic , Immunophenotyping , High-Throughput Nucleotide Sequencing , Receptors, Antigen, B-Cell/genetics
2.
Bioinformatics ; 38(9): 2628-2630, 2022 04 28.
Article in English | MEDLINE | ID: mdl-35274671

ABSTRACT

MOTIVATION: Rational design of therapeutic antibodies can be improved by harnessing the natural sequence diversity of these molecules. Our understanding of the diversity of antibodies has recently been greatly facilitated through the deposition of hundreds of millions of human antibody sequences in next-generation sequencing (NGS) repositories. Contrasting a query therapeutic antibody sequence to naturally observed diversity in similar antibody sequences from NGS can provide a mutational roadmap for antibody engineers designing biotherapeutics. Because of the sheer scale of the antibody NGS datasets, performing queries across them is computationally challenging. RESULTS: To facilitate harnessing antibody NGS data, we developed AbDiver (http://naturalantibody.com/abdiver), a free portal allowing users to compare their query sequences to those observed in the natural repertoires. AbDiver offers three antibody-specific use-cases: (i) compare a query antibody to positional variability statistics precomputed from multiple independent studies, (ii) retrieve close full variable sequence matches to a query antibody and (iii) retrieve CDR3 or clonotype matches to a query antibody. We applied our system to a set of 742 therapeutic antibodies, demonstrating that for each use-case our system can retrieve relevant results for most sequences. AbDiver facilitates the navigation of vast antibody mutation space for the purpose of rational therapeutic antibody design. AVAILABILITY AND IMPLEMENTATION: AbDiver is freely accessible at http://naturalantibody.com/abdiver. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Antibodies , High-Throughput Nucleotide Sequencing , Humans , Antibodies/therapeutic use , Antibodies/genetics , Software
3.
Elife ; 102021 10 18.
Article in English | MEDLINE | ID: mdl-34661527

ABSTRACT

Several human B cell subpopulations are recognised in the peripheral blood, which play distinct roles in the humoral immune response. These cells undergo developmental and maturational changes involving VDJ recombination, somatic hypermutation and class switch recombination, altogether shaping their immunoglobulin heavy chain (IgH) repertoire. Here, we sequenced the IgH repertoire of naïve, marginal zone, switched and plasma cells from 10 healthy adults along with matched unsorted and in silico separated CD19+ bulk B cells. Using advanced bioinformatic analysis and machine learning, we show that sorted B cell subpopulations are characterised by distinct repertoire characteristics on both the individual sequence and the repertoire level. Sorted subpopulations shared similar repertoire characteristics with their corresponding in silico separated subsets. Furthermore, certain IgH repertoire characteristics correlated with the position of the constant region on the IgH locus. Overall, this study provides unprecedented insight over mechanisms of B cell repertoire control in peripherally circulating B cell subpopulations.


Subject(s)
B-Lymphocytes/physiology , Lymphocyte Subsets/physiology , Computational Biology , Humans , Immunity, Humoral
4.
PLoS Comput Biol ; 17(3): e1008781, 2021 03.
Article in English | MEDLINE | ID: mdl-33647011

ABSTRACT

The naïve antibody/B-cell receptor (BCR) repertoires of different individuals ought to exhibit significant functional commonality, given that most pathogens trigger an effective antibody response to immunodominant epitopes. Sequence-based repertoire analysis has so far offered little evidence for this phenomenon. For example, a recent study estimated the number of shared ('public') antibody clonotypes in circulating baseline repertoires to be around 0.02% across ten unrelated individuals. However, to engage the same epitope, antibodies only require a similar binding site structure and the presence of key paratope interactions, which can occur even when their sequences are dissimilar. Here, we search for evidence of geometric similarity/convergence across human antibody repertoires. We first structurally profile naïve ('baseline') antibody diversity using snapshots from 41 unrelated individuals, predicting all modellable distinct structures within each repertoire. This analysis uncovers a high (much greater than random) degree of structural commonality. For instance, around 3% of distinct structures are common to the ten most diverse individual samples ('Public Baseline' structures). Our approach is the first computational method to find levels of BCR commonality commensurate with epitope immunodominance and could therefore be harnessed to find more genetically distant antibodies with same-epitope complementarity. We then apply the same structural profiling approach to repertoire snapshots from three individuals before and after flu vaccination, detecting a convergent structural drift indicative of recognising similar epitopes ('Public Response' structures). We show that Antibody Model Libraries derived from Public Baseline and Public Response structures represent a powerful geometric basis set of low-immunogenicity candidates exploitable for general or target-focused therapeutic antibody screening.


Subject(s)
Antibodies , Antibody Diversity , B-Lymphocytes , Databases, Genetic , Immunodominant Epitopes , B-Lymphocytes/chemistry , B-Lymphocytes/immunology , B-Lymphocytes/metabolism , Computational Biology , Humans
5.
Bioinformatics ; 37(5): 734-735, 2021 05 05.
Article in English | MEDLINE | ID: mdl-32805021

ABSTRACT

MOTIVATION: The emergence of a novel strain of betacoronavirus, SARS-CoV-2, has led to a pandemic that has been associated with over 700 000 deaths as of August 5, 2020. Research is ongoing around the world to create vaccines and therapies to minimize rates of disease spread and mortality. Crucial to these efforts are molecular characterizations of neutralizing antibodies to SARS-CoV-2. Such antibodies would be valuable for measuring vaccine efficacy, diagnosing exposure and developing effective biotherapeutics. Here, we describe our new database, CoV-AbDab, which already contains data on over 1400 published/patented antibodies and nanobodies known to bind to at least one betacoronavirus. This database is the first consolidation of antibodies known to bind SARS-CoV-2 as well as other betacoronaviruses such as SARS-CoV-1 and MERS-CoV. It contains relevant metadata including evidence of cross-neutralization, antibody/nanobody origin, full variable domain sequence (where available) and germline assignments, epitope region, links to relevant PDB entries, homology models and source literature. RESULTS: On August 5, 2020, CoV-AbDab referenced sequence information on 1402 anti-coronavirus antibodies and nanobodies, spanning 66 papers and 21 patents. Of these, 1131 bind to SARS-CoV-2. AVAILABILITYAND IMPLEMENTATION: CoV-AbDab is free to access and download without registration at http://opig.stats.ox.ac.uk/webapps/coronavirus. Community submissions are encouraged. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
COVID-19 , Middle East Respiratory Syndrome Coronavirus , Antibodies, Neutralizing , Antibodies, Viral , Humans , SARS-CoV-2 , Spike Glycoprotein, Coronavirus
6.
Front Immunol ; 11: 1734, 2020.
Article in English | MEDLINE | ID: mdl-32849618

ABSTRACT

B cells play a central role in adaptive immune processes, mainly through the production of antibodies. The maturation of the B cell system with age is poorly studied. We extensively investigated age-related alterations of naïve and antigen-experienced immunoglobulin heavy chain (IgH) repertoires. The most significant changes were observed in the first 10 years of life, and were characterized by altered immunoglobulin gene usage and an increased frequency of mutated antibodies structurally diverging from their germline precursors. Older age was associated with an increased usage of downstream IgH constant region genes and fewer antibodies with self-reactive properties. As mutations accumulated with age, the frequency of germline-encoded self-reactive antibodies decreased, indicating a possible beneficial role of self-reactive B cells in the developing immune system. Our results suggest a continuous process of change through childhood across a broad range of parameters characterizing IgH repertoires and stress the importance of using well-selected, age-appropriate controls in IgH studies.


Subject(s)
Aging/immunology , B-Lymphocytes/immunology , Genes, Immunoglobulin Heavy Chain , Immunoglobulin Heavy Chains/immunology , Mutation , Adolescent , Adult , Age Factors , Aging/genetics , Aging/metabolism , B-Lymphocytes/metabolism , Child , Child Development , Child, Preschool , Computational Biology , High-Throughput Nucleotide Sequencing , Humans , Immunoglobulin Heavy Chains/genetics , Immunoglobulin Heavy Chains/metabolism , Infant , Middle Aged , Young Adult
7.
PLoS Comput Biol ; 16(2): e1007636, 2020 02.
Article in English | MEDLINE | ID: mdl-32069281

ABSTRACT

Most current analysis tools for antibody next-generation sequencing data work with primary sequence descriptors, leaving accompanying structural information unharnessed. We have used novel rapid methods to structurally characterize the complementary-determining regions (CDRs) of more than 180 million human and mouse B-cell receptor (BCR) repertoire sequences. These structurally annotated CDRs provide unprecedented insights into both the structural predetermination and dynamics of the adaptive immune response. We show that B-cell types can be distinguished based solely on these structural properties. Antigen-unexperienced BCR repertoires use the highest number and diversity of CDR structures and these patterns of naïve repertoire paratope usage are highly conserved across subjects. In contrast, more differentiated B-cells are more personalized in terms of CDR structure usage. Our results establish the CDR structure differences in BCR repertoires and have applications for many fields including immunodiagnostics, phage display library generation, and "humanness" assessment of BCR repertoires from transgenic animals. The software tool for structural annotation of BCR repertoires, SAAB+, is available at https://github.com/oxpig/saab_plus.


Subject(s)
B-Lymphocytes/immunology , Cell Differentiation , Receptors, Antigen, B-Cell/metabolism , Adaptive Immunity , Animals , Animals, Genetically Modified , Antibodies , B-Lymphocytes/cytology , Cluster Analysis , High-Throughput Nucleotide Sequencing , Humans , Immunoglobulin G/chemistry , Mice , Mice, Inbred C57BL , Principal Component Analysis , Receptors, Antigen, B-Cell/genetics , Software
8.
Front Immunol ; 11: 605170, 2020.
Article in English | MEDLINE | ID: mdl-33384691

ABSTRACT

Deep sequencing of B cell receptor (BCR) heavy chains from a cohort of 31 COVID-19 patients from the UK reveals a stereotypical naive immune response to SARS-CoV-2 which is consistent across patients. Clonal expansion of the B cell population is also observed and may be the result of memory bystander effects. There was a strong convergent sequence signature across patients, and we identified 1,254 clonotypes convergent between at least four of the COVID-19 patients, but not present in healthy controls or individuals following seasonal influenza vaccination. A subset of the convergent clonotypes were homologous to known SARS and SARS-CoV-2 spike protein neutralizing antibodies. Convergence was also demonstrated across wide geographies by comparison of data sets between patients from UK, USA, and China, further validating the disease association and consistency of the stereotypical immune response even at the sequence level. These convergent clonotypes provide a resource to identify potential therapeutic and prophylactic antibodies and demonstrate the potential of BCR profiling as a tool to help understand patient responses.


Subject(s)
Antibodies, Neutralizing/immunology , Antibodies, Viral/immunology , COVID-19/pathology , Receptors, Antigen, B-Cell/genetics , SARS-CoV-2/immunology , B-Lymphocytes/immunology , COVID-19/immunology , Female , High-Throughput Nucleotide Sequencing , Humans , Lymphopenia/immunology , Male , Middle Aged , Spike Glycoprotein, Coronavirus/immunology
9.
MAbs ; 11(7): 1197-1205, 2019 10.
Article in English | MEDLINE | ID: mdl-31216939

ABSTRACT

Recently it has become possible to query the great diversity of natural antibody repertoires using next-generation sequencing (NGS). These methods are capable of producing millions of sequences in a single experiment. Here we compare clinical-stage therapeutic antibodies to the ~1b sequences from 60 independent sequencing studies in the Observed Antibody Space database, which includes antibody sequences from NGS analysis of immunoglobulin gene repertoires. Of 242 post-Phase 1 antibodies, we found 16 with sequence identity matches of 95% or better for both heavy and light chains. There are also 54 perfect matches to therapeutic CDR-H3 regions in the NGS outputs, suggesting a nontrivial amount of convergence between naturally observed sequences and those developed artificially. This has potential implications for both the legal protection of commercial antibodies and the discovery of antibody therapeutics.


Subject(s)
Complementarity Determining Regions/genetics , Immunoglobulins/genetics , Immunotherapy/methods , Data Mining , Databases, Genetic , High-Throughput Nucleotide Sequencing , Humans , Immunity, Humoral , Immunoglobulins/therapeutic use
10.
J Immunol ; 201(12): 3694-3704, 2018 12 15.
Article in English | MEDLINE | ID: mdl-30397033

ABSTRACT

Next-generation sequencing of the Ig gene repertoire (Ig-seq) produces large volumes of information at the nucleotide sequence level. Such data have improved our understanding of immune systems across numerous species and have already been successfully applied in vaccine development and drug discovery. However, the high-throughput nature of Ig-seq means that it is afflicted by high error rates. This has led to the development of error-correction approaches. Computational error-correction methods use sequence information alone, primarily designating sequences as likely to be correct if they are observed frequently. In this work, we describe an orthogonal method for filtering Ig-seq data, which considers the structural viability of each sequence. A typical natural Ab structure requires the presence of a disulfide bridge within each of its variable chains to maintain the fold. Our Ab Sequence Selector (ABOSS) uses the presence/absence of this bridge as a way of both identifying structurally viable sequences and estimating the sequencing error rate. On simulated Ig-seq datasets, ABOSS is able to identify more than 99% of structurally viable sequences. Applying our method to six independent Ig-seq datasets (one mouse and five human), we show that our error calculations are in line with previous experimental and computational error estimates. We also show how ABOSS is able to identify structurally impossible sequences missed by other error-correction methods.


Subject(s)
High-Throughput Nucleotide Sequencing/methods , Immunoglobulins/genetics , Software , Vaccines/immunology , Algorithms , Animals , Computational Biology , Databases as Topic , Drug Development , Humans , Mice , Protein Conformation , Quality Control , Scientific Experimental Error , Structure-Activity Relationship
11.
J Immunol ; 201(8): 2502-2509, 2018 10 15.
Article in English | MEDLINE | ID: mdl-30217829

ABSTRACT

Abs are immune system proteins that recognize noxious molecules for elimination. Their sequence diversity and binding versatility have made Abs the primary class of biopharmaceuticals. Recently, it has become possible to query their immense natural diversity using next-generation sequencing of Ig gene repertoires (Ig-seq). However, Ig-seq outputs are currently fragmented across repositories and tend to be presented as raw nucleotide reads, which means nontrivial effort is required to reuse the data for analysis. To address this issue, we have collected Ig-seq outputs from 55 studies, covering more than half a billion Ab sequences across diverse immune states, organisms (primarily human and mouse), and individuals. We have sorted, cleaned, annotated, translated, and numbered these sequences and make the data available via our Observed Antibody Space (OAS) resource at http://antibodymap.org The data within OAS will be regularly updated with newly released Ig-seq datasets. We believe OAS will facilitate data mining of immune repertoires for improved understanding of the immune system and development of better biotherapeutics.


Subject(s)
Antibodies/genetics , Data Mining/methods , Immunoglobulins/genetics , Immunotherapy/methods , Animals , Antibody Diversity , Databases, Genetic , High-Throughput Nucleotide Sequencing , Humans , Immunity, Humoral/genetics , Mice , Molecular Sequence Annotation
12.
Front Immunol ; 9: 1698, 2018.
Article in English | MEDLINE | ID: mdl-30083160

ABSTRACT

Every human possesses millions of distinct antibodies. It is now possible to analyze this diversity via next-generation sequencing of immunoglobulin genes (Ig-seq). This technique produces large volume sequence snapshots of B-cell receptors that are indicative of the antibody repertoire. In this paper, we enrich these large-scale sequence datasets with structural information. Enriching a sequence with its structural data allows better approximation of many vital features, such as its binding site and specificity. Here, we describe the structural annotation of antibodies pipeline that maps the outputs of large Ig-seq experiments to known antibody structures. We demonstrate the viability of our protocol on five separate Ig-seq datasets covering ca. 35 m unique amino acid sequences from ca. 600 individuals. Despite the great theoretical diversity of antibodies, we find that the majority of sequences coming from such studies can be reliably mapped to an existing structure.

13.
Front Immunol ; 8: 1753, 2017.
Article in English | MEDLINE | ID: mdl-29276518

ABSTRACT

Next-generation sequencing of immunoglobulin gene repertoires (Ig-seq) allows the investigation of large-scale antibody dynamics at a sequence level. However, structural information, a crucial descriptor of antibody binding capability, is not collected in Ig-seq protocols. Developing systematic relationships between the antibody sequence information gathered from Ig-seq and low-throughput techniques such as X-ray crystallography could radically improve our understanding of antibodies. The mapping of Ig-seq datasets to known antibody structures can indicate structurally, and perhaps functionally, uncharted areas. Furthermore, contrasting naïve and antigenically challenged datasets using structural antibody descriptors should provide insights into antibody maturation. As the number of antibody structures steadily increases and more and more Ig-seq datasets become available, the opportunities that arise from combining the two types of information increase as well. Here, we review how these data types enrich one another and show potential for advancing our knowledge of the immune system and improving antibody engineering.

SELECTION OF CITATIONS
SEARCH DETAIL
...