Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 1 de 1
Filter
Add more filters










Database
Language
Publication year range
1.
Nat Commun ; 15(1): 4271, 2024 May 20.
Article in English | MEDLINE | ID: mdl-38769289

ABSTRACT

T Cell Receptor (TCR) antigen binding underlies a key mechanism of the adaptive immune response yet the vast diversity of TCRs and the complexity of protein interactions limits our ability to build useful low dimensional representations of TCRs. To address the current limitations in TCR analysis we develop a capacity-controlled disentangling variational autoencoder trained using a dataset of approximately 100 million TCR sequences, that we name TCR-VALID. We design TCR-VALID such that the model representations are low-dimensional, continuous, disentangled, and sufficiently informative to provide high-quality TCR sequence de novo generation. We thoroughly quantify these properties of the representations, providing a framework for future protein representation learning in low dimensions. The continuity of TCR-VALID representations allows fast and accurate TCR clustering and is benchmarked against other state-of-the-art TCR clustering tools and pre-trained language models.


Subject(s)
Receptors, Antigen, T-Cell , Receptors, Antigen, T-Cell/immunology , Receptors, Antigen, T-Cell/metabolism , Receptors, Antigen, T-Cell/genetics , Humans , Deep Learning , Algorithms , Cluster Analysis , Computational Biology/methods , Amino Acid Sequence
SELECTION OF CITATIONS
SEARCH DETAIL
...