Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 1 de 1
Filter
Add more filters










Database
Language
Publication year range
1.
Science ; 379(6637): 1123-1130, 2023 03 17.
Article in English | MEDLINE | ID: mdl-36927031

ABSTRACT

Recent advances in machine learning have leveraged evolutionary information in multiple sequence alignments to predict protein structure. We demonstrate direct inference of full atomic-level protein structure from primary sequence using a large language model. As language models of protein sequences are scaled up to 15 billion parameters, an atomic-resolution picture of protein structure emerges in the learned representations. This results in an order-of-magnitude acceleration of high-resolution structure prediction, which enables large-scale structural characterization of metagenomic proteins. We apply this capability to construct the ESM Metagenomic Atlas by predicting structures for >617 million metagenomic protein sequences, including >225 million that are predicted with high confidence, which gives a view into the vast breadth and diversity of natural proteins.


Subject(s)
Evolution, Molecular , Machine Learning , Proteins , Sequence Analysis, Protein , Amino Acid Sequence , Proteins/chemistry , Protein Conformation
SELECTION OF CITATIONS
SEARCH DETAIL
...