Your browser doesn't support javascript.
loading
Protein multi-level structure feature-integrated deep learning method for mutational effect prediction.
Pang, Ai-Ping; Luo, Yongsheng; Zhou, Junping; Cai, Xue; Huang, Lianggang; Zhang, Bo; Liu, Zhi-Qiang; Zheng, Yu-Guo.
Afiliação
  • Pang AP; National and Local Joint Engineering Research Center for Biomanufacturing of Chiral Chemicals, Zhejiang University of Technology, Hangzhou, People's Republic of China.
  • Luo Y; Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, People's Republic of China.
  • Zhou J; Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai, China.
  • Cai X; National and Local Joint Engineering Research Center for Biomanufacturing of Chiral Chemicals, Zhejiang University of Technology, Hangzhou, People's Republic of China.
  • Huang L; Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, People's Republic of China.
  • Zhang B; National and Local Joint Engineering Research Center for Biomanufacturing of Chiral Chemicals, Zhejiang University of Technology, Hangzhou, People's Republic of China.
  • Liu ZQ; Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, People's Republic of China.
  • Zheng YG; National and Local Joint Engineering Research Center for Biomanufacturing of Chiral Chemicals, Zhejiang University of Technology, Hangzhou, People's Republic of China.
Biotechnol J ; 19(8): e2400203, 2024 Aug.
Article em En | MEDLINE | ID: mdl-39115336
ABSTRACT
Through iterative rounds of mutation and selection, proteins can be engineered to enhance their desired biological functions. Nevertheless, identifying optimal mutation sites for directed evolution remains challenging due to the vastness of the protein sequence landscape and the epistatic mutational effects across residues. To address this challenge, we introduce MLSmut, a deep learning-based approach that leverages multi-level structural features of proteins. MLSmut extracts salient information from protein co-evolution, sequence semantics, and geometric features to predict the mutational effect. Extensive benchmark evaluations on 10 single-site and two multi-site deep mutation scanning datasets demonstrate that MLSmut surpasses existing methods in predicting mutational outcomes. To overcome the limited training data availability, we employ a two-stage training strategy initial coarse-tuning on a large corpus of unlabeled protein data followed by fine-tuning on a curated dataset of 40-100 experimental measurements. This approach enables our model to achieve satisfactory performance on downstream protein prediction tasks. Importantly, our model holds the potential to predict the mutational effects of any protein sequence. Collectively, these findings suggest that our approach can substantially reduce the reliance on laborious wet lab experiments and deepen our understanding of the intricate relationships between mutations and protein function.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Proteínas / Aprendizado Profundo / Mutação Idioma: En Revista: Biotechnol J Assunto da revista: BIOTECNOLOGIA Ano de publicação: 2024 Tipo de documento: Article País de publicação: Alemanha

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Proteínas / Aprendizado Profundo / Mutação Idioma: En Revista: Biotechnol J Assunto da revista: BIOTECNOLOGIA Ano de publicação: 2024 Tipo de documento: Article País de publicação: Alemanha