Your browser doesn't support javascript.
HKAM-MKM: A hybrid kernel alignment maximization-based multiple kernel model for identifying DNA-binding proteins.
Zhao, Shulin; Ding, Yijie; Liu, Xiaobin; Su, Xi.
  • Zhao S; Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China; Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China.
  • Ding Y; Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China.
  • Liu X; Department of Nephrology, The Affiliated Wuxi People's Hospital of Nanjing Medical University, 214023, Wuxi, China. Electronic address: viplxb163@163.com.
  • Su X; Foshan Maternal and Child Health Hospital, Foshan, Guangdong, China. Electronic address: xisu_fsfy@163.com.
Comput Biol Med ; 145: 105395, 2022 06.
Article in English | MEDLINE | ID: covidwho-1894895
ABSTRACT
The identification of DNA-binding proteins (DBPs) has always been a hot issue in the field of sequence classification. However, considering that the experimental identification method is very resource-intensive, the construction of a computational prediction model is worthwhile. This study developed and evaluated a hybrid kernel alignment maximization-based multiple kernel model (HKAM-MKM) for predicting DBPs. First, we collected two datasets and performed feature extraction on the sequences to obtain six feature groups, and then constructed the corresponding kernels. To ensure the effective utilisation of the base kernel and avoid ignoring the difference between the sample and its neighbours, we proposed local kernel alignment to calculate the kernel between the sample and its neighbours, with each sample as the centre. We combined the global and local kernel alignments to develop a hybrid kernel alignment model, and balance the relationship between the two through parameters. By maximising the hybrid kernel alignment value, we obtained the weight of each kernel and then linearly combined the kernels in the form of weights. Finally, the fused kernel was input into a support vector machine for training and prediction. Finally, in the independent test sets PDB186 and PDB2272, we obtained the highest Matthew's correlation coefficient (MCC) (0.768 and 0.5962, respectively) and the highest accuracy (87.1% and 78.43%, respectively), which were superior to the other predictors. Therefore, HKAM-MKM is an efficient prediction tool for DBPs.
Subject(s)
Keywords

Full text: Available Collection: International databases Database: MEDLINE Main subject: Algorithms / DNA-Binding Proteins Type of study: Experimental Studies / Prognostic study / Randomized controlled trials Language: English Journal: Comput Biol Med Year: 2022 Document Type: Article Affiliation country: J.compbiomed.2022.105395

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Algorithms / DNA-Binding Proteins Type of study: Experimental Studies / Prognostic study / Randomized controlled trials Language: English Journal: Comput Biol Med Year: 2022 Document Type: Article Affiliation country: J.compbiomed.2022.105395