CN109326329A

Info

Publication number: CN109326329A
Application number: CN201811353819.0A
Authority: CN
Inventors: 李慧
Original assignee: Jinling Institute of Technology
Current assignee: Jinling Institute of Technology
Priority date: 2018-11-14
Filing date: 2018-11-14
Publication date: 2019-02-12
Anticipated expiration: 2038-11-14
Also published as: CN109326329B

Abstract

The invention discloses a method, based on ensemble learning, for predicting zinc-binding protein interaction sites under class imbalance. Protein source data are first preprocessed according to the characteristics of zinc-binding protein interaction sites. Random undersampling is then applied to balance the imbalanced zinc-binding site data, yielding several balanced sub-datasets. On each balanced sub-dataset, discriminative biochemical features of the protein are selected and encoded to form feature vectors. Each feature vector is used as input to a support vector machine base classifier, sample weights are computed, and a sample-weighted probabilistic neural network model is then constructed. Finally, the support vector machine base classifiers and the sample-weighted probabilistic neural network models are integrated to obtain the prediction model, which is used to identify zinc-binding protein interaction sites in target samples.
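The undersampling and sample-weighted scoring steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names (`balanced_subsets`, `weighted_pnn_score`, `ensemble_score`), the Gaussian kernel width `sigma`, and the uniform placeholder weights are all assumptions made for illustration. The abstract does not state how the sample weights are derived from the support vector machine, so the SVM stage is omitted here and the weights are simply passed in.

```python
import math
import random


def balanced_subsets(positives, negatives, n_subsets, seed=0):
    """Random undersampling (illustrative).

    Each subset keeps every positive (binding-site) sample and adds an
    equally sized random draw, without replacement, from the negatives.
    Assumes len(negatives) >= len(positives).
    """
    rng = random.Random(seed)
    k = len(positives)
    subsets = []
    for _ in range(n_subsets):
        sampled_neg = rng.sample(negatives, k)
        subset = [(x, 1) for x in positives] + [(x, 0) for x in sampled_neg]
        subsets.append(subset)
    return subsets


def weighted_pnn_score(x, train, weights, sigma=1.0):
    """Sample-weighted probabilistic neural network score (illustrative).

    Every training pattern contributes a Gaussian kernel scaled by its
    weight; the result is the estimated share of class 1 density at x.
    """
    density = {0: 0.0, 1: 0.0}
    for (xi, yi), w in zip(train, weights):
        d2 = sum((a - b) ** 2 for a, b in zip(x, xi))
        density[yi] += w * math.exp(-d2 / (2.0 * sigma ** 2))
    total = density[0] + density[1]
    return density[1] / total if total > 0 else 0.5


def ensemble_score(x, subsets, sigma=1.0):
    """Average the per-subset scores.

    Uniform weights are a placeholder assumption; in the described method
    the weights would come from the SVM stage.
    """
    scores = [
        weighted_pnn_score(x, subset, [1.0] * len(subset), sigma)
        for subset in subsets
    ]
    return sum(scores) / len(scores)
```

A hypothetical call would build the subsets once, for example `subsets = balanced_subsets(pos, neg, 5)`, and then score each candidate residue's feature vector with `ensemble_score(x, subsets)`, treating values above a chosen threshold as predicted binding sites.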