%0 Journal Article
%T Protein secondary structure co-training prediction method
%A LIU Jun
%A XIONG Zhong-yang
%A WANG Yin-hui
%A 刘君
%A 熊忠阳
%A 王银辉
%J 计算机应用研究
%D 2011
%X Machine-learning-based methods for protein secondary structure prediction suffer from low prediction accuracy because they ignore the hydrophobic properties of amino acids and the interactions between amino acids that are far apart in the sequence. A sequence of hydrophobic values can be built by replacing each amino acid with its hydrophobic value. Experiments show that a BP neural network using a long sequence of amino acid hydrophobic values performs well in predicting the E structure, which is controlled mainly by long-range amino acid interactions. Because the Profile space and the hydrophobic energy value space are both sufficient and redundant views, this paper proposes a co-training algorithm. The proposed algorithm uses two classifiers: an SVM classifier trained in the Profile space and a BP neural network classifier trained in the hydrophobic value space. Each predicts the secondary structure of an amino acid independently. If the two classifiers disagree on an amino acid, an arbitration rule proposed in this paper, based on an active selection strategy, makes the final decision. Suspected samples and creditable samples are defined according to the characteristics of the classifiers and spaces in order to arbitrate the disputed predictions. Experimental results show that the proposed algorithm achieves higher prediction accuracy than existing algorithms on both the E structure, which is controlled mainly by long-range interactions, and the H structure, which is controlled mainly by short-range interactions.
%K Co-training
%K protein
%K Secondary structure prediction
%K SVM
%K neural network
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=A9D9BE08CDC44144BE8B5685705D3AED&aid=82AB9E5FBFE4C3CE7E1AD56056749FF7&yid=9377ED8094509821&vid=D3E34374A0D77D7F&iid=94C357A881DFC066&sid=57133673016E56C3&eid=34603A9A580CC7B9&journal_id=1001-3695&journal_name=计算机应用研究&referenced_num=0&reference_num=11