全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Random Forest for Classification of Thermophilic and Psychrophilic Proteins Based on Amino Acid Composition Distribution
基于氨基酸组成分布的嗜热和嗜冷蛋白随机森林分类模型

Keywords: Random forest,amino acid composition distribution,thermophilic and psychrophilic protein,ROC curve
随机森林
,氨基酸组成分布,嗜热和嗜冷蛋白,ROC曲线

Full-Text   Cite this paper   Add to My Lib

Abstract:

We used amino acid composition distribution (AACD) to discriminate thermophilic and psychrophilic proteins. We used 10-fold cross-validation and independent testing with other dataset to evaluate the models. The results showed that when the segment was 1, the overall accuracy reached 92.9% and 90.2%, respectively. The AACD method improved the prediction accuracy when support vector machine was used as the classifier. When six new features were introduced, the overall accuracy of random forest improved to 93.2% and 92.2%, the areas under the receiver operation characteristic curve were 0.9771 and 0.9696, which was better than other ensemble classifiers and comparable with that of SVM.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133