Journal of Software 2010

Feature Selection via Correlation Coefficient Clustering

DOI: 10.4304/jsw.5.12.1371-1377

Hui-Huang Hsu, Cheng-Wei Hsieh

Keywords: Feature Selection, Clustering, Correlation Coefficient, Support Vector Machines (SVMs), Machine Learning, Classification

Abstract:

Feature selection is a fundamental problem in machine learning and data mining: it is essential to choose the most problem-relevant features from the set of collected features. This paper proposes a novel method that uses correlation coefficient clustering to remove similar or redundant features. The collected features are grouped into clusters according to the correlation coefficients between them. The most class-dependent feature in each cluster is retained, and the others in that cluster are removed. The features that remain are thus the most class-related and are mutually unrelated. The proposed method was applied to two datasets: the disordered protein dataset and the Arrhythmia (ARR) dataset. The experimental results show that the method is superior to other feature selection methods in speed and/or accuracy. Detailed discussions are given in the paper.
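
The idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the clustering algorithm, the correlation threshold, or how class dependence is measured, so everything below is an assumption made for illustration: features are linked when the absolute Pearson correlation between them reaches a hypothetical `threshold`, clusters are the connected groups of linked features, and class dependence is approximated by the absolute correlation between a feature and numeric class labels. The names `pearson` and `select_features` are hypothetical and do not come from the paper.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0  # constant column: treated as uncorrelated (assumption)
    return cov / (sx * sy)


def select_features(columns, labels, threshold=0.9):
    """Return the index of one representative feature per cluster.

    columns:   list of feature columns, each a list of numbers
    labels:    numeric class labels, one per sample
    threshold: assumed |r| cutoff above which two features are linked
    """
    m = len(columns)
    parent = list(range(m))  # union-find forest over feature indices

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    # Link every pair of features whose correlation is strong enough.
    for i in range(m):
        for j in range(i + 1, m):
            if abs(pearson(columns[i], columns[j])) >= threshold:
                parent[find(i)] = find(j)

    # Collect the clusters (connected groups of linked features).
    clusters = {}
    for i in range(m):
        clusters.setdefault(find(i), []).append(i)

    # Keep the feature most correlated with the class labels in each cluster.
    relevance = [abs(pearson(col, labels)) for col in columns]
    return sorted(max(group, key=lambda i: relevance[i])
                  for group in clusters.values())


# Toy data: f0 and f1 are strongly correlated, f2 is unrelated to both.
f0 = [1, 2, 3, 4, 5, 6]
f1 = [1, 1.5, 2, 5, 5.5, 6]
f2 = [5, 1, 4, 2, 6, 3]
labels = [0, 0, 0, 1, 1, 1]

print(select_features([f0, f1, f2], labels, threshold=0.9))  # [1, 2]
```

In this toy example, f0 and f1 fall into one cluster and f1 is kept because it tracks the labels more closely, while f2 forms its own cluster and is kept as well. The result depends heavily on the chosen threshold and linkage rule, which may differ from those used in the paper.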
