

Classification for Imbalanced Microarray Data Based on Oversampling Technology and Random Forest
基于过采样技术和随机森林的不平衡微阵列数据分类方法研究

Keywords: Microarray data, Sample distribution imbalance, Oversampling technology, Probability distribution, Random forest
微阵列数据, 样本分布不平衡, 过采样技术, 概率分布, 随机森林


Abstract:

In recent years, applying DNA microarray technology to disease diagnosis, especially cancer diagnosis, has become one of the hot topics in bioinformatics. Compared with many other kinds of data, microarray data generally has some unique characteristics. To address the problem caused by one of them, the imbalanced sample distribution, a novel oversampling technique based on probability distribution is proposed. With this technique, reasonable pseudo samples are created for the minority class to guarantee balance between the two classes. A random forest is then used to classify the samples belonging to different classes. The effectiveness and feasibility of the method were verified on two benchmark microarray datasets. Experimental results show that the proposed method obtains better classification performance than some traditional approaches.
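The oversampling step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which probability distribution the authors fit, so the code below assumes, purely for illustration, that each feature of the minority class follows an independent Gaussian estimated from the observed minority samples. The function name `oversample_minority` and its parameters are hypothetical and are not taken from the paper.

```python
import random
import statistics


def oversample_minority(X, y, minority_label, seed=0):
    """Balance a two-class dataset by drawing pseudo minority samples.

    Assumption (not from the paper): every feature of the minority class is
    modelled as an independent Gaussian fitted to the observed minority rows.
    """
    rng = random.Random(seed)
    minority = [row for row, label in zip(X, y) if label == minority_label]
    n_majority = len(X) - len(minority)
    n_new = n_majority - len(minority)
    if n_new <= 0 or len(minority) < 2:
        # Already balanced, or too few minority rows to estimate a spread.
        return list(X), list(y)

    # Per-feature mean and sample standard deviation of the minority class.
    columns = list(zip(*minority))
    means = [statistics.mean(col) for col in columns]
    stdevs = [statistics.stdev(col) for col in columns]

    # Draw one value per feature for each pseudo sample.
    pseudo = [
        [rng.gauss(m, s) for m, s in zip(means, stdevs)]
        for _ in range(n_new)
    ]
    return list(X) + pseudo, list(y) + [minority_label] * n_new


# Toy data: four majority samples (label 0) and two minority samples (label 1).
X = [[0.1, 1.0], [0.2, 1.1], [0.0, 0.9], [0.3, 1.2], [2.0, 3.0], [2.2, 3.1]]
y = [0, 0, 0, 0, 1, 1]
X_bal, y_bal = oversample_minority(X, y, minority_label=1)
```

The balanced pair `X_bal`, `y_bal` could then be passed to any random forest implementation, for example the `fit(X, y)` method of scikit-learn's `RandomForestClassifier`. That choice of library is also an assumption, since the abstract does not name the implementation used.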

