2018

Informative Gene Selection for Tumor Classification Based on Symmetric Uncertainty and Neighborhood Rough Set

DOI: 10.16337/j.1004-9037.2018.03.005

Keywords: gene expression profiles, neighborhood rough set, symmetric uncertainty, feature selection, tumor classification


Abstract:

Informative gene selection is an essential step in building tumor classification models from large-scale gene expression profiles. Selecting tumor-related informative genes is difficult, however, because such profiles are high-dimensional with relatively few samples, are noisy, and contain many irrelevant and redundant genes. To find an informative gene subset with as few genes as possible and as strong a classification ability as possible, a hybrid gene selection method named SUNRS is proposed, based on symmetric uncertainty (SU) and the neighborhood rough set (NRS). First, the symmetric uncertainty index is used to evaluate the importance of each gene, which eliminates a large number of irrelevant and redundant genes and yields a candidate subset of top-ranked genes. Second, the neighborhood rough set reduction algorithm searches the candidate subset to obtain the target gene subset. Experimental results show that SUNRS achieves higher classification accuracy with fewer informative genes, which improves both the generalization performance and the time efficiency of the algorithm.
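The two-stage idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' SUNRS implementation: the abstract gives no algorithmic details, so the equal-width discretization, the `top_k`, `n_bins` and `delta` parameters, the min-max scaling, the greedy forward search, and every function name below are assumptions made for illustration. The only standard ingredients are the usual definition SU(X, Y) = 2 I(X; Y) / (H(X) + H(Y)) and the neighborhood rough set notion of dependency as the fraction of samples whose delta-neighborhood contains a single class.

```python
import numpy as np

def entropy(values):
    """Shannon entropy (in bits) of a discrete 1-D array."""
    _, counts = np.unique(values, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log2(p)).sum())

def symmetric_uncertainty(x, y):
    """SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)) for integer-coded x and y."""
    hx, hy = entropy(x), entropy(y)
    if hx + hy == 0.0:
        return 0.0
    # Joint entropy from the counts of distinct (x, y) pairs.
    _, counts = np.unique(np.stack([x, y], axis=1), axis=0, return_counts=True)
    p = counts / counts.sum()
    hxy = float(-(p * np.log2(p)).sum())
    return 2.0 * (hx + hy - hxy) / (hx + hy)

def discretize(column, n_bins):
    """Equal-width binning (an assumption; the abstract names no scheme)."""
    inner_edges = np.histogram_bin_edges(column, bins=n_bins)[1:-1]
    return np.digitize(column, inner_edges)

def nrs_dependency(X, y, delta):
    """Fraction of samples whose delta-neighborhood holds only one class."""
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    neighbors = dist <= delta
    same_label = y[:, None] == y[None, :]
    pure = np.all(~neighbors | same_label, axis=1)
    return float(pure.mean())

def sunrs_sketch(X, y, top_k=10, n_bins=3, delta=0.2):
    """Hypothetical two-stage selection: SU ranking, then greedy NRS reduction."""
    # Stage 1: rank genes by SU with the class label and keep the top_k.
    su = np.array([symmetric_uncertainty(discretize(X[:, j], n_bins), y)
                   for j in range(X.shape[1])])
    candidates = [int(j) for j in np.argsort(su)[::-1][:top_k]]

    # Stage 2: greedy forward search that adds the candidate gene giving the
    # largest dependency, stopping when dependency no longer increases.
    span = X.max(axis=0) - X.min(axis=0)
    Xs = (X - X.min(axis=0)) / np.where(span == 0, 1.0, span)  # min-max scaling
    selected, best = [], 0.0
    while candidates:
        scores = [nrs_dependency(Xs[:, selected + [j]], y, delta)
                  for j in candidates]
        i = int(np.argmax(scores))
        if scores[i] <= best:
            break
        best = scores[i]
        selected.append(candidates.pop(i))
    return selected, best
```

Calling `sunrs_sketch(X, y)` with `X` as a samples-by-genes NumPy array and `y` as an integer-coded label vector returns the indices of the selected genes and the dependency they reach. The pairwise distance matrix makes stage 2 quadratic in the number of samples, which is tolerable for the small-sample setting the abstract describes.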
