|
计算机科学 2011
Research on Discernibility Matrix Knowledge Reduction Algorithm in Cloud Computing
|
Abstract:
Knowledge reduction is one of the important research issues in rough set theory. Classical knowledge reduction algorithms can only deal with small datasets,while the existing parallel knowledge reduction algorithms assume all the datasets can be loaded into the main memory and only implement reduction tasks concurrently, which is infeasible for handling large-scale data. Massive data with high dimension makes attribute reduction a challenging task. To solve this problem, the characteristics of discernibility matrix cells were analyzed, and discernibility matrix for data parallel was designed in terms of the indiscernibility of the attributes) and MapReduce programming model. Thus, large-scale data oriented discernibility matrix knowledge reduction algorithm in cloud computing was proposed. I}he experimental results demonstrate that our proposed algorithm can scale well and efficiently process largcscale datasets on commodity computers.