OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

计算机应用研究 2012

Deduplication model based on file-similarity clustering
基于文件相似性分簇的重复数据消除模型

WANG Can,QIN Zhi-guang,WANG Juan,CAI Bo,
王灿,秦志光,王娟,蔡博

Keywords: cloud-storage,deduplication,throughput,file-similarity clustering,load balancing
云存储,重复数据消除,吞吐量,文件相似性分簇,负载均衡

Full-Text Cite this paper Add to My Lib

Abstract:

To resolve the locality dependence and multiple-nodes dependence problems of the current throughput improving methods for deduplication system, this paper proposed a deduplication model based on file-similarity clustering. This model expanded the traditional flat index structure into spatial structure. According to the Broder's theorem, it kept only a handful of the most representative indices in RAM. It partitioned the index horizontally and distributed on several totally autonomous storage nodes. The experimental results indicate that the model can effectively improve the deduplication performance and the throughput on average in the large scale cloud-storage environment, and the data loads are balanced. Therefore, the model can be extended smoothly.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

Deduplication model based on file-similarity clustering基于文件相似性分簇的重复数据消除模型

Deduplication model based on file-similarity clustering
基于文件相似性分簇的重复数据消除模型