全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Deduplication model based on file-similarity clustering
基于文件相似性分簇的重复数据消除模型

Keywords: cloud-storage,deduplication,throughput,file-similarity clustering,load balancing
云存储
,重复数据消除,吞吐量,文件相似性分簇,负载均衡

Full-Text   Cite this paper   Add to My Lib

Abstract:

To resolve the locality dependence and multiple-nodes dependence problems of the current throughput improving methods for deduplication system, this paper proposed a deduplication model based on file-similarity clustering. This model expanded the traditional flat index structure into spatial structure. According to the Broder's theorem, it kept only a handful of the most representative indices in RAM. It partitioned the index horizontally and distributed on several totally autonomous storage nodes. The experimental results indicate that the model can effectively improve the deduplication performance and the throughput on average in the large scale cloud-storage environment, and the data loads are balanced. Therefore, the model can be extended smoothly.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133