全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...
软件学报  2005 

JIACKPT: A Recoverable Software Distributed Shared Memory System
可恢复的软件DSM系统JIACKPT

Keywords: software distributed shared memory system,checkpoint,global consistent state,JIAJIA
软件DSM系统
,检查点,全局一致状态,JIAJIA

Full-Text   Cite this paper   Add to My Lib

Abstract:

Software distributed shared memory (DSM) system has constructed a virtual shared memory abstract on cluster, which combines the programmability of shared memory and fine scalability of cluster. So it is widely studied. Software DSM system is easy to fail because it is a distributed system, some kinds of fault tolerance are necessary for it to be more practical. A recoverable and portable software DSM system, JIACKPT (JIAjia with ChecKPoinTing), has been designed and implemented to tolerate the fault of system. JIACKPT, based on JIAJIA, has adopted the checkpointing technology. By maintaining the strict global consistent state and using some optimization techniques, JIACKPT has gotten high performance. The experimental results on an 8-node PC cluster show that the checkpoint overhead is less than 10% of the whole execution time when checkpoint is done once per minute. JIACKPT also has good portability and can run on several operating systems, such as Linux, Solaris, etc. JIACKPT is a practical recoverable software DSM system.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133