PLANK J S,THOMASON M G.Processor allocation and checkpoint interval selection in cluster computing systems[J].Journal of Parallel and Distributed Computing,2001,l8(3):111-126.
[2]
GUPTA B,RAHIMI S.A novel recovery approach for cluster federations[J].Advances in Grid and Pervasive Computing,2007,4459(2):519-530.
[3]
KRISHNAN S,GANNON D.Checkpoint and restart for distributed components in xcat3[C]∥The 5th IEEE/ACMInternational Workshop on Grid Computing.New Jercy:IEEE Computer Society,2005:117-128.
[4]
EILAM T,KALANTAR M.Model-based automation of service deployment in a constrained environment[R].NewYork:IBMWatson Research Center,2004:23-33.
[5]
ZHAN J F.Fire phoenix cluster operating system kernel and its evaluation[C]∥IEEE International Conference on ClusterComputing 2005.Boston:IEEE Computer Society,2005:112-131.
[6]
LIU TY,XU Z W.Clustone:a service-based cluster middleware[C]∥The Fourth International Conference on Parallel andDistributed Computing,Applications and Technologies.Chengdu:IEEE Computer Society,2003:11-20.
[7]
COULOURIS G,DOLLIMORE J,KINDBERG T.Distributed systems:concepts and design[M].4th ed.New Jersey:Addison Wesley,2008:13-45.
[8]
MENA S,SCHIPER A.A step towards a new generation of group communication systems[C]∥ACM/IFIP/USENIXInternational Middleware Conference on Middleware 2003.Rio de Janeiro,Brazil:Springer,2003:122-128.
[9]
KEIDAR I,SUSSMAN J,MARZULLO K,et al.A group membership service for WANs[J].ACMTransactions on ComputerSystems,2002,20(3):1-48.
[10]
CHOCKLER G V,KEIDAR I,VITENBERG R.Group communication specifications:a comprehensive study[J].ACMComp Surveys,2001,33(4):1-43.
[11]
孟丹,孙凝辉,徐志伟.高性能计算机曙光4000 A的网格使能特征[J].计算机研究与发展,2004,41(12):2079-2086.MENG Dan,SUN Ning-hui,XU Zhi-wei.Grid-enabling features of the dawning 4000 A high-performance computer[J].Journal of Computer Research and Development,2004,41(12):2079-2086.(in Chinese)