DOI: 10.5176/978-981-08-7656-2 A-40

Authors: A.Shamila Ebenezer, Dr.Baskaran

Abstract: Computational Grid enables the aggregation and sharing of geographically distributed computational resources for solving various large-scale applications. Due to the widespread use of resources, systems are highly prone to errors and failures. Hence fault tolerance plays a key role in grid to avoid the problem of unreliability. The two main techniques for implementing fault tolerance in grid environment are check pointing and replication. In this paper, a survey of the different replication techniques is presented. The goal of replication is to ensure at least one replica is always able to continue and complete the computation in the event of failure of other resources. This paper describes the different job replication techniques that have been proposed to improve the fault tolerance in the grid
environment. This study will help the researchers who are undertaking their research in improving the fault tolerance in grid, to analyze and compare different job replication techniques with their own work.

Keywords: Optimal Job replication, fault tolerance, Grid computing

simplr_role_lock:

Price: $0.00

Loading Updating cart...
LoadingUpdating...