14th International CSI Computer Conference-CSICC 2009 , 2009-10-20

Title : ( Job failure in grid environment based on workload characteristics )

Authors: Hossein Deldari ,

Citation: BibTeX | EndNote

Abstract

The power of grid technology in aggregating autonomous resources owned by several organizations into a single virtual system has made it popular in compute-intensive and data-intensive applications. Complex and dynamic nature of grid makes failure of users jobs fairly probable. Furthermore, traditional methods for job failure recovery have proven costly and thus a need to shift toward proactive and predictive management strategies is necessary in such systems. In this paper, an innovative effort is made to predict the futurity of jobs submitted to a production grid environment (AuverGrid). By analyzing grid workload traces and extracting patterns describing common failure characteristics, the success or failure status of jobs during 6 months of AuverGrid activity was predicted with around 96% accuracy. The quality of services on grid can be improved by integrating the result of this work into management services like scheduling and monitoring.

Keywords

job failure
برای دانلود از شناسه و رمز عبور پرتال پویا استفاده کنید.

@inproceedings{paperid:1016834,
author = {Deldari, Hossein},
title = {Job failure in grid environment based on workload characteristics},
booktitle = {14th International CSI Computer Conference-CSICC 2009},
year = {2009},
location = {تهران, IRAN},
keywords = {job failure},
}

[Download]

%0 Conference Proceedings
%T Job failure in grid environment based on workload characteristics
%A Deldari, Hossein
%J 14th International CSI Computer Conference-CSICC 2009
%D 2009

[Download]