Design of a fault-tolerant job-flow manager for grid environments using standard technologies, job-flow patterns, and a transparent proxy

Gargi Dasgupta, Onyeka Ezenwoye, Liana Fong, Selim Kalayci, S. Masoud Sadjadi, Balaji Viswanathan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Scopus citations

Abstract

The execution of job flow applications is a reality today in academic and industrial domains. Current approaches to execution of job flows often follow proprietary solutions on expressing the job flows and do not leverage recurrent job-flow patterns to address faults in Grid computing environments. In this paper, we provide a design solution to development of job-flow managers that uses standard technologies such as BPEL and JSDL to express job flows and employs a two-layer peer-to-peer architecture with interoperable protocols for cross-domain interactions among job-flow mangers. In addition, we identify a number of recurring job-flow patterns and introduce their corresponding fault-tolerant patterns to address runtime faults and exceptions. Finally, to keep the business logic of job flows separate from their fault-tolerant behavior, we use a transparent proxy that intercepts job-flow execution at runtime to handle potential faults using a growing knowledge base that contains the most recently identified job-flow patterns and their corresponding fault-tolerant patterns.

Original languageEnglish (US)
Title of host publication20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008
Pages814-819
Number of pages6
StatePublished - 2008
Externally publishedYes
Event20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008 - San Francisco Bay, CA, United States
Duration: Jul 1 2008Jul 3 2008

Publication series

Name20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008

Other

Other20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008
Country/TerritoryUnited States
CitySan Francisco Bay, CA
Period7/1/087/3/08

Keywords

  • BPEL
  • Fault tolerant
  • Grid computing
  • JSDL
  • Job-flow patterns
  • Peer-to-peer
  • Software design

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Design of a fault-tolerant job-flow manager for grid environments using standard technologies, job-flow patterns, and a transparent proxy'. Together they form a unique fingerprint.

Cite this