Abstract
Heterogeneous architectures comprising a multi-core CPU and many-core GPU(s) are increasingly being used within cluster and cloud environments. In this paper, we study the problem of optimizing the overall throughput of a set of applications deployed on a cluster of such heterogeneous nodes. We consider two different scheduling formulations. In the first formulation, we consider jobs that can be executed on either the GPU or the CPU of a single node. In the second formulation, we consider jobs that can be executed on the CPU, GPU, or both, of any number of nodes in the system. We have developed scheduling schemes addressing both of the problems. In our evaluation, we first show that the schemes proposed for first formulation outperform a blind round-robin scheduler and approximate the performances of an ideal scheduler that involves an impractical exhaustive exploration of all possible schedules. Next, we show that the scheme proposed for the second formulation outperforms the best of existing schemes for heterogeneous clusters, TORQUE and MCT, by up to 42%. Additionally, we evaluate the robustness of our proposed scheduling policies under inaccurate inputs to account for real execution scenarios. We show that, with up to 20% of inaccuracy in the input, the degradation in performance is marginal (less than 7%) on the average.
Original language | English (US) |
---|---|
Pages (from-to) | 2262-2271 |
Number of pages | 10 |
Journal | Future Generation Computer Systems |
Volume | 29 |
Issue number | 8 |
DOIs | |
State | Published - 2013 |
Externally published | Yes |
Event | 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) - Ottawa, Canada Duration: May 13 2012 → May 16 2012 |
Keywords
- CPU-GPU systems
- Scheduling
ASJC Scopus subject areas
- Software
- Hardware and Architecture
- Computer Networks and Communications