Prediction of Queue Waiting Times for Metascheduling on Parallel Batch Systems


Autoria(s): Kumar, Rajath; Vadhiyar, Sathish
Data(s)

2015

Resumo

Prediction of queue waiting times of jobs submitted to production parallel batch systems is important to provide overall estimates to users and can also help meta-schedulers make scheduling decisions. In this work, we have developed a framework for predicting ranges of queue waiting times for jobs by employing multi-class classification of similar jobs in history. Our hierarchical prediction strategy first predicts the point wait time of a job using dynamic k-Nearest Neighbor (kNN) method. It then performs a multi-class classification using Support Vector Machines (SVMs) among all the classes of the jobs. The probabilities given by the SVM for the class predicted using k-NN and its neighboring classes are used to provide a set of ranges of predicted wait times with probabilities. We have used these predictions and probabilities in a meta-scheduling strategy that distributes jobs to different queues/sites in a multi-queue/grid environment for minimizing wait times of the jobs. Experiments with different production supercomputer job traces show that our prediction strategies can give correct predictions for about 77-87% of the jobs, and also result in about 12% improved accuracy when compared to the next best existing method. Experiments with our meta-scheduling strategy using different production and synthetic job traces for various system sizes, partitioning schemes and different workloads, show that the meta-scheduling strategy gives much improved performance when compared to existing scheduling policies by reducing the overall average queue waiting times of the jobs by about 47%.

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/51836/1/JSSPP_8828_108_2015.pdf

Kumar, Rajath and Vadhiyar, Sathish (2015) Prediction of Queue Waiting Times for Metascheduling on Parallel Batch Systems. In: 18th International Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP), MAY 23, 2014, Phoenix, AZ, pp. 108-128.

Publicador

SPRINGER-VERLAG BERLIN

Relação

http://dx.doi.org/ 10.1007/978-3-319-15789-4_7

http://eprints.iisc.ernet.in/51836/

Palavras-Chave #Supercomputer Education & Research Centre
Tipo

Conference Proceedings

NonPeerReviewed