21–27 Mar 2009
Prague
Europe/Prague timezone

Data Location-Aware Job Scheduling in the Grid. Application to the GridWay Metascheduler.

26 Mar 2009, 08:00
1h
Prague

Prague

Prague Congress Centre 5. května 65, 140 00 Prague 4, Czech Republic
Board: Thursday 004
poster Grid Middleware and Networking Technologies Poster session

Speaker

Mr Antonio Delgado Peris (CIEMAT)

Description

Grid infrastructures constitute nowadays the core of the computing facilities of the biggest LHC experiments. These experiments produce and manage petabytes of data per year and run thousands of computing jobs every day to process that data. It is the duty of metaschedulers to allocate the tasks to the most appropriate resources at the proper time. Our work reviews the policies that have been proposed for the scheduling of grid jobs in the context of very data-intensive applications. We indicate some of the practical problems that such models will face and describe what we consider essential characteristics of an optimum scheduling system: aim to minimize not only job turnaround time but also data replication, flexibility to support different virtual organization requirements and capability to coordinate the tasks of data placement and job allocation while keeping their execution decoupled. These ideas have guided the development of an enhanced prototype for GridWay, a general purpose metascheduler, part of the Globus Toolkit and member of the EGEE's RESPECT program. Current GridWay's scheduling algorithm is unaware of data location. Our prototype makes it possible for job requests to set data needs not only as absolute requirements but also as functions for resource ranking. As our tests show, this makes it more flexible than currently used resource brokers to implement different data-aware scheduling algorithms.

Primary authors

Mr Antonio Delgado Peris (CIEMAT) Dr Eduardo Huedo (Universidad Complutense de Madrid) Dr Ignacio M Llorente (Universidad Complutense de Madrid) Dr Jose Hernandez (CIEMAT)

Presentation materials