1–5 Sept 2014
Faculty of Civil Engineering
Europe/Prague timezone

Distributed job scheduling in MetaCentrum

4 Sept 2014, 14:10
25m
C217 (Faculty of Civil Engineering)

Faculty of Civil Engineering, Czech Technical University in Prague, Thakurova 7/2077, Prague 166 29, Czech Republic
Oral, Computing Technology for Physics Research

Speaker

Mr Šimon Tóth (CESNET)

Description

MetaCentrum, the Czech national grid, provides access to a variety of computing resources across the Czech Republic. In this talk, we describe the unique features of the job scheduling system used in MetaCentrum. The system is based on a heavily modified Torque batch system, extended to meet the requirements of such a large installation. We describe a distributed setup of several standalone servers that can operate independently while cooperating schedulers preserve global scheduling. We also cover extensions for scheduling GPU jobs and for encapsulating jobs in virtual machines (started on demand) or even virtual clusters (hidden in a private virtual network prepared on demand).

Primary authors

Mr Miroslav Ruda (CESNET), Mr Šimon Tóth (CESNET)
