Speaker
Jerome Belleman
(CERN)
Description
The CERN Batch System comprises 4000 worker nodes and 60 queues, and offers
a service to various types of large user communities. In light of the
developments driven by the Agile Infrastructure and the more demanding
processing requirements, it is faced with increasingly challenging scalability
and flexibility needs.
This production cluster currently runs IBM/Platform LSF. Over the last few
months, an increasing number of large-scale interventions have had to take place,
exposing some critical limitations we will need to overcome in the future. We
have started working on a project to implement workflows that will help
us face these problems.
Author
Jerome Belleman
(CERN)