Speaker
Jerome Belleman
(CERN)
Description
The CERN batch service runs a cluster of 60k CPU cores using Platform LSF. We present some of the challenges of running a service at this scale and describe our current plans for evolving the system into a more dynamic, larger-scale service.
As part of this, we recently undertook a project to develop new monitoring tools and upgrade the batch accounting system; we present the current state of development in this area.
Author
Jerome Belleman
(CERN)