Speaker
Georg Rath
(Lawrence Berkeley National Laboratory)
Description
PDSF, the Parallel Distributed Systems Facility, was moved to Lawrence Berkeley National Lab from Oakland CA in 2016. The cluster has been in continuous operation since 1996 serving high energy physics research. The cluster is a tier-1 site for Star, a tier-2 site for Alice and a tier-3 site for Atlas.
This site report will describe lessons learned and challenges met, when migrating from Univa GridEngine to the Slurm scheduler, experiences running containerized software stacks using Shifter, as well as upcoming changes to systems management and the future of PDSF.
Desired length | 12 |
---|
Author
Georg Rath
(Lawrence Berkeley National Laboratory)
Co-authors
James Botts
(LBNL)
Jeff Porter
(Lawrence Berkeley National Lab. (US))
Jan Balewski
(NERSC)
Douglas Jacobsen
(NERSC)
Lisa Gerhardt
(LBNL)
Tina Declerck
(NERSC)
Tony Quan
(LBL)
Mr
Ershaad Basheer
(NERSC)