PDSF Site Report

May 14, 2018, 11:20 AM
Georg Rath (Lawrence Berkeley National Laboratory)


PDSF, the Parallel Distributed Systems Facility, was moved from Oakland, CA to Lawrence Berkeley National Laboratory in 2016. The cluster has been in continuous operation since 1996, serving high-energy physics research. It is a tier-1 site for STAR, a tier-2 site for ALICE, and a tier-3 site for ATLAS.

This site report will describe lessons learned and challenges met while migrating from Univa Grid Engine to the Slurm scheduler, experiences running containerized software stacks with Shifter, and upcoming changes to systems management and the future of PDSF.



James Botts (LBNL), Jeff Porter (Lawrence Berkeley National Lab. (US)), Jan Balewski (NERSC), Douglas Jacobsen (NERSC), Lisa Gerhardt (LBNL), Tina Declerck (NERSC), Tony Quan (LBL), Ershaad Basheer (NERSC)

