Oct 17 – 21, 2016
US/Pacific timezone

Profiling data intensive workflows on Genepool and PDSF clusters at NERSC.

Oct 19, 2016, 12:00 PM
Building 50 Auditorium (LBNL)

Building 50 Auditorium


Berkeley, CA 94720
Computing & Batch Services Computing and Batch Services


Dr Bhupender Thakur (NERSC, Lawrence Berkeley National Lab)


NERSC is well known for its user friendly, large-scale computing environment. Along with the large Cray systems (Edison and Cori), NERSC also supports data intensive workflows of the Joint Genome Institute, HEP and material science community via its Genepool, PDSF and Matgen clusters. These clusters are all provisioned from a single backend cluster, Mendel. This talk will briefly outline the workflows in Mendel and provide a comparative profile of its various applications. It will also summarize various user and system incidents over the last few years of its service. A deeper analysis of the bio-informatics workflow on the Genepool compute cluster, and a plan for testing workflows on a Mendel testbed with Cori-like environment will be discussed. Finally, a prospective plan for future evolution of Genepool part of Mendel will also be outlined.

Primary author

Dr Bhupender Thakur (NERSC, Lawrence Berkeley National Lab)

Presentation materials