17-21 October 2016
LBNL
US/Pacific timezone

Profiling data intensive workflows on Genepool and PDSF clusters at NERSC.

19 Oct 2016, 12:00
25m
Building 50 Auditorium (LBNL)

Building 50 Auditorium

LBNL

Berkeley, CA 94720
Computing & Batch Services Computing and Batch Services

Speaker

Dr Bhupender Thakur (NERSC, Lawrence Berkeley National Lab)

Description

NERSC is well known for its user friendly, large-scale computing environment. Along with the large Cray systems (Edison and Cori), NERSC also supports data intensive workflows of the Joint Genome Institute, HEP and material science community via its Genepool, PDSF and Matgen clusters. These clusters are all provisioned from a single backend cluster, Mendel. This talk will briefly outline the workflows in Mendel and provide a comparative profile of its various applications. It will also summarize various user and system incidents over the last few years of its service. A deeper analysis of the bio-informatics workflow on the Genepool compute cluster, and a plan for testing workflows on a Mendel testbed with Cori-like environment will be discussed. Finally, a prospective plan for future evolution of Genepool part of Mendel will also be outlined.

Primary author

Dr Bhupender Thakur (NERSC, Lawrence Berkeley National Lab)

Presentation Materials