24–28 Apr 2017
Hungarian Academy of Sciences
Europe/Budapest timezone

Experiences With Intel Knights Landing, OmniPath and Slurm

26 Apr 2017, 14:55
25m
Hungarian Academy of Sciences

Hungarian Academy of Sciences

Széchenyi István tér 9 1051 Budapest Hungary
Computing & Batch Services Computing and batch systems

Speaker

William Strecker-Kellogg (Brookhaven National Lab)

Description

Brookhaven Lab recently acquired an Intel Knight's Landing (KNL) cluster consisting of 144 nodes connected with a dual-rail OmniPath (OPA) fabric. We will detail our experiences integrating this cluster into our environment, testing the performance and deugging issues relating to the fabric and hardware. Details about the integration with the batch system (Slurm) and performance issues found with different kernels will be discussed, as well as some results from scientific users of the system.

Scheduling constraints / preferences

I'm leaving early Friday morning

Length of talk (minutes) 20

Authors

William Strecker-Kellogg (Brookhaven National Lab) Tony Wong (Brookhaven National Laboratory) Costin Caramarcu (Brookhaven National Laboratory (US)) Alexandr Zaytsev (Brookhaven National Laboratory (US)) Christopher Hollowell (Brookhaven National Laboratory)

Presentation materials