Speaker
William Strecker-Kellogg
(Brookhaven National Lab)
Description
Brookhaven Lab recently acquired an Intel Knight's Landing (KNL) cluster consisting of 144 nodes connected with a dual-rail OmniPath (OPA) fabric. We will detail our experiences integrating this cluster into our environment, testing the performance and deugging issues relating to the fabric and hardware. Details about the integration with the batch system (Slurm) and performance issues found with different kernels will be discussed, as well as some results from scientific users of the system.
Scheduling constraints / preferences
I'm leaving early Friday morning
Length of talk (minutes) | 20 |
---|
Authors
William Strecker-Kellogg
(Brookhaven National Lab)
Tony Wong
(Brookhaven National Laboratory)
Costin Caramarcu
(Brookhaven National Laboratory (US))
Alexandr Zaytsev
(Brookhaven National Laboratory (US))
Christopher Hollowell
(Brookhaven National Laboratory)