13–17 Oct 2014
University of Nebraska - Lincoln
America/Chicago timezone

Evaluating Infiniband Based Networking Solutions for HEP/NP Data Processing Applications

14 Oct 2014, 16:30
30m
Nebraska Union - Regency Suite (University of Nebraska - Lincoln)

GPS coordinates: N 40.81, W 96.70 (https://www.google.com/maps/place/Nebraska+Union/@40.8177945,-96.7004067,17z)
Security & Networking

Speaker

Mr Alexandr Zaytsev (Brookhaven National Laboratory (US))

Description

InfiniBand is a long-established and still rapidly developing technology that currently dominates the field of low-latency, high-throughput interconnects for HPC systems in general, and for those in the TOP500 list in particular. Over the last four years, InfiniBand combined with an IP-over-IB protocol layer and InfiniBand-to-Ethernet bridging has also been used successfully well beyond the realm of HPC, in various high-throughput computing (HTC) systems, including data processing farms and private clouds devoted to HEP/NP data processing. With the recent advances of Mellanox VPI technology in 2013-2014, 4X FDR InfiniBand now stands as the most versatile networking solution available for existing and future data centers that need to support both HTC- and HPC-oriented activities, and it can be integrated seamlessly into existing Ethernet-based infrastructure. This integration can be made completely transparent to the end users of such facilities, although certain modifications of end-user activity patterns are needed to exploit the full potential of an InfiniBand-based networking infrastructure.

This contribution reports in detail on the tests and evaluation activities performed within the RACF over the last year to assess a Mellanox 4X FDR InfiniBand networking architecture (with an oversubscribed tree topology) as a potential alternative networking solution for both the RHIC and ATLAS data processing farms of the RACF, as well as for the existing dCache and future Ceph-based storage systems associated with them. Results of a price/performance comparison between such a system and a competing solution based on 10 GbE technology (with a non-blocking fabric topology), for a HEP/NP data processing farm of 1500 compute nodes, are presented.

Job placement optimizations in Condor were implemented for the offline data processing farm of the PHENIX experiment in order to demonstrate a sample user activity pattern that makes better use of the InfiniBand-based networking solution. Results from the last nine months of running these optimizations in production for PHENIX are also presented.
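As a rough illustration of the price/performance trade-off discussed above, the sketch below compares per-node and aggregate bisection bandwidth for an oversubscribed 4X FDR InfiniBand tree against a non-blocking 10 GbE fabric for a 1500-node farm. The 3:1 oversubscription ratio is an assumption chosen for illustration only; the actual RACF topology and measured figures are those reported in the contribution itself.

```python
# Back-of-the-envelope bandwidth comparison (illustrative assumptions only).
NODES = 1500

# 4X FDR InfiniBand: 56 Gb/s link rate, ~54.5 Gb/s effective with 64b/66b
# encoding; per-node bisection bandwidth shrinks with oversubscription.
FDR_EFFECTIVE_GBPS = 54.5
OVERSUBSCRIPTION = 3.0   # assumed leaf-to-spine ratio, not the RACF figure

# 10 GbE on a non-blocking fabric: the full 10 Gb/s is always available.
GBE_GBPS = 10.0

ib_per_node = FDR_EFFECTIVE_GBPS / OVERSUBSCRIPTION
print(f"4X FDR IB (3:1 oversubscribed): {ib_per_node:.1f} Gb/s per node, "
      f"{ib_per_node * NODES / 1000:.1f} Tb/s aggregate")
print(f"10 GbE (non-blocking):          {GBE_GBPS:.1f} Gb/s per node, "
      f"{GBE_GBPS * NODES / 1000:.1f} Tb/s aggregate")
```

The abstract does not specify how the Condor job placement optimizations for PHENIX work; the following is a minimal, hypothetical sketch of the general idea of locality-aware placement, preferring an execution node that shares an InfiniBand leaf switch with the server holding the job's input, so that traffic stays off the oversubscribed uplinks. The node names and switch map are invented for illustration.

```python
# Hypothetical locality-aware placement: keep job traffic within one IB
# leaf switch where possible, avoiding the oversubscribed uplinks.
LEAF_OF = {                     # invented node -> leaf-switch map
    "node01": "leaf1", "node02": "leaf1",
    "node03": "leaf2", "node04": "leaf2",
}

def place_job(input_server: str, free_nodes: list[str]) -> str:
    """Prefer a free node on the same leaf switch as the input data."""
    local = [n for n in free_nodes if LEAF_OF[n] == LEAF_OF[input_server]]
    return (local or free_nodes)[0]    # fall back to any free node

print(place_job("node01", ["node02", "node03"]))  # -> node02 (same leaf)
```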

Primary author

Mr Alexandr Zaytsev (Brookhaven National Laboratory (US))

Co-authors

Mr Christopher Hollowell (Brookhaven National Laboratory)
Dr Ofer Rind (Brookhaven National Laboratory)
Dr Tony Wong (Brookhaven National Laboratory)
Mr William Strecker-Kellogg (Brookhaven National Laboratory)
