14–18 Oct 2013
Amsterdam, Beurs van Berlage
Europe/Amsterdam timezone

NaNet: a low-latency NIC enabling GPU-based, real-time low level trigger systems.

15 Oct 2013, 13:30
20m
Verwey Kamer (Amsterdam, Beurs van Berlage)

Verwey Kamer

Amsterdam, Beurs van Berlage

Oral presentation to parallel session Data acquisition, trigger and controls Data Acquisition, Trigger and Controls

Speaker

Alessandro Lonardo (INFN, Roma I (IT))

Description

The integration of GPUs in trigger and data acquisition systems is currently being investigated in several HEP experiments. At higher trigger levels, when the efficient many-core parallelization of event reconstruction algorithms is possible, the benefit of reducing significantly the number of the farm computing nodes is evident. At lower levels, where tipically severe real-time constraints are present and custom hardware is used, the advantages of GPUs adoption is less straightforward. A pilot project within the CERN NA62 experiment is investigating the usage of GPUs in the central Level 0 trigger processor, exploiting their computing power to implement efficent, high throughput event selection algorithms while retaining the real-time requisites of the system. One of the project preliminary results was that data transfer over GbE links from readout boards to GPU memories using commodity NICs and vanilla software stack consumed the biggest part of the time budget and was the main source of fluctuations in the global system response time. In order to reduce data transfer latency and its fluctuations we envisioned the usage of the GPUDirect RDMA technology, injecting readout data directly from the NIC into the GPU memories without any intermediate buffering, and the offloading of the network stack protocol management from the CPU, eliminating OS contribution to latency and jitter. We implemented these two features in the NaNet FPGA-based NIC: the first was inherited from the APEnet+ 3D NIC development, while the second was realized integrating an Open IP provided by the FPGA vendor. We will provide a deep description of the NaNet architecture and a detailed performance analysis of the integrated system on the NA62 RICH detector GPU-based L0 trigger processor case study, along with an insight of future developments.

Primary authors

Alessandro Lonardo (INFN, Roma I (IT)) Andrea Biagioni (Universita e INFN, Roma I (IT)) Davide Rossetti (U) Dr Francesca Locicero (INFN) Dr Francesco Simula (INFN) Dr Laura Tosoratto (INFN) Dr Ottorino Frezza (INFN) Dr Pier Stanislao Paolucci (INFN) Piero Vicini (INFN Rome Section) Roberto Ammendola (INFN)

Co-authors

Felice Pantaleo (CERN - University of Pisa) Gianluca Lamanna (CERN) Marco Sozzi (Sezione di Pisa (IT)) Riccardo Fantechi (Sezione di Pisa (IT))

Presentation materials