Fourth Computational and Data Science school for HEP (CoDaS-HEP 2022)

Name: Fourth Computational and Data Science school for HEP (CoDaS-HEP 2022)
Start: 2022-08-01T08:30:00-04:00
End: 2022-08-05T13:00:00-04:00
Location: Princeton University

1–5 Aug 2022

Princeton University

US/Eastern timezone

Vector Parallelism on Multi-Core Processors

1 Aug 2022, 12:00

30m

407 Jadwin Hall (Princeton University)

407 Jadwin Hall

Princeton University

Princeton Center For Theoretical Science (PCTS)

Steven R Lantz (Cornell University (US))

All modern CPUs boost their performance through vector processing units (VPUs). VPUs are activated through special SIMD instructions that load multiple numbers into extra-wide registers and operate on them simultaneously. Intel's latest processors feature a plethora of 512-bit vector registers, as well as 1 or 2 VPUs per core, each of which can operate on 16 floats or 8 doubles in every cycle. Typically these SIMD gains are achieved not by the programmer directly, but by (a) the compiler through automatic vectorization of simple loops in the source code, or (b) function calls to highly vectorized performance libraries. Either way, vectorization is a significant component of parallel performance on CPUs, and to maximize performance, it is important to consider how well one's code is vectorized. We will take a look at vector hardware, then turn to simple code examples that illustrate how compiler-generated vectorization works.

abc_fma at godbolt.org

abc_fma.c

VectorParallelismMultiCoreProcs.pdf

Fourth Computational and Data Science school for HEP (CoDaS-HEP 2022)

Vector Parallelism on Multi-Core Processors

407 Jadwin Hall

Princeton University

Speaker

Description

Presentation materials

Choose timezone

Fourth Computational and Data Science school for HEP (CoDaS-HEP 2022)

Speaker

Description

Presentation materials