HEPiX Benchmarking Working Group

Name: HEPiX Benchmarking Working Group
Start: 2017-04-07T13:55:00+02:00
End: 2017-04-07T15:05:00+02:00
Location: CERN

Friday 7 Apr 2017, 13:55 → 15:05 Europe/Zurich

31/S-023 (CERN)

31/S-023

CERN

Show room on map

Manfred Alef (Karlsruhe Institute of Technology (KIT)), Domenico Giordano (CERN), Michele Michelotto

Hide

Meeting Date: 7/4/2017
Attendees: D. Abdurachmanov, M. Alef, O. Awile, J-M. Barbet, A. Brosa, J. Flix, D. Giordano, V. Innocente, M. Michelotto, F. Pantaleo, A. Perez-Calero, M. Reis, M. Rovere, A. Sciaba, Sverre Jarp, M. Schulz,

1) News (Domenico)

- See notes there. no further comments.

2) DB12 cpp (Domenico)

DB12 implemented in C++
- perf profile shows that main used components are math libs
- avoid that measurements are affected by the version of the Python interpreter
- is mostly not affected in performances changing CPU model (IB, HW, BW) and OS
  - DB12 python is affected on the contrary
- DB12 written in C++ in x10 faster than DB12.py
Shown the relative scale factors of DB12 (python and C++) and KV among several CPU models
- KV_speed trend seems to go in opposite direction. To be verified.
Antonio suggests that DB12.cpp gets included in the CMS pilot reports, to compare results respect to DB12 python
- D.G.: C++ version was not improved to substitute the python version but only to better understand the behaviour of DB12.py. But clearly experimetns are free to use it
- It seems that the repository (gitlab) is not accessible
  - Update: now both the gitlab.cern.ch and github.com repositories are accessible

3) Dissecting Benchmarks with perf (Vincenzo)

Effects of benchmarks stressing more the front-end or the back-end of a CPU
- HEP applications are stall because of memory stall or division (we do many)
- HS06 uses a lot of memory, much more than Geant4 (and probably python) so if Haswell is much better that IvyB in the front-end component you get an improvement that HS06 will not give
- avx is a hardware specific fast library. an important side effect is that the VM has to know the real CPU model in order to profit of it
Seen differences between running benchmark suite in same order or permuting the sequence
- for instance this can affect differently the L3 cache
- NB.: Vincenzo is not proposing the adoption of scimark, but highlights the fact also this simple benchmark suite can give different results if running the sequence in different order

4) CMS report (Pepe)

Work ongoing, updates in the next weeks. Pilots are running in production ES, cpu models is included in the report.

There are minutes attached to this event. Show them.

- 14:00 → 14:10
  News 10m
  
  Speakers: Domenico Giordano (CERN), Manfred Alef (Karlsruhe Institute of Technology (KIT)), Michele Michelotto
  1. Next Events
  
  GDB (Wednesday 12 Apr 2017)
  
  11:55 → 12:15 Benchmarking update - Domenico Giordano (CERN)
  
  https://indico.cern.ch/event/578985/
  
  Request for experiment contacts
  
  provide short summary of your ongoing (and foreseen) activities in the WG and your current findings respect to the two main topics covered in the past weeks by our working group
  
  correlation of experiments’ workloads (single and multicore) Vs HS06
  
  correlation of experiments’ workloads (single and multicore) Vs DB12
  
  This can include also
  
  tests performed with DB12 at boot, or using instances that have MJF information extended with the DB12-at-boot information (such as GridKA resources)
  
  tests obtained including fast benchmarks in the job pilots
  
  2. In answer to a discussion in the last meeting about the availability of the CPU model of the host server in CERN OpenStack VMs
  
  [Info from Arne Wielback] Available for projects having cpu mode='host-passthrough' enabled. This matches with the ComputeOptimised projects, such as Batch, CMS Tier-0
  
  Personal projects and Service projects do not have the feature enabled, then a Generic Model is passed (but still the CPU family is available)
- 14:10 → 14:20
  
  DB12 cpp 10m
  
  Speaker: Domenico Giordano (CERN)
  
  DB12_cpp.pdf
- 14:20 → 14:40
  
  Dissecting Benchmarks with perf 20m
  
  Speaker: Vincenzo Innocente (CERN)
  
  Benchmarking.pdf
- 14:40 → 14:45
  
  CMS short status report + Recent Results 5m
  
  Speaker: Jose Flix Molina (Centro de Investigaciones Energéti cas Medioambientales y Tecno)
- 14:45 → 14:50
  
  ATLAS short status report (TBC) 5m
- 14:50 → 14:55
  
  Alice short status report (TBC) 5m