23–28 Oct 2022
Villa Romanazzi Carducci, Bari, Italy
Europe/Rome timezone

Ceph S3 Object Storage for CMS data

27 Oct 2022, 11:00
30m
Area Poster (Floor -1) (Villa Romanazzi)

Area Poster (Floor -1)

Villa Romanazzi

Poster Track 1: Computing Technology for Physics Research Poster session with coffee break

Speaker

Nick Smith (Fermi National Accelerator Lab. (US))

Description

To support the needs of novel collider analyses such as long-lived particle searches, considerable computing resources are spent forward-copying data products from low-level data tiers like CMS AOD and MiniAOD to reduced data formats for end-user analysis tasks. In the HL-LHC era, it will be increasingly difficult to ensure online access to low-level data formats. In this talk, we present a novel online data storage mechanism that obviates the need for data tiers by storing individual data products in column objects using RadosGW, a Ceph object store technology. Benchmarks of the performance of storage and retrieval of the event data through the S3 protocol for a prototype of typical analysis workflows will be presented, and compared with traditional xrootd ROOT file access protocols.

References

https://indico.cern.ch/event/1125222/timetable/?view=standard#32-object-store-rd
https://uscms-software-and-computing.github.io/postdocs/nsmith-.html

Significance

The use of Ceph object stores and S3 protocol to access experiment data is novel within HEP. Our experience will help guide evaluation and possible adoption of these technologies.

Experiment context, if any CMS

Primary authors

Bo Jayatilaka (Fermi National Accelerator Lab. (US)) David Alexander Mason (Fermi National Accelerator Lab. (US)) Nick Smith (Fermi National Accelerator Lab. (US)) Oliver Gutsche (Fermi National Accelerator Lab. (US))

Presentation materials