8โ€“12 Sept 2025
Hamburg, Germany
Europe/Berlin timezone

New Transformation Capabilities and Workflow Integration for ServiceX, a Delivery System for Distributed Data

10 Sept 2025, 11:00
30m
ESA W 'West Wing'

ESA W 'West Wing'

Poster Track 1: Computing Technology for Physics Research Poster session with coffee break

Speaker

Artur Cordeiro Oudot Choi (University of Washington (US))

Description

The ServiceX project aims to provide a data extraction and delivery service for HEP analysis data, accessing files from distributed stores and applying user-configured transformations on them. ServiceX aims to support many existing analysis workflows and tools in as transparent a manner as possible, while enabling new technologies. We will discuss the most recent backends added to ServiceX, including RDataFrame support and the ability to read and write RNTuples. We will also discuss usability improvements in the user client libraries, in particular the ability to store and propagate metadata to downstream tools.

References

https://indico.cern.ch/event/1330797/contributions/5796587/

Significance

ServiceX has significantly expanded the range of transformations it can run in order to accommodate many more real-world workflows, and the client integration with external tools (in particular metadata handling) is new.

Experiment context, if any HL-LHC R&D

Authors

Artur Cordeiro Oudot Choi (University of Washington (US)) Benjamin Galewsky (Univ. Illinois at Urbana Champaign (US)) Gordon Watts (University of Washington (US)) Ilija Vukotic (University of Chicago (US)) Kyungeon Choi (University of Texas at Austin (US)) Peter Onyisi (University of Texas at Austin (US)) Roger Janusiak (University of Washington)

Presentation materials