Speaker
Description
As the HL-LHC prepares to produce increasingly large volumes of data, the need for efficient data extraction and access services is growing. To address this challenge, the ServiceX toolset was developed to connect user-level analysis workflows to remotely stored datasets. ServiceX functions as a query-based sample delivery system, where client requests trigger Kubernetes-distributed workloads running at facilities with high-bandwidth connectivity to the WLCG. Additionally, ServiceX provides a simple user interface that leverages declarative syntax to define data extraction queries, enabled by a server-side architecture that includes code-generation and data-finder services. Modern analysis frameworks can use ServiceX as the first step in event selection, efficiently reducing file sizes and accelerating data access with minimal boilerplate. This talk presents the ServiceX toolset and demonstrates its use within a modern, full-scale ATLAS analysis pipeline from the IRIS-HEP Integration Challenge.