11–15 Mar 2024
Charles B. Wang Center, Stony Brook University
US/Eastern timezone

A Function-As-Task Workflow Management Approach with PanDA and iDDS

11 Mar 2024, 14:50
20m
Theatre ( Charles B. Wang Center, Stony Brook University )

Theatre

Charles B. Wang Center, Stony Brook University

100 Circle Rd, Stony Brook, NY 11794
Oral Track 1: Computing Technology for Physics Research Track 1: Computing Technology for Physics Research

Speaker

Wen Guan (Brookhaven National Laboratory (US))

Description

The growing complexity of high energy physics analysis often involves running a large number of different tools. This demands a multi-step data processing approach, with each step requiring different resources and carrying dependencies on preceding steps. It’s important and useful to have a tool to automate these diverse steps efficiently.
With the Production and Distributed Analysis (PanDA) system and the intelligent Data Delivery Service (iDDS), we provide a platform for coordinating sequences of tasks with a workflow, orchestrating the seamless execution of tasks in a specified order and under predefined conditions, in order to automate the task sequence. In this presentation, we will present our efforts, beginning with an overview of the platform's architecture. We'll then describe a user-friendly interface with workflows described in python and tasks described by python functions. Next, we detail the flow to transform python functions into tasks and schedule tasks to distributed heterogeneous resources, coupled with a messaging-based asynchronous result-processing mechanism. Finally, we'll showcase a practical example illustrating how this platform effectively converts a machine learning hyperparameter optimization processing on an ATLAS ttH analysis to a distributed workflow.

Primary authors

Christian Weber (Brookhaven National Laboratory (US)) Dr Edward Karavakis (Brookhaven National Laboratory (US)) Fa-Hui Lin (University of Texas at Arlington (US)) Fernando Harald Barreiro Megino (University of Texas at Arlington) Kaushik De (University of Texas at Arlington (US)) Paul Nilsson (Brookhaven National Laboratory (US)) Rui Zhang (University of Wisconsin Madison (US)) Tadashi Maeno (Brookhaven National Laboratory (US)) Torre Wenaus (Brookhaven National Laboratory (US)) Wen Guan (Brookhaven National Laboratory (US)) Xin Zhao (Brookhaven National Laboratory (US)) Zhaoyu Yang (Brookhaven National Laboratory (US))

Presentation materials