21-25 May 2012
New York City, NY, USA
US/Eastern timezone

Evolution of ATLAS PanDA System

22 May 2012, 13:30
4h 45m
Rosenthal Pavilion (10th floor) (Kimmel Center)

Rosenthal Pavilion (10th floor)

Kimmel Center

Poster Distributed Processing and Analysis on Grids and Clouds (track 3) Poster Session

Speaker

Tadashi Maeno (Brookhaven National Laboratory (US))

Description

The PanDA Production and Distributed Analysis System plays a key role in the ATLAS distributed computing infrastructure. PanDA is the ATLAS workload management system for processing all Monte-Carlo simulation and data reprocessing jobs in addition to user and group analysis jobs. The system processes more than 5 million jobs in total per week, and more than 1400 users have submitted analysis jobs in 2011 through PanDA. PanDA has performed well with high reliability and robustness during the two years of LHC data-taking, while being actively evolved to meet the rapidly changing requirements for analysis use cases. We will present an overview of system evolution including PanDA's roles in data flow, automatic rebrokerage and reattempt for analysis jobs, adaptation for the CERNVM File System, support for the 'multi-cloud' model through which Tier 2s act as members of multiple clouds, pledged resource management, monitoring improvements, and so on. We will also describe results from the analysis of two years of PanDA usage statistics, current issues, and plans for the future.

Primary authors

Dr Alden Stradling (University of Texas at Arlington (US)) Collaboration Atlas (Atlas)

Co-authors

Kaushik De (University of Texas at Arlington (US)) Paul Nilsson (University of Texas at Arlington (US)) Dr Rodney Walker (Ludwig-Maximilians-Univ. Muenchen (DE)) Sergey Panitkin (Brookhaven National Laboratory (US)) Tadashi Maeno (Brookhaven National Laboratory (US)) Dr Torre Wenaus (Brookhaven National Laboratory (US)) Dr Valeri Fayn (Brookhaven National Laboratory (US))

Presentation Materials