CHEP 06

Name: CHEP 06
Start: 2006-02-13T08:00:00+01:00
End: 2006-02-17T19:30:00+01:00
Location: Tata Institute of Fundamental Research

13–17 Feb 2006

Tata Institute of Fundamental Research

Europe/Zurich timezone

Support

chep06@tifr.res.in

DIAL: Distributed Interactive Analysis of Large Datasets

13 Feb 2006, 14:00

20m

D406 (Tata Institute of Fundamental Research)

D406

Tata Institute of Fundamental Research

Homi Bhabha Road Mumbai 400005 India

oral presentation Distributed Data Analysis Distributed Data Analysis

David Adams (BNL)

DIAL is a generic framework for distributed analysis. The heart of the system is a scheduler (also called analysis service) that receives high-level processing requests expressed in terms of an input dataset and a transformation to act on that dataset. The scheduler splits the dataset, applies the transformation to each subdataset to produce a new subdataset, and then merges these to produce the overall output dataset which is made available to the caller. DIAL defines a job interface that makes it possible for schedulers to connect with a wide range of batch and grid workload management systems. It also provides command line, root, python and web clients for job submission that enable users to submit and monitor jobs in a uniform manner. Scaling to very large jobs can be handled with a scheduler that does partial splitting and submits each subjob to another scheduler. I will give the current status of DIAL and discuss its use in the context of the ATLAS experiment at the CERN LHC (Large Hadron Collider). There we are looking at submission to local batch systems, globus gatekeepers, EGEE/LCG workload management, ATLAS production, and PANDA. The latter is a U.S. ATLAS framework for data production and distributed analysis (thus the name) that may also use DIAL for its internal scheduling.

David Adams (BNL)

Chun Lik Tan (University of Birmingham) Karl Harrison (University of Cambridge)

Paper

paper

Paper

Slides

slides

CHEP 06

Support

DIAL: Distributed Interactive Analysis of Large Datasets

D406

Tata Institute of Fundamental Research

Speaker

Description

Author

Co-authors

Presentation materials