21–25 May 2012
New York City, NY, USA
US/Eastern timezone

File and Dataset Metadata Collection and Use in Atlas

24 May 2012, 13:30
4h 45m
Rosenthal Pavilion (10th floor) (Kimmel Center)

Rosenthal Pavilion (10th floor)

Kimmel Center

Poster Software Engineering, Data Stores and Databases (track 5) Poster Session

Speaker

Elizabeth Gallas (University of Oxford (GB))

Description

The ATLAS Metadata Interface (“AMI”) was designed as a generic cataloguing system, and as such it has found many uses in the experiment including software release management, tracking of reconstructed event sizes and control of dataset nomenclature. In this paper we will discuss the primary use of AMI which is to provide a catalogue of datasets (file collections) which is searchable using physics criteria. The AMI dataset catalogues are filled from several sources: - The Tier 0 database for raw data and first pass reconstruction. - The Production System database for Monte Carlo and reprocessed data. - The Distributed Data Management system. - Direct input from the physicist community. We will summarize the information taken from each source, and discuss the different mechanisms used to obtain it. By correlating information from different sources we can derive aggregate information which is important for physics analysis; for example the total number of events contained in dataset, and possible reasons for missing events such as a lost file. Finally we will describe some specialized interfaces which were developed for the Data Preparation and reprocessing coordinators. These interfaces manipulate information from both the dataset domain held in AMI, and the run-indexed information held in the ATLAS COMA application (Conditions and Configuration Metadata).

Primary author

Co-authors

Elizabeth Gallas (University of Oxford (GB)) Fabian Lambert (Universite Joseph Fourier (FR)) Jerome Fulachier (Universite Joseph Fourier (FR)) Dr Solveig Albrand (Universite Joseph Fourier (FR))

Presentation materials