Mar 21 – 27, 2009
Europe/Prague timezone

The ATLAS TAGS Database distribution and management - Operational challenges of a multi-terabyte distributed database system

Mar 23, 2009, 8:00 AM


Prague Congress Centre, 5. května 65, 140 00 Prague 4, Czech Republic
Board: Monday 063
Poster, Distributed Processing and Analysis, Poster session


Florbela Viegas (CERN)


The TAG files store summary event quantities that allow a quick selection of interesting events. These data will be produced at a nominal rate of 200 Hz and uploaded into a relational database for access from websites and other tools. The estimated database volume is 6 TB per year, making it the largest application running on the ATLAS relational databases, both at CERN and at other voluntary sites. The sheer volume and high production rate make this application a challenge for data and resource management in many respects. This paper focuses on the operational challenges of this system. These include: uploading the data from files to the databases at CERN and at remote sites; distributing the TAG metadata that is essential to guide the user through event selection; and controlling the resource usage of the database, from the user query load to the strategy for cleaning and archiving old TAG data.
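As a rough illustration of how the two figures quoted in the abstract relate, the sketch below estimates the number of TAG records per year and the implied average database footprint per event. Only the 200 Hz rate and the 6 TB per year volume come from the abstract. The live data-taking time (`LIVE_SECONDS_PER_YEAR`) is a purely illustrative assumption, as are the function names and the choice of decimal terabytes, so the result is an order-of-magnitude guide rather than a figure from the authors.

```python
# Back-of-envelope sketch relating event rate and yearly database volume.
# Values marked ASSUMPTION are illustrative and do not come from the abstract.

EVENT_RATE_HZ = 200             # nominal TAG production rate (from the abstract)
VOLUME_BYTES_PER_YEAR = 6e12    # 6 TB per year (from the abstract); decimal TB is an ASSUMPTION
LIVE_SECONDS_PER_YEAR = 1e7     # ASSUMPTION: illustrative data-taking time per year


def events_per_year(rate_hz, live_seconds):
    """Number of TAG records produced in one year of data taking."""
    return rate_hz * live_seconds


def bytes_per_event(volume_bytes, rate_hz, live_seconds):
    """Average database footprint per event implied by the yearly volume."""
    return volume_bytes / events_per_year(rate_hz, live_seconds)


if __name__ == "__main__":
    n_events = events_per_year(EVENT_RATE_HZ, LIVE_SECONDS_PER_YEAR)
    per_event = bytes_per_event(
        VOLUME_BYTES_PER_YEAR, EVENT_RATE_HZ, LIVE_SECONDS_PER_YEAR
    )
    print(f"events per year: {n_events:.1e}")
    print(f"implied bytes per event: {per_event:.0f}")
```

Changing the assumed live time scales the per-event estimate inversely, which is why it is kept as an explicit parameter instead of being folded into the result.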


Proposal for a poster presentation illustrating the data flow of the TAGS data, focusing on the solutions tested and the challenges faced in its management.

Primary authors

David Malon (Argonne National Laboratory); Florbela Viegas (CERN); Jack Cranshaw (Argonne National Laboratory)


Andrew Wong (TRIUMF - Canada's National Laboratory for Particle and Nuclear Physics); Armin Nairz (CERN); Carlos Gamboa (Brookhaven National Laboratory); Elisabeth Vinek (Universitaet Wien); Elizabeth Gallas (University of Oxford); Gancho Dimitrov (Lawrence Berkeley National Laboratory); Luc Goosens (CERN); Marcin Nowak (Brookhaven National Laboratory)
