The ATLAS TAGS Database distribution and management - Operational challenges of a multi-terabyte distributed database system
Presented by Florbela VIEGAS on 23 Mar 2009 from 08:00 to 08:20
Session: Poster session
Track: Distributed Processing and Analysis
Board #: Monday 063
The TAG files store summary event quantities that allow a quick selection of interesting events. This data will be produced at a nominal rate of 200 Hz, and is uploaded into a relational database for access from websites and other tools. The estimated database volume is 6TB per year, making it the largest application running on the ATLAS relational databases, at CERN and at other voluntary sites. The sheer volume and high rate of production makes this application a challenge to data and resource management, on many aspects. This paper will focus on the operational challenges of this system. These include: uploading the data from files to the CERN's and remote sites' databases; distributing the TAG metadata that is essential to guide the user through event selection; controlling resource usage of the database, from the user query load to the strategy of cleaning and archiving of old TAG data.
Proposal for a poster presentation illustrating the data flow of the TAGS data, making a focus on the solutions tested and challenges faced in its management.
Address: Prague Congress Centre 5. května 65, 140 00 Prague 4, Czech Republic
- Florbela VIEGAS CERN
- David MALON Argonne National Laboratory
- Jack CRANSHAW Argonne National Laboratory
- Gancho DIMITROV Lawrence Berkeley National Laboratory
- Marcin NOWAK Brookhaven National Laboratory
- Armin NAIRZ CERN
- Luc GOOSENS CERN