1–3 Mar 2006
CERN
Europe/Zurich timezone

Benefits of the MAGIC Grid

1 Mar 2006, 14:00
30m
40/5-A01 (CERN)

40/5-A01

CERN

45
Show room on map
Oral contribution Astroparticle physics - Fusion - High-Energy physics 1b: Astrophysics/Astroparticle physics - Fusion - High-Energy physics

Speaker

Dr Harald Kornmayer (FORSCHUNGSZENTRUM KARLSRUHE (FZK))

Description

Application context and scientific goals ======================================== The field of gamma-ray observations in the energy range between 10 GeV and 10 TeV developed fast over the last decade. From the first observation of TeV gamma rays from the Crab nebula using the atmospheric Cerenkov imaging technique in 1989 [1] to the discovery of new gamma ray sources with the new generation telescopes like the HESS observation of a high-energy particle acceleration in the shell of a supernova remnant [2], a new observation window to the universe was opened. In the future other ground based VHE $\gamma$-ray observatories (namely MAGIC [3], VERITAS [4] and KANGAROO [5]) will significantly contribute to the exploitation of this new observation window. With the new generation Cerenkov telescopes the requirements for the analysis and Monte Carlo production computing infrastructure will increase due to a higher number of camera pixels, faster FADC systems and a bigger mirror size. In the future the impact of VHE gamma-ray astronomy will increase by joined observations of different Cerenkov telescopes. In 2003 the national Grid centers in Italy (CNAF), Spain (PIC) and Germany (GridKA) started together with the MAGIC collaboration an effort to build a distributed computing system for Monte Carlo generation and analysis on top of existing Grid infrastructure. The MAGIC telescope was chosen due to the following reasons: o The MAGIC collaboration is international, with most partners from Europe o main partners of the MAGIC telescope are located close to the national Grid centers o The generation of Monte Carlo data is very compute intensive, specially to get enough statistics in the low energy range. o The analysis of the fast increasing real data samples will be done in different institutes. The collaborators need a seamless access to the data while reducing the number of replicas to a minimum. o The MAGIC collaboration will build a second telescope in 2007 resulting in a doubled data rate. The idea of the MAGIC Grid [6] was presented to the EGEE Generic Application Advisory Panel (EGAAP). In June 2004 EGEE accepted the generation of Monte Carlo data for the MAGIC telescope as one of the generic applications of the project. Grid added value ================ By implementing the MAGIC Grid over the last two years, the MAGIC collaboration benefit in many aspects. These aspects are described in this chapter. o Collaboration of different institutes By combining the resources of the MAGIC collaborators and the reliable resources from the national Grid centers the MAGIC collaborators will be empowered to use their computing infrastructure more efficiently. The time to analyse the big amount of data to solve specific scientific problems will be shortend. o Cost reduction By using the EGEE infrastructure and the EGEE services the effort for MAGIC collaboration to build a distributed computing system for the Monte Carlo simulations was significantly reduced. o Speedup of Monte Carlo production As the MAGIC Monte Carlo System was build on top of the EGEE middleware the integration of new computing resources is very easy. By getting support from many different EGEE resource providers the production rate for the Monte Carlos can be increased very easily. o Persistent storage of observation data The MAGIC telescope will produce a lot of data in the future. These data are currently stored on local resources including disk systems and tape libraries. The MAGIC collaboration recognized that this effort is not negligible especially concerning man power. Therefore the observation data will be stored by the spanish Grid center PIC. o Data availability improvements By importing the observation data to the Grid, the MAGIC collaboration expect that the availablitly of data will be increased with the help of Grid data management methods like data replication, etc. As the main data services will be provided in the future by the national Grid centers instead of research university groups at universities, the overall data availablitly is expected to increase. o Cost reduction By using the EGEE infrastructure and the EGEE services the effort for MAGIC collaboration to build a distributed computing system for the Monte Carlo simulations was significantly reduced. Experiences with the EGEE infrastructure ======================================== The experiences of the developers during the different phases of the realisation of the MAGIC Monte Carlo production system on the EGEE Grid infrastructure are described in this chapter. As the MAGIC virtual organisation was accepted as one of the first generic EGEE application, the development process was influenced by general developments within the EGEE project too like changed in the middleware versions, etc. o Prototype implementation -------------------------- The migration of the compute intensive MMCS program from a local batch system to the Grid was done by the definition of a template JDL form. This template sends all needed input data together with the executable to the Grid. The resources are chosen by the resource broker. The automatic registration of the output file as a logical file on the Grid was not very reliable at the beginning, but improved to production level within the EGEE project duration. o Production MAGIC Grid system ------------------------------ The submission of many production system needed the implementation of a graphical user interface and a database for metadata. The graphical user interface was realised with the JAVA programming language. The execution of the LCG/gLite commands is wrapped in JAVA shell commands. A MySQL database holds the schema for the metadata. As mentioned above the "copy and register" process for the output file was not realiable enough an additional job status "DONE (data available)" was invented. With the help of the database, jobs that did not reach this job status within two days are resubmitted. The job data are keeped in a seperate database table to analyse them later. o Reliability of EGEE services ------------------------------ The general services like resource brokers, VO management tools and Grid user support was provided by the EGEE resources providers. The MAGIC Grid is setup on top of this services. A short report of the experiences with this production services will be given. Key issues for the future of Grid technology ============================================ The MAGIC collaboration is currently evaluating the EGEE Grid infrastructure as the backbone for a distributed computing system in the future including the data storage on Grid data centers like PIC. Furthermore the discussion with other projects like the HESS collaboration has started to move towards "Virtual Very High energetic Gamma ray observatory" [7]. The problems and challenges that needs to be solved on the track to a sustainable Grid infrastructure will be discussed from the user perspective References: [1] T. Weekes et al., The Astrophysical Journal, volume 342 (1989), p. 379 [2] F. A. Aharonian et al., Nature 432, 75 - 77 (04 November 2004) [3] E. Lorenz, 1995, Fruehsjahrtagung Deutsche Physikalische Gesellschaft, March 9-10 [4] T. Weekes et al., Astropart. Phys., 17, 221-243 (2002) [5] Enomoto, R. et al., Astropart. Phys. 16, 235-244 (2002) [6] H. Kornmayer et al., "A distributed, Grid-based analysis system for the MAGIC telescope", Proceedings of the CHEP Conference , Interlaken, Switzerland, 2004 [7] H. Kornmayer et al., "Towards a virtual observatory for high energetic gamma rays", Cherenkov 2005, Paris, 2005

Author

Dr Harald Kornmayer (FORSCHUNGSZENTRUM KARLSRUHE (FZK))

Co-authors

Presentation materials