
21–25 May 2012
New York City, NY, USA
US/Eastern timezone

Integrating PROOF Analysis in Cloud and Batch Clusters

22 May 2012, 13:30
4h 45m
Rosenthal Pavilion (10th floor) (Kimmel Center)


Poster
Distributed Processing and Analysis on Grids and Clouds (track 3)
Poster Session

Speaker

Dr Ana Y. Rodríguez-Marrero (Instituto de Física de Cantabria (UC-CSIC))

Description

High Energy Physics (HEP) analyses are becoming more complex and demanding due to the large amount of data collected by the current experiments. The Parallel ROOT Facility (PROOF) provides researchers with an interactive tool to speed up the analysis of huge volumes of data by exploiting parallel processing on both multicore machines and computing clusters. The typical PROOF deployment scenario is a permanent set of cores configured to run the PROOF daemons. However, this static approach cannot adapt to the dynamic nature of interactive usage. Several initiatives, such as PoD (PROOF on Demand) or PROOF Cluster, seek to improve the use of computing resources by integrating PROOF with a batch system. These solutions are currently in production at Universidad de Oviedo and IFCA and have been positively evaluated by users. Although they are able to adapt to the computing needs of users, they must comply with the specific configuration, OS and software installed on the batch nodes. Furthermore, they share the machines with other workloads, which may disrupt the interactive service for users. These limitations make PROOF a typical use case for cloud computing. In this work we take advantage of the cloud infrastructure at IFCA to provide a dynamic PROOF environment where users can control the software configuration of the machines. The PROOF Analysis Framework (PAF) facilitates the development of new analyses and offers transparent access to PROOF resources. Several performance measurements are presented for the different scenarios (PoD, SGE and Cloud), showing a speed improvement closely correlated with the number of cores used.
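
As an illustration of how a claim such as "speed improvement closely correlated with the number of cores" can be quantified, the following minimal Python sketch computes the speedup and parallel efficiency from wall-clock times and the Pearson correlation between core count and speedup. The function names (`speedup_table`, `pearson`) are hypothetical, and the timing values are placeholders chosen for illustration only; they are not measurements from this work.

```python
from math import sqrt


def speedup_table(wall_times):
    """Return (cores, speedup, efficiency) rows from a {cores: seconds} mapping.

    The run with the fewest cores is taken as the baseline.
    """
    base_cores = min(wall_times)
    base_time = wall_times[base_cores]
    rows = []
    for cores in sorted(wall_times):
        speedup = base_time / wall_times[cores]
        # Efficiency is the achieved speedup divided by the ideal (linear) one.
        efficiency = speedup / (cores / base_cores)
        rows.append((cores, speedup, efficiency))
    return rows


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Placeholder timings in seconds (illustrative only, NOT results from the poster).
example_times = {1: 1200.0, 2: 620.0, 4: 330.0, 8: 180.0, 16: 105.0}

rows = speedup_table(example_times)
r = pearson([c for c, _, _ in rows], [s for _, s, _ in rows])

for cores, speedup, efficiency in rows:
    print(f"{cores:3d} cores: speedup {speedup:5.2f}, efficiency {efficiency:4.2f}")
print(f"Pearson r (cores vs. speedup): {r:.3f}")
```

A correlation close to 1 together with an efficiency that stays near 1 would indicate almost linear scaling; a falling efficiency at higher core counts would point to overheads such as I/O or worker startup.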

Primary author

Dr Ana Y. Rodríguez-Marrero (Instituto de Física de Cantabria (UC-CSIC))

Co-authors

Mr Alberto Cuesta-Noriega (Universidad de Oviedo)
Dr Enol Fernández-del-Castillo (Instituto de Física de Cantabria (UC-CSIC))
Dr Francisco Matorras-Weinig (Instituto de Física de Cantabria (UC-CSIC))
Dr Isidro González-Caballero (Universidad de Oviedo)
Dr Jesús Marco-de-Lucas (Instituto de Física de Cantabria (UC-CSIC))
Mr Álvaro López-García (Instituto de Física de Cantabria (UC-CSIC))
