EGEE'07

Name: EGEE'07
Start: 2007-10-01T08:00:00+02:00
End: 2007-10-05T18:00:00+02:00
Location: Europe Congress Center

1–5 Oct 2007

Europe Congress Center

Europe/Zurich timezone

Statistical analysis of the workload and modeling of the time a job spends at different states in the EGEE environment

Not scheduled

Europe Congress Center

Budapest Hungary

Poster Demo and Poster session

Prof. Emmanuel Varvarigos (University of Patras)

Describe the scientific/technical community and the scientific/technical activity using (planning to use) the EGEE infrastructure. A high-level description is needed (neither a detailed specialist report nor a list of references).

In this study we analyze the workload of LCG/EGEE production Grid and the time the job spends at different states. The probabilistic models obtained for the job submission process and the delay components introduced at different stages is important for modeling production Grids such as the EGEE.

Describe the added value of the Grid for the scientific/technical activity you (plan to) do on the Grid. This should include the scale of the activity and of the potential user community and the relevance for other scientific or business applications

We have considered the overall LCG/EGEE infrastructure as a single entity and observed the general properties of job submission and execution, by collecting and analyzing the daily reports of the Real Time Monitor tool. Based on the EGEE job flow diagram we distinguish four delay components of the job processing, each corresponding to the time spent at different states in the EGEE environment. We analyze and model each delay component separately and we also model the job arrival process. The existence of good probabilistic models is important for the improved understanding of the Grid Computing concept, the prediction of their performance, the design of the middleware they use, and the evaluation of new scheduling and quality of service policies. We also examine the efficiency of the EGEE environment and (indirectly) of the employed super-scheduling algorithm.

Report on the experience (or the proposed activity). It would be very important to mention key services which are essential for the success of your activity on the EGEE infrastructure.

Our results indicate that if we consider the LCG/EGEE Grid as the level of our observation, jobs are submitted without any specific weekly or daily patterns. The job interarrivals match very well with a rounded exponential distribution. We define and model four delay components that comprise the overall job processing. We observe that the total time of a job in the EGEE Grid is dominated by the CE’s Register and Queuing time (D3) and the WN’s Execution time (D4), while the other two components, namely, the time spent in the Pending, Submitted and Waiting states (D1), and the time in the Ready state (D2), are small.
We also evaluate the efficiency of the EGEE Grid and (indirectly) of the employed super-scheduling algorithm by comparing the total delay experienced by a job with that of a hypothetical ideal super-cluster. We found that we would have similar performance if we submitted the same workload to a super-cluster having 34% of the total average number of CPUs participating in EGEE

Mr Konstantinos Christodoulopoulos (University of Patras)

Prof. Emmanuel Varvarigos (University of Patras)

Poster

profilingPoster_v3.pdf

profilingPoster_v3.ppt

EGEE'07

Statistical analysis of the workload and modeling of the time a job spends at different states in the EGEE environment

Europe Congress Center

Speaker

Describe the scientific/technical community and the scientific/technical activity using (planning to use) the EGEE infrastructure. A high-level description is needed (neither a detailed specialist report nor a list of references).

Describe the added value of the Grid for the scientific/technical activity you (plan to) do on the Grid. This should include the scale of the activity and of the potential user community and the relevance for other scientific or business applications

Report on the experience (or the proposed activity). It would be very important to mention key services which are essential for the success of your activity on the EGEE infrastructure.

Primary author

Co-author

Presentation materials

Choose timezone

EGEE'07

Speaker

Describe the scientific/technical community and the scientific/technical activity using (planning to use) the EGEE infrastructure. A high-level description is needed (neither a detailed specialist report nor a list of references).

Describe the added value of the Grid for the scientific/technical activity you (plan to) do on the Grid. This should include the scale of the activity and of the potential user community and the relevance for other scientific or business applications

Report on the experience (or the proposed activity). It would be very important to mention key services which are essential for the success of your activity on the EGEE infrastructure.

Primary author

Co-author

Presentation materials

Share this page

Direct link

Social networks

Calendaring