Feb 11 – 14, 2008
<a href="http://www.polydome.org">Le Polydôme</a>, Clermont-Ferrand, FRANCE
Europe/Zurich timezone

Optimizing a Grid workflow for the EGEE infrastructure: The case of Wien2k

Feb 13, 2008, 2:00 PM
20m
Champagne (<a href="http://www.polydome.org">Le Polydôme</a>, Clermont-Ferrand, FRANCE)

Champagne

<a href="http://www.polydome.org">Le Polydôme</a>, Clermont-Ferrand, FRANCE

Oral Application Porting and Deployment Workflow and Parallelism

Speaker

Dr Maximilian Berger (University of Innsbruck)

Description

In real production grids the time between submitting a grid activity and its execution ranges from 10 to 60 minutes. When porting a complex workflow to the Grid, such as Wien2K, this overhead does not only appear once, but repeatedly, and increases the execution time of workflows largely. In our previous presentation we showed aggregation of grid activities. Since then we have also experimented with other means of reducing overhead through scheduling: Worker nodes may be scheduled without an active task, which they request through a pull model from a coordinator. These workers would be submitted once, and thus the overhead of scheduling them would only appear once. However, this mechanism is unfair towards other users, as it occupies resources for a longer period than one task. We will show results of our experiments, and classify them according to speed and fairness.

1. Short overview

Wien2K is a package for electronic structure calculation of crystals. After porting Wien2K to the gLite environment we discoverered a severe performance drawback: Scheduling in the real life EGEE infrastructure provides proved to take too long to manage a small workflows efficiently. We have already presented some optimization work through aggregation and will now present other optimization techniques.

Provide a set of generic keywords that define your contribution (e.g. Data Management, Workflows, High Energy Physics)

Workflows, Computational Chemistry, Wien2K, Application Porting

3. Impact

While similar research has been done for simple parameter studies, complex workflows have not yet been studied at this level of detail.
Although the optimization techniques shown here are applied to the Wien2K workflow in particular, they are generic enough to be applied to other complex Grid workflows. The lessons learned from porting the Wien2k application can provide a guidance for other future work: Porting other applications should become much easier, as the same patterns can be applied to other work.
The Wien2K application is currently used by thousands of scientists: Optimizing the Wien2K worfklow for the Grid would enable these scientists to use the Grid for their calculations, resulting in significantly lower simulation time.

4. Conclusions / Future plans

We plan to continue to optimize and improve our Grid version of Wien2K. During the EGEE III project we plan to work towards the user: Providing a user-friendly interface through the use of a web portal, and providing an "easy" downloadable package, without violating the original Wien2k license. We are also continuesly working on new ideas of workflow improvement.

URL for further information:

http://www.dps.uibk.ac.at/

Primary author

Dr Maximilian Berger (University of Innsbruck)

Co-author

Prof. Thomas Fahringer (University of Innsbruck)

Presentation materials