CHEP 2018 Conference, Sofia, Bulgaria

Name: CHEP 2018 Conference, Sofia, Bulgaria
Start: 2018-07-09T08:00:00+03:00
End: 2018-07-13T13:00:00+03:00
Location: Sofia, Bulgaria

9–13 Jul 2018

Europe/Sofia timezone

Improving ATLAS computing resource utilization with HammerCloud

10 Jul 2018, 16:00

National Culture Palace, Boulevard "Bulgaria", 1463 NDK, Sofia, Bulgaria

Poster Track 3 – Distributed computing Posters

Jaroslava Schovancova (CERN)

HammerCloud is a framework to commission, test, and benchmark ATLAS computing resources and components of various distributed systems with realistic full-chain experiment workflows. HammerCloud contributes to ATLAS Distributed Computing (ADC) Operations and automation efforts by providing automated resource exclusion and recovery tools. These tools help refocus operational manpower on areas that have yet to be automated and improve the utilization of available computing resources.

We present the recent evolution of the auto-exclusion/recovery tools: faster inclusion of new resources in the testing machinery, machine learning algorithms for anomaly detection, categorization of resources as master vs. slave for the purpose of blacklisting, and a tool for auto-exclusion/recovery of resources triggered by Event Service job failures, which is being extended to workflows beyond the Event Service.

We describe how HammerCloud helped commission various concepts and components of distributed systems: simplified configuration of queues for workflows of different activities (unified queues), components of the Pilot (new movers), components of AGIS (controller), and the distributed data management system (protocols, direct data access, ObjectStore tests).

We summarize updates that brought HammerCloud up to date with developments in ADC and improved its flexibility to adapt to new activities and workflows, so that it can respond to the evolving needs of the ADC Operations team in a timely manner.

Jaroslava Schovancova (CERN); Felix Buhrer (Albert Ludwigs Universitaet Freiburg (DE)); Jose Caballero Bejar (Brookhaven National Laboratory (US)); Guenter Duckeck (Ludwig Maximilians Universitat (DE)); Aristeidis Fkiaras (Athens University of Economics and Business (GR)); Federica Legger (Ludwig Maximilians Universitat (DE)); Thomas Maier (Ludwig Maximilians Universitat (DE)); Valentina Mancinelli; Gianfranco Sciacca; Antonio Yusta Espla (Ludwig-Maximilians-Univ. Muenchen (DE))

Improving ATLAS computing resource utilization with HammerCloud (ATL-SOFT-SLIDE-2018-392)
