10–14 Oct 2016
San Francisco Marriott Marquis
America/Los_Angeles timezone

Computing shifts to monitor ATLAS distributed computing infrastructure and operations

13 Oct 2016, 15:30
1h 15m
San Francisco Marriott Marquis

San Francisco Marriott Marquis

Poster Track 7: Middleware, Monitoring and Accounting Posters B / Break

Speaker

Michal Svatos (Acad. of Sciences of the Czech Rep. (CZ))

Description

The ATLAS Distributed Computing (ADC) group established a new Computing Run Coordinator (CRC)
shift at the start of LHC Run2 in 2015. The main goal was to rely on a person with a good overview
of the ADC activities to ease the ADC experts' workload. The CRC shifter keeps track of ADC tasks
related to their fields of expertise and responsibility. At the same time, the shifter maintains a
global view of the day-to-day operations of the ADC system. During Run1, this task was accomplished
by the ADC Manager on Duty (AMOD), a position that was removed during the shutdown period due
to the reduced number and availability of ADC experts foreseen for Run2. The CRC position was proposed
to cover some of the AMOD‚s former functions, while allowing more people involved in computing
to participate. In this way, CRC shifters help train future ADC experts.
The CRC shifters coordinate daily ADC shift operations, including tracking open issues, reporting,
and representing ADC in relevant meetings. The CRC also facilitates communication between
the ADC experts team and the other ADC shifters. These include the Distributed Analysis Support Team (DAST),
which is the first point of contact for addressing all distributed analysis questions, and
the ATLAS Distributed Computing Shifters (ADCoS), which check and report problems in central services,
Tier-0 export, data transfers and production tasks. Finally, the CRC looks at the level of ADC
activities on a weekly or monthly timescale to ensure that ADC resources are used efficiently.

Primary Keyword (Mandatory) Monitoring
Secondary Keyword (Optional) Distributed workload management
Tertiary Keyword (Optional) Distributed data handling

Primary author

Sabine Crepe-Renaudin (Laboratoire de Physique Subatomique et de Cosmologie de Grenoble (FR))

Co-author

Michal Svatos (Acad. of Sciences of the Czech Rep. (CZ))

Presentation materials