Speaker
Dr
Jamie Shiers
(CERN)
Description
This talk summarises the main lessons learnt from deploying WLCG production services,
with a focus on Reliability, Scalability, Accountability, which lead to both
manageability and usability.
Each topic is analysed in turn. Techniques for zero-user-visible downtime for the
main service interventions are described, together with pathological cases that need
special treatment. The requirements in terms of scalability are analysed, calling for
as much robustness and automation in the service as possible. The different aspects
of accountability - which covers measuring / tracking / logging / monitoring what is
going on – and has gone on - is examined, with the goal of attaining a manageable
service.
Finally, a simple analogy is drawn with the Web in terms of usability - what do we
need to achieve to cross the chasm from small-scale adoption to ubiquity?
Summary
A summary of the key lessons learnt from deploying WLCG production services, with a
focus on Reliability, Scalability, Accountability, which lead to both manageability
and usability.
Submitted on behalf of Collaboration (ex, BaBar, ATLAS) | WLCG |
---|
Primary author
Dr
Jamie Shiers
(CERN)