21–23 Jan 2019
Other Institutes
America/Detroit timezone

Discussion on SLATE Monitoring with ATLAS as a Driver

21 Jan 2019, 14:30
1h

University of Michigan Physics, 450 Church Street, Ann Arbor, MI

Description

Facilitators: Gabriele and Bob

Monitoring doc:
https://docs.google.com/document/d/1lMji2dwLPkHPgtgYk5U5f0H500kOLeQswAd9SbBl1RU/edit#heading=h.n9pkgznjiiyp
Goal for planning session: agree on the general deployment strategy to be implemented over the next few months and record it in JIRA, covering:
  - Metrics
  - Logs
Goal for working session: Prometheus/Grafana installed standalone on at least one cluster
Items to discuss:
  - Deployment strategy for metrics monitoring (use standard Helm charts or create our own?)
  - Deployment strategy for logs
  - Outside access (ingress, DNS name, …)
  - Location for central services (i.e. Grafana and the Prometheus aggregator)
  - Data aggregation from the clusters (Pushgateway, Prometheus/Thanos/Cortex?)
  - Cluster-level permissions (what requires admin privileges and what doesn’t?)
  - Metrics (what do we want to monitor on each node?)
  - User-level permissions (i.e. logins and permissions for Grafana)
