Gabriele and Bob facilitate
Monitoring doc:
https://docs.google.com/document/d/1lMji2dwLPkHPgtgYk5U5f0H500kOLeQswAd9SbBl1RU/edit#heading=h.n9pkgznjiiyp
Goal for planning session: come up with the general deployment strategy that will be implemented over the next few months and record it in JIRA
Metrics
Logs
Goal for working session: Prometheus/Grafana installed at least on one cluster standalone
Items to discuss:
Deployment strategy for metrics monitoring (standard helm charts? create our own?)
Deployment strategy for logs
Outside access (ingress, dns name, …)
Location for central services (i.e. Grafana and Prometheus aggregator)
Data aggregations from the clusters (Push Gateway, prometheus/thanos/cortex?)
Cluster level permission (what requires admin privileges and what doesn’t?)
Metrics (what do we want to monitor from each node?)
User level permissions (i.e. logins and permissions for Grafana)