31 March 2025 to 4 April 2025
Hotel De La Paix
Europe/Zurich timezone

Infrastructure Monitoring for GridKa and beyond

1 Apr 2025, 10:15
15m
Hotel De La Paix

Via Giuseppe Cattori 18 6900 Lugano Switzerland
Software and Services for Operation

Speaker

Evelina Buttitta (Karlsruhe Institute of Technology (KIT))

Description

The Infrastructure Monitoring service provides real-time monitoring and control of the servers and applications involved in operating the WLCG Tier1 center GridKa, including the online and tape storage, the batch system and the GridKa network.
Server metrics (CPU, memory, disk, network), storage I/O statistics and real-time sensor data from the server rooms, such as temperature, humidity and power consumption, are essential for a complete picture of the availability, performance and resource efficiency of the entire data center.

By integrating widely used open-source technologies, we have built a scalable solution that collects, stores and visualizes infrastructure data across the data center. In this presentation we describe the main components of our monitoring architecture and the technologies we use: Telegraf as the agent that collects metrics, InfluxDB as the time-series database that stores them, and Grafana as the tool to query and visualize the data. In addition, we operate a five-node cluster based on the OpenSearch search engine to collect logs from many sources.
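
As a rough illustration of the metrics path described above, the sketch below formats a single measurement in InfluxDB line protocol, the text format that a collection agent such as Telegraf can send to InfluxDB. It is not taken from the GridKa setup: the function name `to_line_protocol`, the measurement `cpu`, the tags `host` and `rack`, the field names and the timestamp are all hypothetical and chosen only for the example.

```python
def _escape(value: str, chars: str) -> str:
    """Backslash-escape every character from `chars` that occurs in `value`."""
    for ch in chars:
        value = value.replace(ch, "\\" + ch)
    return value


def _format_field(value) -> str:
    """Render a field value using line-protocol type conventions."""
    # bool must be tested before int because bool is a subclass of int.
    if isinstance(value, bool):
        return "true" if value else "false"
    if isinstance(value, int):
        return f"{value}i"          # integers carry an 'i' suffix
    if isinstance(value, float):
        return repr(value)          # floats are written as plain numbers
    # Everything else is written as a double-quoted string.
    escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def to_line_protocol(measurement, tags, fields, timestamp_ns):
    """Build one line: measurement,tag=value field=value timestamp."""
    if not fields:
        raise ValueError("at least one field is required")
    head = _escape(measurement, ", ")
    # Tags are sorted by key, which is the recommended ordering.
    for key in sorted(tags):
        head += f",{_escape(key, ',= ')}={_escape(tags[key], ',= ')}"
    body = ",".join(
        f"{_escape(k, ',= ')}={_format_field(v)}" for k, v in fields.items()
    )
    return f"{head} {body} {timestamp_ns}"


# Hypothetical sample point: CPU metrics for one server.
line = to_line_protocol(
    "cpu",
    {"rack": "r1", "host": "node01"},
    {"usage_idle": 87.5, "n_cpus": 16},
    1743495300000000000,
)
print(line)
```

In this layout, tags such as the host name are indexed and suited to filtering in dashboards, while fields carry the measured values; which attributes are stored as tags and which as fields is a design choice of the individual deployment.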

Desired slot length 15 minutes
Speaker release Yes

Author

Evelina Buttitta (Karlsruhe Institute of Technology (KIT))
