Speaker
Description
The Infrastructure Monitoring helps to control and monitor in real-time servers and applications involved in the operation of the WLCG Tier1 center GridKa, including the online and tape storages, the batch system and the GridKa network.
Monitoring data like server metrics (CPU, Memory, Disk, Network), storage operations (I/O Statistics) or visualizing real-time sensors data such as temperature, humidity, power consumption in server rooms are very important to provide a complete picture of availability, performance and resource efficiency of the entire data center.
Through the integration of open source and widely known technologies we have built a scalable solution able to collect, store and visualize infrastructure data across the data center. In this presentation we will talk about the main components of our monitoring architecture and the technologies we use. They include Telegraf as agent to collect metrics, InfluxDB as timeseries database to store data and Grafana as powerful visualization tool to query and visualize data. In addition, we operate a 5-nodes cluster based on OpenSearch search engine to collect logs from many sources.
Desired slot length | 15 minutes |
---|---|
Speaker release | Yes |