Speaker
Luis Fernandez Alvarez
(CERN)
Description
The CERN HTCondor pool is currently offering 200K cores of compute power to hundreds of users in the HEP community. Managing such cluster requires a significant effort in the daily operations, not only because of the scale, but also because of the diversity of the resources. In this scenario, the adoption of automation and monitoring tools becomes a strong requirement to optimize both the resource usage and the operators time.
This talk presents different projects and prototypes that have been developed and integrated in our infrastructure to make these daily operations more efficient.
Speaker release | Yes |
---|
Primary author
Luis Fernandez Alvarez
(CERN)