25th International Conference on Computing in High Energy & Nuclear Physics

Name: 25th International Conference on Computing in High Energy & Nuclear Physics
Start: 2021-05-17T14:50:00+02:00
End: 2021-05-21T18:10:00+02:00
Location: No location set

17–21 May 2021

Europe/Paris timezone

Contact us

Exploitation of network-segregated CPU resources in CMS

20 May 2021, 15:13

13m

Short Talk Distributed Computing, Data Management and Facilities Facilities and Networks

Antonio Delgado Peris (Centro de Investigaciones Energéti cas Medioambientales y Tecno)

CMS is tackling the exploitation of CPU resources at HPC centers where compute nodes do not have network connectivity to the Internet. Pilot agents and payload jobs need to interact with external services from the compute nodes: access to the application software (cmvfs) and conditions data (Frontier), management of input and output data files (data management services), and job management (HTCondor). Finding an alternative route to these services is challenging. Seamless integration in the CMS production system without causing any operational overhead is a key goal.

The case of the Barcelona Supercomputing Center (BSC), in Spain, is particularly challenging, due to its especially restrictive network setup. We describe in this paper the solutions developed within CMS to overcome these restrictions, and integrate this resource in production. Singularity containers with application software releases are built and pre-placed in the HPC facility shared file system, together with conditions data files. HTCondor has been extended to relay communications between running pilot jobs and HTCondor daemons through the HPC shared file system. This operation mode also allows piping input and output data files through the HPC file system.

Results, issues encountered during the integration process, and remaining concerns are discussed.

Antonio Delgado Peris (Centro de Investigaciones Energéti cas Medioambientales y Tecno) Antonio Perez-Calero Yzquierdo (Centro de Investigaciones Energéti cas Medioambientales y Tecno) Carlos Acosta Silva (IFAE and PIC) Jaime Frey (University of Wisconsin Madison (US)) Dr Jose Flix (Centro de Investigaciones Energéti cas Medioambientales y Tecno) Dr José M. Hernández (CIEMAT) Todd Tannenbaum (University of Wisconsin Madison (US))

BSC_exploitation_vCHEP2021_slides.pdf

Recording

BSC_exploitation_vCHEP2021_v5.pdf

25th International Conference on Computing in High Energy & Nuclear Physics

Contact us

Exploitation of network-segregated CPU resources in CMS

Speaker

Description

Authors

Presentation materials

Proceedings

Paper

Choose timezone

25th International Conference on Computing in High Energy & Nuclear Physics

Contact us

Speaker

Description

Authors

Presentation materials

Proceedings

Paper