CHEP 2016 Conference, San Francisco, October 8-14, 2016

Name: CHEP 2016 Conference, San Francisco, October 8-14, 2016
Start: 2016-10-10T08:00:00-07:00
End: 2016-10-14T18:00:00-07:00
Location: San Francisco Marriott Marquis

10–14 Oct 2016

San Francisco Marriott Marquis

America/Los_Angeles timezone

Effective HTCondor-based monitoring system for CMS

11 Oct 2016, 15:30

1h 15m

San Francisco Marriott Marquis

Poster Track 7: Middleware, Monitoring and Accounting Posters A / Break

Jean-Roch Vlimant (California Institute of Technology (US))

The CMS experiment at LHC relies on HTCondor and glideinWMS as its primary batch and pilot-based Grid provisioning systems. Given the scale of the global queue in CMS, the operators found it increasingly difficult to monitor the pool to find problems and fix them. The operators had to rely on several different web pages, with several different levels of information, and sifting tirelessly through log files in order to monitor the pool completely. Therefore, coming up with a suitable monitoring system was one of the crucial items before the beginning of the LHC Run 2 to ensure early detection of issues and to give a good overview of the whole pool. Our new monitoring page utilizes the HTCondor ClassAd information to provide a complete picture of the whole submission infrastructure in CMS. The monitoring page includes useful information from HTCondor schedulers, central managers, the glideinWMS frontend, and factories. It also incorporates information about users and tasks making it easy for operators to provide support and debug issues.

Primary Keyword (Mandatory)	Monitoring

Brian Paul Bockelman (University of Nebraska (US)) Justas Balcas (California Institute of Technology (US))

Antonio Perez-Calero Yzquierdo (Centro de Investigaciones Energ. Medioambientales y Tecn. - (ES) David Alexander Mason (Fermi National Accelerator Lab. (US)) Farrukh Aftab Khan (National Centre for Physics (PK)) Jadir Marra Da Silva (UNESP - Universidade Estadual Paulista (BR)) James Letts (Univ. of California San Diego (US)) Jean-Roch Vlimant (California Institute of Technology (US)) Jose Hernandez (CIEMAT) Marco Mascheroni (Fermi National Accelerator Lab. (US))

Highlights-343.pdf

Poster-343.pdf

CHEP 2016 Conference, San Francisco, October 8-14, 2016

Effective HTCondor-based monitoring system for CMS

San Francisco Marriott Marquis

Speaker

Description

Authors

Co-authors

Presentation materials