21–25 May 2012
New York City, NY, USA
US/Eastern timezone

Controlled overflowing of data-intensive jobs from oversubscribed sites

22 May 2012, 13:30
4h 45m
Rosenthal Pavilion (10th floor) (Kimmel Center)

Rosenthal Pavilion (10th floor)

Kimmel Center

Poster Distributed Processing and Analysis on Grids and Clouds (track 3) Poster Session

Speaker

Mr Igor Sfiligoi (University of California San Diego)

Description

The CMS analysis computing model was always relying on jobs running near the data, with data allocation between CMS compute centers organized at management level, based on expected needs of the CMS community. While this model provided high CPU utilization during job run times, there were times when a large fraction of CPUs at certain sites were sitting idle due to lack of demand, all while Terabytes of data were never accessed. To improve the utilization of both CPU and disks, CMS is moving toward controlled overflowing of jobs from sites that have data but are oversubscribed to others with spare CPU and network capacity, with those jobs accessing the data through real time xrootd streaming over WAN. The major limiting factor for remote data access is the ability of the source storage system to serve such data, so the number of jobs accessing it must be carefully controlled. The CMS approach to this is to implement the overflowing by means of glideinWMS, a Condor based pilot system, and by providing the WMS with the known storage limits and let it schedule jobs within those limits. This talk presents the detailed architecture of the overflow-enabled glideinWMS system, together with operational experience of the past 6 months.

Primary authors

Alja Mrak Tadel (Univ. of California San Diego (US)) Dr Brian Bockelman (University of Nebraska) Mr Daniel Bradley (University of Wisconsin - Madison) Frank Wuerthwein (Univ. of California San Diego (US)) Mr Igor Sfiligoi (University of California San Diego) James Letts (Univ. of California San Diego (US)) Kenneth Bloom (University of Nebraska (US)) Matevz Tadel (Univ. of California San Diego (US))

Presentation materials