https://tinyurl.com/T1-GGUS-Open
https://tinyurl.com/T1-GGUS-Closed
https://lcgwww.gridpp.rl.ac.uk/utils/availchart/
https://cms-site-readiness.web.cern.ch/cms-site-readiness/SiteReadiness/HTML/SiteReadinessReport.html#T1_UK_RAL
http://hammercloud.cern.ch/hc/app/atlas/siteoverview/?site=RAL-LCG2&startTime=2020-01-29&endTime=2020-02-06&templateType=isGolden
Updated Echo allocations for FY 21/22
- https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=406939
- Looks like no issues; will update the ATLAS space JSON today.
GGUS-Ticket-ID: #151098 "IN PROGRESS" "NGI_UK" "High failure rate at RAL-LCG2_TEST"
* Possibly an interaction between Docker and the pilot causes some unexpected termination of Docker.
i.e. after Job 1, the pilot tries to remove any orphaned processes with a kill signal.
This might be killing 'something' that terminates the Docker job (HTCondor receives an ExitReason = "died on signal 9 (Killed)").
- If confirmed, ... ?
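As a way to reason about the "died on signal 9" symptom, here is a minimal Python sketch (POSIX only) of how a child process that receives SIGKILL reports back to its parent. It is not the pilot's or HTCondor's actual code: `describe_exit` and `run_and_kill` are hypothetical helper names, and the sleeping child is just a stand-in for a job payload.

```python
import signal
import subprocess
import sys


def describe_exit(returncode: int) -> str:
    """Turn a subprocess return code into a short human-readable reason."""
    if returncode < 0:
        # On POSIX, a negative return code -N means "terminated by signal N".
        sig = signal.Signals(-returncode)
        return f"died on signal {sig.value} ({sig.name})"
    return f"exited with status {returncode}"


def run_and_kill() -> int:
    """Start a long-running child, send it SIGKILL, and return its return code."""
    # Hypothetical stand-in for a job payload: a Python child that only sleeps.
    child = subprocess.Popen(
        [sys.executable, "-c", "import time; time.sleep(60)"]
    )
    child.kill()  # Popen.kill() sends SIGKILL on POSIX
    return child.wait()


if __name__ == "__main__":
    print(describe_exit(run_and_kill()))
```

If the hypothesis holds, a cleanup step that signals processes outside its own job would produce the same kind of exit status for the unrelated Docker job.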
Still running 3k cores; hoping to increase that number this week. Job failure rates are acceptable and efficiency is 40-60% this week.
SAM tests look better, with fewer 'missing' tests. ARC-CE01 had no test results for 24 hours after a reboot of that machine, but a second reboot seems to have fixed that.
Talked to James A during the meeting; he agreed to increase the number of CMS jobs running on the newest software (Dell19 tranche). That number may have dropped recently (since the end of February) because single-core jobs took over, whereas CMS only runs multicore jobs.