● Outstanding tickets
- 150912 UKI-NORTHGRID-MAN-HEP less urgent assigned 2021-03-10 19:28:00 UKI-NORTHGRID-MAN-HEP: TRANSFER Transfer canceled because the gsiftp performance marker timeout
- No site update; JW to follow up with site
- 150896 UKI-LT2-QMUL very urgent assigned 2021-03-10 09:28:00 UKI-LT2-QMUL: Sudden appearance of dark data
- Dark data appeared at the site after deletions.
- Reported space values became nonsensical and did not change as deletions proceeded; a proper fix is needed, but a working fix is in place.
- Dark data to be removed, hopefully with a consistency check.
- Latest StoRM version should provide an automated ‘du’.
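- Note on the consistency check: a minimal sketch of the idea, assuming two plain lists of file paths are available, one dumped from the storage element and one from the catalogue. The function name and the example paths are hypothetical and are not taken from any site tooling. Files on storage but absent from the catalogue are treated as dark; files in the catalogue but absent from storage are candidates to declare lost.

```python
def compare_dumps(storage_paths, catalogue_paths):
    """Compare a storage dump with a catalogue dump.

    Returns (dark, lost) as sorted lists:
      dark = on storage but not in the catalogue
      lost = in the catalogue but not on storage
    Hypothetical helper for illustration only.
    """
    # Normalise entries and ignore blank lines from the dump files.
    on_disk = {p.strip() for p in storage_paths if p.strip()}
    in_catalogue = {p.strip() for p in catalogue_paths if p.strip()}

    dark = sorted(on_disk - in_catalogue)
    lost = sorted(in_catalogue - on_disk)
    return dark, lost


if __name__ == "__main__":
    # Made-up example paths, not real site data.
    storage = ["/atlas/data/file_a", "/atlas/data/file_b", "/atlas/data/file_x"]
    catalogue = ["/atlas/data/file_a", "/atlas/data/file_b", "/atlas/data/file_c"]
    dark, lost = compare_dumps(storage, catalogue)
    print("dark:", dark)
    print("lost:", lost)
```

A real check would also need to ignore files written or deleted between the two dumps, so that in-flight transfers are not flagged as dark.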
- 150820 UKI-LT2-RHUL less urgent waiting for reply 2021-03-11 09:33:00 UKI-LT2-RHUL: 0% Transfer and deletion efficiencies
- File list declared lost, some follow-up files to declare lost.
- Permission denied errors in transfers.
- 149362 UKI-SOUTHGRID-RALPP urgent in progress 2021-02-18 20:00:00 ATLAS CE failures on UKI-SOUTHGRID-RALPP-heplnx207
- Still no progress
- JW to attempt to move over to aCT
- 146651 RAL-LCG2 urgent on hold 2021-02-16 17:37:00 singularity and user NS setup at RAL
- 142329 UKI-SOUTHGRID-SUSX top priority on hold 2021-01-20 20:29:00 CentOS7 migration UKI-SOUTHGRID-SUSX
- 8pm yesterday - running pilot jobs.
- Possible update to provide memory limits
- Lancaster needed a runtime script to allow for minimum memory requirements
- Matt to provide instructions
● CPU
● Other new issues / tasks
- VAC: BHAM not running ATLAS jobs; site issues should have been resolved.
- JW to prod harvester support list if no active response
- Will ATLAS still want to support VAC?
- First understand why it is broken now, then decide what to do
- Implications for how BHAM may wish to run site.
● Ongoing Items
- CentOS7 - Sussex
- Should be near production readiness
- TPC with http
- No update; xrootd 5.1.1 is available
- Sam to update Glasgow TPC gateway.
- Storageless Site test / storage decommissioning (Oxford)
- RAL side complete; OX to finalise configuration, then ATLAS side.
- ECDF volatile storage
- Process on ATLAS side; appears that endpoint name changed, however?
- Glasgow DPM Decommissioning
- Sam to update Jira with downtime notification
- ATLAS: Site Availability/Reliability reports: Glasgow
- Moving forward - not yet resolved
● News round-table
- Dan
- Expect downtime for SE switch, before Easter
- Matt
- Peter
- Mentioned the datacenter fire in France (inside self-contained containers)
- Warnings that data loss can happen in the cloud …
- Sam
- JW
- TPC work sidelined for VectorRead support
- Patrick