RAL Tier1 Experiments Liaison Meeting
→
Europe/London
Access Grid (RAL R89)
Access Grid
RAL R89
-
-
13:30
Experiment Operational Issues
-
1
ATLAS Operations ReportSpeakers: Brij Kishor Jashal (Rutherford appelton laboratory), Jyoti Prakash Biswal (Rutherford Appleton Laboratory)
-
2
CMS Operations ReportSpeaker: Katy Ellis (Science and Technology Facilities Council STFC (GB))
Errors yesterday are on CMS side.
A couple of warnings on squid -
- many database queries from the site have connected directly to the Frontier servers or backup proxies, with a high rate of queries not going through the local squid(s): T1_UK_RAL\tcmscernct\t255759\t3922096
- No IPv6 access
A small spike of job failures yesterday. Otherwise good performance.
-
3
LHCb Operations ReportSpeaker: Alexander Rogovskiy (Rutherford Appleton Laboratory)
- Raw data distribution starts very soon (planned for today or tomorrow)
- 1PB free on antares LHCb allocation (which is 2024-25 FY), so need update it soonish.
- ECHO problems (faulty disk on the 30th of April causing some slow IOPs + slow IOPs for uknown reason on the 7th of May) caused file loss and corruption (GGUS 683184)
- Search for corrupted files is finished
- A handful of files found, the ones that have other replicase have already been re-replicated to RAL
- That (hopefully) concludes the incident
- Search for corrupted files is finished
- There were a lot of upload failures from HLTFarm and NIPNE to RAL
- Not our fault
- Network issues at NIPNE and HLTFarm connectivity problems
- Not our fault
CVMFS (sorry, misplaced):
- There is a proposal to make RAL Stratum-1 an official mirror of the EESSI repository. Any thoughts on that?
- squid0[56] machines were added to the
cvmfs-squid
alias. That caused some problems for ATLAS and CMS- PTR records were added to reverse DNS zone as well, e.g.
$ host 130.246.183.211
That's most probably the cause of the issues. As of this morning, the DNS change (full, e.g. for both Forward and Reverse zones) is rolled-back.
211.183.246.130.in-addr.arpa domain name pointer cvmfs-squid.gridpp.rl.ac.uk.
211.183.246.130.in-addr.arpa domain name pointer squid06.gridpp.rl.ac.uk.
211.183.246.130.in-addr.arpa domain name pointer cms-squid.gridpp.rl.ac.uk.
211.183.246.130.in-addr.arpa domain name pointer atlas-squid.gridpp.rl.ac.uk.
- PTR records were added to reverse DNS zone as well, e.g.
- Raw data distribution starts very soon (planned for today or tomorrow)
-
4
ALICE Operations ReportSpeaker: Alexander Rogovskiy (Rutherford Appleton Laboratory)
-
5
LSST Operations ReportSpeakers: Mathew Sims, Timothy Noble (Science and Technology Facilities Council STFC (GB))
-
14:00
Tier-1 Projects
-
6
Anatares Upgrade
New EOS nodes
Repack ProgressSpeakers: George Patargias, Thomas Byrne -
7
Echo Upgrade
Dell 24 Storage Deployment
New gateways
SSD StorageSpeakers: Robert Appleyard, Thomas Byrne -
8
Varnish For ATLASSpeaker: Brij Kishor Jashal (Rutherford appelton laboratory)
-
9
XRootD DevelopmentSpeakers: Alexander Rogovskiy (Rutherford Appleton Laboratory), Jyothish Thomas (STFC)
-
10
Utilizing GPUsSpeakers: Jyoti Prakash Biswal (Rutherford Appleton Laboratory), Thomas Birkett
-
14:45
AOB
-
11
Summary of Operational Status and IssuesSpeakers: Brian Davies (Lancaster University (GB)), Darren Moore
-
12
Any other BusinessSpeakers: Brian Davies (Lancaster University (GB)), Darren Moore
-
13:30