- Over the weekend: issue related to the expiration of the CERN CA intermediate certificate
- Variety of different problems surfaced.
- Manual interventions in ATLAS to bring most services back (permanent fixes done this week)
- Observed a number of job failures (piloterrorcode 1368) with timeouts when setting up ATLAS software via CVMFS:
- Errors appeared to stop in the early morning of the 25th (?)
- Today's issue:
- Harvester configuration issue: jobs were being submitted via HTCondor 10 to ARC-CEs using gridftp, which cannot work, resulting in pilot faults:
- Now fixed; hopefully this was the source of the reduced number of jobs running at RAL
FTS: currently have a backlog of 154k transfers from RAL. Likely related to the CERN certificate issues and a number of CERN FTS hosts that stopped working.