Review of weekly issues by experiment/VO
- LHCb
We have mostly smooth running for LHCb in the UK. Issues :
1. Various CVMFS errors at different sites. Followed up through GGUS tickets.
2. Interesting 3-day oscillation in running jobs at RAL (Tier-1). Trying to understand is origins and implications.
- CMS
- ATLAS
UKI-LT2-IC-HEP: Long standing problem with missing release needs some dedicated testing due to the different setup IC has. Waiting on AdS to supply some code.
UKI-SOUTHGRID-OX-HEP: problems with FTS time out settings. Brian has now changed them to a longer time.
Downtimes
UKI-NORTHGRID-SHEF-HEP: uni power cut
UKI-SOUTHGRID-BHAM-HEP: disruptive installation of new aircon units
UKI-SOUTHGRID-RALPP: site routers maintenance
RAL-LCG2: site routers maintenance
CVMFS
* timeout problem has now two tickets one for atlas and one for cvmfs.
https://savannah.cern.ch/bugs/?95420
https://savannah.cern.ch/support/?129468
Jakob thinks ha has found a solution and has a test version of cvmfs for it.
* Another bug we are looking at is cvmfs hanging every now and then this affects lhcb too so Raja might want to give a look.
https://savannah.cern.ch/bugs/?92112
Transfers errors
UKI-NORTHGRID-SHEF-HEP AND UKI-SOUTHGRID-RALPP had some problem with jobs in tranferring state accumulating after RAL downtime last week. This was due to FTS reporting the same error code for two different errors confusing Site Services. The problem has been noted and reported to the WLCG meeting.
- Other