2 Open Tickets

- 142370  from 22-Jul-2019   AGLT2 timeout transfer errors.
The dCache door fails to signal that the transfer has completed,
so the globus client hangs until the 360 s timeout kicks in.
This happens before the checksum is requested.
Already reported by CMS.

- 142695  from 13-Aug-2019   HC jobs failing for analysis queue.
A fraction of jobs fail (2-10/hour), leaving condor_starter running.
The pilot is receiving a continuous stream of SIGSEGV.
Investigation now converging on libgfal_plugin_http.so, at least for initiating the problem.
The instance from cvmfs works as expected, but pilot2 at AGLT2 uses the local version from EPEL,
which yum updated on July 19, matching the start of this problem. At least CERN and AGLT2 are affected.
New Pilot2 v2.1.21 fixes the endless waiting on the continuous signal thrown by rucio.
The Rucio team may also have to address this bug.

Operation otherwise stable

Planned purchase  
  - Storage: 6x R740Xd2
  - Infrastructure: PDUs and fan doors