- ATLAS
- Issue with Throttler
- Requests remain in WAITING state forever --> #4979
- Way to manually run the throttler to unblock it
- Heartbeats
- With frequent Kubernetes pod restarts, outdated heartbeats are often found
- Ideally pod shutdown should also remove the heartbeat, but this does not seem to happen --> #4988
- CMS
- CTA multihop transfer
- More space on EOS to leave space for multihop
- --> More data (and thus Jobs) sent to EOS
- Possibly based on freespace weights on rules
- Freespace weight could be adapted
- ATLAS investigating if using relative freespace weights instead of absolute could be beneficial
- Fermilab/DUNE/ICARUS/RUBIN
- Staging failures on tape system
- Number of files in RUBIN
- Large number of files, might be an issue for transfers/tapes
- Belle II
- mod_gridsite issue
- Writes into /var/cache/gridsite
- More work on metadata
- DUNE
- Work on policy packages for the client
- After that, going back to lightweight Rucio clients
- Multi-VO
- Conveyor submitter/poller
- After discussion on Slack it works much better now
- Radu and Tim implemented a fix which should be able to select the right certificate for the right VO
- ESCAPE
- DAC21 (Data and Analysis Challenge 21) next week
- Currently two issues:
- Hermes increases memory consumption until it crashes?
- Kubernetes metrics vs Prometheus memory consumption do not add up?
- Reaper greedy deletion
- Found small unrelated bug which is fixed in helm-chart
- LSST use cases --> want to reach 60k deletions/h
- SKAO
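
Note on the freespace-weight item raised under CMS (absolute vs relative weights): the difference can be illustrated with a minimal sketch. This is not Rucio's implementation; the function names, RSE names, and numbers are all hypothetical and only show how the two weightings can rank the same storage endpoints differently.

```python
# Illustrative only: hypothetical helpers, not part of Rucio.

def absolute_weights(free_bytes):
    """Weight each RSE by its raw free space, normalised to sum to 1."""
    total = sum(free_bytes.values())
    return {rse: free / total for rse, free in free_bytes.items()}


def relative_weights(free_bytes, total_bytes):
    """Weight each RSE by the fraction of its own capacity that is free,
    normalised to sum to 1."""
    fractions = {rse: free_bytes[rse] / total_bytes[rse] for rse in free_bytes}
    norm = sum(fractions.values())
    return {rse: frac / norm for rse, frac in fractions.items()}


# Hypothetical example (units arbitrary, e.g. TB):
# a large, nearly full endpoint and a small, nearly empty one.
free = {"BIG_DISK": 400, "SMALL_DISK": 90}
capacity = {"BIG_DISK": 4000, "SMALL_DISK": 100}

abs_w = absolute_weights(free)
rel_w = relative_weights(free, capacity)

for rse in free:
    print(f"{rse}: absolute={abs_w[rse]:.3f} relative={rel_w[rse]:.3f}")
```

With absolute weights the large endpoint attracts most of the data even though it is 90% full; with relative weights the mostly empty endpoint is preferred. Which behaviour is desirable depends on the site mix, which is presumably what the investigation is meant to establish.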