Rucio Development Meeting
→
Europe/Zurich
Martin Barisits
(CERN)
Description
Zoom: https://cern.zoom.us/j/413496641
Meeting ID: 413 496 641
Find your local number: https://cern.zoom.us/u/aT2QQfXAo
-
-
1
News
- Release schedule
- 1.23.0rc1 released
- Testing ongoing
- Further RCs if needed
- If testing goes well 1.23.0 final next week
- If further RCs are needed the week after
- Code freeze active until then
- 1.23.1 on July 20th
- 1.23.0rc1 released
- Meeting schedule
- Should we reduce meeting schedule during summer
- We did so during July and August 2019
- Fortnightly meeting unless an additional meeting is needed
- Should we reduce meeting schedule during summer
- Planned topics
- Presentation on Rucio Bot by Vasilis (GSOC Student)
- July 16th
- Component lead planning
- See github
- Lack of manpower in maintenance/development of several components
- Component leads from the wider Rucio dev community?
- Discussion on July 9th (Next week)
- Rucio 1.24.0 release planning
- To be scheduled
- Multi-VO enhancement discussion
- To discuss features and change requests specific to Rucio MultiVO mode
- e.g. Reaper certificate handling, etc.
- To be scheduled
- To discuss features and change requests specific to Rucio MultiVO mode
- Presentation on Rucio Bot by Vasilis (GSOC Student)
- Presentation at JUNO Collaboration
- Release schedule
-
2
News from the experiments
- ATLAS
- CTA Migration
- 1.22.8 release deployed in ATLAS
- Works well but configuration "issue" found
- LAN domain can now also be used for write operation
- Requires delete, write and read operation to be set for LAN
- Database downtime
- Oracle at CERN planned downtime for several hours on Saturday
- Daemons and servers came up by themselves
- Oracle at CERN planned downtime for several hours on Saturday
- CMS
- NanoAOD transition going well
- More datasets moved from phedex to Rucio
- CTA Multihop test at 200TB level
- Probes Pullrequest open for a long time
- Dimitrios looking into it tomorrow
- Prometheus PR from Thomas needs to be merged as well
- NanoAOD transition going well
- Belle II
- Belle II General Meeting last week with lots of sessions
- Migration done after KEK downtime (End of August)
- Belle II code frozen and running validation
- RAL/UK
- LDMX
- Running only catalog
- Deployed abbacus to keep counters in sync
- Using prometheus/grafana framework as well
- Generic Rucio prometheus probe for that?
- Fermilab
- OpenShift cluster deployment working
- Changes sent to helm-charts/containers
- Transfers via fts3 working
- OpenShift cluster deployment working
- Folding
- Transfers from WU-servers to RAL
- Cron job for file population
- ATLAS
-
3
Hot topics
- CTA Migration
- Put in production Monday morning
- Issues when reading data back from CTA
- gfal failing full batches of files
- Tickets open on gfal
- Should be addressed in the next days
- Test with small files to validate run3 rate (50-100Hz)
- Write and concurrent VO tests
- Large datarate/full-chain test when FTS check-on-tape work is finished (For full run3 workflow)
- Multihop mode
- Issue with multihop due to the way the intermediate requests are created
- Martin looking into this right now, hopefully some news soon
- CTA Migration
-
4
Developers roundtable
- Burn chart and progress
- 1.23.0 LTS "The Incredible Donkey" priority followup
- In Progress
- To do
- Done
- New Code management Model #3417 [Martin, Ben]
- GH Actions migrations works very well
- Will be changed after 1.23 is out
- Some tools/documentation changes missing still
- AAI/OIDC improvements
- Expand Kubernetes Usage
- MultiVO Functionality #2635 [Eli, Patrick]
- Unification of metadata interfaces #3096 [Aris]
- QoS #3419 [Aris, Mario, Martin]
- Hermes Aggregator for Monitoring #3680 [Cedric]
- Changing gfal protocol (adding protocol) #3537 [Mario]
- New Code management Model #3417 [Martin, Ben]
- Delayed
- Kubernetes
- Moving ATLAS fully into production later this summer
- Comparing deployment config between ATLAS & CMS
- ATLAS Rucio clients Docker Container [Thomas]
- On Dockerhub
-
5
AOB
-
1