BiLD-Dev
Bi-Weekly "Loyal" DIRAC developers meeting. And, following, the LHCbDIRAC developers meeting.
Join Zoom Meeting
https://cern.zoom.us/j/91083694183?pwd=ZkdDY1I1YkJVc2o3UTdBY1BRZE15UT09
Meeting ID: 910 8369 4183
Passcode: 12345678
One tap mobile
+41432107108,,91083694183# Switzerland
+41315280988,,91083694183# Switzerland
Dial by your location
+41 43 210 71 08 Switzerland
+41 31 528 09 88 Switzerland
+41 43 210 70 42 Switzerland
+33 1 8699 5831 France
+33 1 7037 2246 France
+33 1 7037 9729 France
Meeting ID: 910 8369 4183
Find your local number: https://cern.zoom.us/u/artUfAdNB
Join by SIP
91083694183@188.184.89.188
91083694183@188.185.118.153
Join by H.323
188.184.89.188
188.185.118.153
Meeting ID: 910 8369 4183
Passcode: 12345678
BiLD (Bi-weekly DIRAC Development meeting) – 19/05/2022
On Zoom: Federico, Alexandre, André, Christophe, Christopher, Ruben, Simon F, Simon M, Daniela, Janusz, Simon, Hideki, Ueda, Xiaomei
Apologies: Andrei
Follow-up from previous meetings
- DIRAC Workshop the last week
- notes (from the discussions going on after each presentation) are in https://demo.hedgedoc.org/OshClQTkTBKNSCIZf9WkoA#
- Went briefly through the summary slides, mostly slide 8, and the in-house developments have the potential of becoming part of vanilla DIRAC:
- ILCDirac jobs and requests monitoring
- André will look into this one
- Rucio–DIRAC FC methods from Belle2
- Ueda possible
- Scout jobs (Belle2)
- Ueda possible, a colleague looking into this
- Dataset Searcher ? (Belle2)
- Ueda it is actually rather specific
- ILCDirac jobs and requests monitoring
- hackathon of May 12th, on 8.0.0a21:
- Main issue: https://github.com/DIRACGrid/DIRACOS2/issues/72
- DIRACOS on CVMFS updated? seems so
- Main issue: https://github.com/DIRACGrid/DIRACOS2/issues/72
DIRAC communities roundtable
GridPP:
Daniela+SimonF+Janusz
- Not updated yet in production
- Janusz Found a bug with Multi-VO metadata, issue will be created today. Can’t select files based on creation date. Should not affect the single VO installations.
LHCb:
Federico+Christophe+Christopher+Alexandre+Ruben+SimonM+Alexey
- NTR
ILC/CLIC/FCC/Calice:
André
- Moved to Python3 pilots
- diracos(1) gfal libraries no longer able to talk to DESY SEs (certificates)
- I need a way to unset the GFAL (and other variables) from the worker node environment
- Add option or command to DIRACPilot?
- Create My own pilot extension?
- Christopher easiest is a new release of DIRACOS2, will be done today
- DIRACGrid/DIRACOS2#73
Belle2
Hideki
- Still not managed to fully solve the compatibility issues with v7r2
- SiteDirector “StopOnApplicationFailure”: why True by default?
- Federico for avoiding “black holes”. Still, for PoolCE we set StopOnApplicationFailure=False
BES3+Juno
Xiaomei
- Trying python3 pilots, mostly OK. Install extension seems not to be possible…?
- Federico ask in a github discussion, please I will have a look.
Topics from GitHub/Discussions or Google forum
- How to stop users from creating non-indexed metadata
- not converged yet
DIRAC releases
- v7r2:
- [v7r2p44]
- WorkloadManagementSystem FIX: (#6022) utf-8 support in job output for python2
- Elasticsearch NEW: (#5987) option op_type in index API for Elasticsearch
- [v7r2p44]
- v7.3:
- [v7.3.21] buggy
- Resources
FIX: (#6059) use ceType instead of ceName for CE parameters ==> buggy
NEW: (#5988) add CloudComputingElement - WorkloadManagementSystem
FIX: (#5979) StalledJobAgent can force status from Submitting to Failed - Core
FIX: (#6011) Allow rootPath to be overridden with DIRAC_ROOT_PATH environment variable - DataManagementSystem
NEW: (#6008) BringOnline setting for FTS transfers can be SE dependant
- Resources
- [v7.3.21] buggy
- v8.0
- 8.0.0a21:
- Resources
NEW: (#6061) add SRM+HTTPs as a default TPC protocol in SRM - Core
NEW: (#6017) TornadoBaseRequestHandler define a class logger with an extra attribute “tornadoComponent” - All
CHANGE: (#6017) services propagate their local loggers to the DB
- Resources
- 8.0.0a21:
DIRAC projects
DIRAC:
Issues by milestone:
- v7r2:
- Closed ElasticSearch index duplicate document
- Still a few issues out there, please proceed with them, or move them up.
- v7r3:
- CloudDirector -> SiteDirector moved to v8 milestone
- New Singularity needs to mount local directory to use File storage plugin
- Christophe will look at it this afternoon
- v8.0:
- Monitoring generating insane timestamp
- store seconds? then we would need to change each and every stored index. Alternative proposed, but still needs some investigation.
- Monitoring generating insane timestamp
- v8.1:
*
Other issues:
- NTR
PRs discussed:
- https://github.com/DIRACGrid/DIRAC/pull/6067 was not swept. Christopher will check
- Not much discussion in any other.
WebApp:
- [5.0] fix: JobMonitor page with datetime values should be now ready
Pilot:
- Janusz Created a UUID in the pilot wrapper. No need for a finalizer as everything is in the pilot wrapper. 2 remote calls to ship the logs.
- python 2.6 still needed. Not, if you start from v7r3.
- microseconds: can the normal python
logging
module do it?- Christopher yes, I will point it to you
- logs get truncated?
- Christopher buffer on popen - the length come from the system
- Christopher the logs should be sent in batches, not all of it at the same time.
DIRACOS2:
- NeTR
Documentation:
- from previous meeting diracgrid.org still forwards to http://mardirac.in2p3.fr
OAuth2:
- from previous meeting Any news from issue WLCG and Token transition – reminders and requirements ?
- NTR
tornado/HTTPs
- Christophe Pending PR for dealing with error handling.
- from previous meeting Andrii We can eventually have special classes in Configuration and BundleDelivery to pass binary data - to be discussed.
- NTR
management
- NTR
diraccfg
- NTR
COMDIRAC
- NTR
DB12
- NTR
Rucio
- NTR
Release planning, tests and certification
-
8.0 release
- Currently issues still to be resolved:
* DIRAC’s Epoch time has time zone dependence
* Monitoring generating insane timestamp
* Remove use of BLOB storage in RMS - Target: 1 last hackathon, maybe
- Currently issues still to be resolved:
-
Certification machines
- lbcertifdirac70 machine:
- Everyone’s OK if the machine is rebooted
- lbcertifdiracoauth machine:
- Will also be rebooted
- lbcertifdirac70 machine:
-
Next hackathon(s)
- ?
AOB
Next hackathon on June 16th
Next BiLD on June 9th
Just publicized:
- CHEP 2023
- ACAT: https://indico.cern.ch/event/1106990/program ?
LHCbDIRAC
- Alexey agreed to take care of LHCbDIRAC Bookkeeping.
- no news
- v10r4: deploy board in https://trello.com/b/kzUKdMts/deploy-v10r3
- upgrade of
dfc
: Chistophe in contact with DBOD support. Wants to wait and see what happened tolbwms
first - HTTPS services: some work is still needed for most of them.
- upgrade of
- LHCbDIRAC hackathon based on 8.0
- 31st May
- leave DISET
- https://gitlab.cern.ch/lhcb-dirac/diracchart/