- Grafana metrics: Might want to introduce additional rate metrics that subtract the header overhead to have the pure payload: low priority.
- Merged workflow fails if outputs defined after being used as input
- needs to be implemented by Giulio
- Cannot override options for individual processors in a workflow
- requires development by Giulio first
- Problem with 2 devices of the same name
- Usage of valgrind in external terminal: The testcase is currently causing a segfault, which is an unrelated problem and must be fixed first. Reproduced and investigated by Giulio.
- Run getting stuck when too many TFs are in flight.
- Do not use string comparisons to derrive processor type, since DeviceSpec.name is user-defined.
- Support in DPL GUI to send individual START and STOP commands.
- Add additional check on DPL level, to make sure firstOrbit received from all detectors is identical, when creating the TimeFrame first orbit.
- Implement a proper solution to detect wheter a device is firstInChain
- Deploy topology with DPL driver
PDP-SRC issues
- Check if we can remove dependencies on
/home/epn/odc/files
in DPL workflows to remove the dependency on the NFS
- reading / writing already disabled
- remaining checks for file existence?
- check after Pb-Pb by removing files and find remaining dependencies
logWatcher.sh
and logFetcher
scripts modified by EPN to remove dependencies on epnlog
user
- node access privileges fully determined by e-groups
- new
log_access
role to allow access in logWatcher
mode to retrieve log files, e.g. for on-call shifters
- to be validated on STG
- waiting for EPN for further feedback and modifications of the test setup
- new
BEAMTYPE
for oxygen period
- https://its.cern.ch/jira/browse/O2-5797
- beam types
- p-O and O-O
- Ne-Ne still to be confirmed
- scripts to be adjusted to set proper workflow parameters
- will as RC to create new configurations, workflow parameters to be set in the configuration instead of the scripts
- tests using Pb-Pb replay data with new beam types
- some performance issues due to unset process multiplicities, otherwise everything looks fine
- RC asked for a synthetic OO dataset, will check with Sandro if our generators are already set up for this
- O2 code to be checked for pp and PbPb specific variables
- STOP timeout tests
- fixed: O2_DPL_PROCESSING_TRANSITION_TIMEOUT_DEFAULT=10
- EPNs, STOP time between `Stop request for ODC` and `Collection states: READY`
- 2tcEZMn3qXX - 561585 - `O2_DPL_EXIT_TRANSITION_TIMEOUT_DEFAULT=1` - 13 s - 56 tasks fail at shutdown, all `gpu-reconstruction`
2tcF2Aape2M - 561586 - `O2_DPL_EXIT_TRANSITION_TIMEOUT_DEFAULT=10` - 33 s - 52 tasks fail at shutdown, all `gpu-reconstruction`
2tcFeEy12DF - 561587 - `O2_DPL_EXIT_TRANSITION_TIMEOUT_DEFAULT=20` - 40 s - 15977 tasks fail at shutdown
2tcGLWLFhG5 - 561588 - `O2_DPL_EXIT_TRANSITION_TIMEOUT_DEFAULT=30` - 40 s - 112 tasks fail at shutdown, all `gpu-reconstruction`
2tcGp96HGRo - 561589 - `O2_DPL_EXIT_TRANSITION_TIMEOUT_DEFAULT=40` - 50 s - few oldetPossibleInput/Output messages 20 s after STOP, no crashes at shutdown