- Grafana metrics: Might want to introduce additional rate metrics that subtract the header overhead to have the pure payload: low priority.
- Merged workflow fails if outputs defined after being used as input
- needs to be implemented by Giulio
- Cannot override options for individual processors in a workflow
- requires development by Giulio first
- Problem with 2 devices of the same name
- Usage of valgrind in external terminal: The testcase is currently causing a segfault, which is an unrelated problem and must be fixed first. Reproduced and investigated by Giulio.
- Run getting stuck when too many TFs are in flight.
- Do not use string comparisons to derrive processor type, since DeviceSpec.name is user-defined.
- Support in DPL GUI to send individual START and STOP commands.
- Add additional check on DPL level, to make sure firstOrbit received from all detectors is identical, when creating the TimeFrame first orbit.
- Implement a proper solution to detect wheter a device is firstInChain
- Deploy topology with DPL driver
- Start / Stop / Start: 2 problems on O2 side left:
-
- All processes are crashing randomly (usually ~2 out of >10k) when restarting. Stack trace hints to FMQ. https://its.cern.ch/jira/browse/O2-4639
- TPC ITS matching QC crashing accessing CCDB objects. Not clear if same problem as above, or a problem in the task itself
PDP-SRC issues
- Check if we can remove dependencies on
/home/epn/odc/files
in DPL workflows to remove the dependency on the NFS
- reading / writing already disabled
- remaining checks for file existence?
- check after Pb-Pb by removing files and find remaining dependencies
logWatcher.sh
and logFetcher
scripts modified by EPN to remove dependencies on epnlog
user
- node access privileges fully determined by e-groups
- new
log_access
role to allow access in logWatcher
mode to retrieve log files, e.g. for on-call shifters
- to be validated on STG
- waiting for EPN for further feedback and modifications of the test setup
- new
BEAMTYPE
for oxygen period
- https://its.cern.ch/jira/browse/O2-5797
- RC asked for a synthetic OO dataset
- RC request for a pO dataset
- question if we can use the same CTP config for pO as for OO -> yes
- O2 code to be checked for pp and PbPb specific variables
- TMinuit errors (
Initial matrix not pos. def.
) in QC tasks during fitting procedure
- mainly in vertex x, y fits, but also in some TPC QC tasks
- rare random errors, can reproduce locally
- to see and understand if they can be avoided with extra options in the Fit call, otherwise filter them in EPNstderrMonitor