GPU param optimisation
Updates:
- Performed several measurements for independent kernels (MergerTrackFit, MergerFollowLoopers, MergerSliceRefit, MMergerCollect, CompressionKernels_step0attached, CompressionKernels_step1unattached)
- Created a set of best parameters per beamtype (pp, PbPb) and per IR (100kHz, 2MHz for pp and 5kHz, 50kHz for PbPb)
- How to choose the best configuration:
  - compute conf_mean_time - default_conf_mean_time
  - propagate the error (std dev) of the difference and compute the 95% confidence interval
  - if 0 lies in the interval, we can't tell with confidence whether the current configuration is better than the default
  - if one or more CIs have an upper bound < 0, choose the configuration with the smallest mean (i.e. the best)
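
The selection rule above can be sketched in Python as follows. This is only a minimal illustration, not the script actually used: it assumes independent timing samples, a normal approximation (z = 1.96) for the 95% interval, and the standard error of each mean as the propagated uncertainty. The function names and the idea of passing raw timing lists are hypothetical.

```python
import math
from statistics import mean, stdev

Z_95 = 1.96  # two-sided 95% quantile of the normal distribution (assumption)


def diff_confidence_interval(conf_times, default_times, z=Z_95):
    """Return (diff, lower, upper) for conf_mean_time - default_conf_mean_time."""
    diff = mean(conf_times) - mean(default_times)
    # Propagate the uncertainties of the two independent means in quadrature.
    # Assumption: the standard error of each mean is the relevant uncertainty.
    err = math.sqrt(
        stdev(conf_times) ** 2 / len(conf_times)
        + stdev(default_times) ** 2 / len(default_times)
    )
    return diff, diff - z * err, diff + z * err


def choose_best(candidates, default_times):
    """Pick the configuration that is significantly faster than the default.

    candidates: dict mapping a configuration label to its list of timings.
    Returns the label with the smallest mean among the configurations whose
    confidence interval lies entirely below zero, or None if there is none.
    """
    significant = {}
    for label, times in candidates.items():
        _, _, upper = diff_confidence_interval(times, default_times)
        if upper < 0:  # whole interval below zero: faster than the default
            significant[label] = mean(times)
    if not significant:
        return None  # cannot claim an improvement over the default
    return min(significant, key=significant.get)
```

Calling choose_best with a dict of timing lists per candidate configuration and the timing list of the default returns the label to adopt, or None when the default should be kept.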
- Started investigating concurrent kernels, beginning with the TrackletConstructor_singleSlice params, as it is the longest kernel overall among those running alongside other concurrent kernels
  - Grid search over its grid and block size
  - Measured both the mean kernel duration and the overall TrackingSlice duration (changing the params influences the duration of the whole phase)
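
A grid search of this kind could be organised as in the sketch below. It is a hedged illustration only: `measure` is a hypothetical callable standing in for whatever launches the benchmark and extracts the timings, `fake_measure` is a made-up timing model rather than real data, and ranking by the phase duration instead of the kernel duration is an assumption motivated by the observation that the params influence the whole phase.

```python
from itertools import product


def grid_search(measure, grid_sizes, block_sizes):
    """Evaluate every (grid_size, block_size) pair.

    measure: hypothetical callable (grid_size, block_size) ->
             (mean_kernel_time, phase_time).
    Returns one record per pair, sorted by phase duration, fastest first.
    """
    results = []
    for grid_size, block_size in product(grid_sizes, block_sizes):
        kernel_time, phase_time = measure(grid_size, block_size)
        results.append(
            {
                "grid": grid_size,
                "block": block_size,
                "kernel_time": kernel_time,
                "phase_time": phase_time,
            }
        )
    # Rank by the whole-phase duration, since concurrent kernels interact.
    return sorted(results, key=lambda r: r["phase_time"])


def fake_measure(grid_size, block_size):
    """Stand-in timing model for illustration only, NOT measured data."""
    kernel_time = 1000.0 / (grid_size * block_size) + 0.01 * block_size
    phase_time = kernel_time + 0.002 * grid_size
    return kernel_time, phase_time
```

Keeping both durations in each record makes it possible to spot configurations that shorten the kernel itself but lengthen the phase as a whole.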
To do:
- Plug in the optimal set of parameters per (beamtype, IR) and check whether there is a noticeable improvement in TPC processing when the input TF has the matching (beamtype, IR)
- Continue investigating phases with concurrent kernels (more difficult than single-stream kernels)
Problems:
- EPNs are currently down.