Speaker
Daniel Campora
(Nikhef National Institute for Subatomic Physics (NL))
Description
Programming for Heterogeneous Architectures - lecture 4
- GPU streams, pipelined memory transfers
- Good practices: single precision, branchless, avoid register spilling, convert the problem
- Reusable parallel design patterns with real-life applications
- Under the hood: warps, masked execution, floating-point rounding