Speaker
Daniel Campora
(University of Maastricht)
Description
Programming for Heterogeneous Architectures - lecture 3
- Data locality, coalesced memory accesses, tiled data processing
- GPU streams, pipelined memory transfers
- Under the hood: branchless, warps, masked execution
- Debugging and profiling a GPU application