Speaker
            
    Daniel  Campora 
        
            (Nikhef National institute  for subatomic physics (NL))
        
    Description
Programming for Heterogeneous Architectures - lecture 4
- GPU streams, pipelined memory transfers
- Good practices: single precision, branchless, avoid register spilling, convert the problem
- Reusable parallel design patterns with real-life applications
- Under the hood: Warps, masked execution, floating point rounding