Shifting through the Gears of GPU Programming: Understanding Performance and Portability Trade-offs
, NVIDIA
This talk will show implementations of standard linear algebra algorithms in a a range of programming models, including standard parallelism, OpenACC, and CUDA, and how the performance and productivity varies across these.