Basic GPU optimization strategies
When I started writing GPU code, I often heard that using shared memory is the only way to get good performance out of my code. As I kept...
Titan-V @ V-Tech: initial benchmarking results
A new Titan V arrived at Virginia Tech today. Installation went relatively smoothly thanks to the patience of Bill Reilly. The Titan V...
Rough-n-Ready Roofline: NVIDIA V100 edition
In this post we discuss rules of thumb for performance limiters when using shared memory in an NVIDIA V100 CUDA compute kernel. The V100...