Rough-n-ready Roofline: Titan V edition
In this post we discuss rules of thumb for performance limiters when using shared memory in a CUDA compute kernel running on a Titan V -...
Our Recent Posts
Archive
Tags
Rough-n-ready Roofline: Titan V edition
libParanumal: Galerkin-Boltzmann 3D flow simulation
libParanumal: Galerkin-Boltzmann flow simulation
Undergraduate Summer Researchers Join the Paranumal Team
Jesse Chan Talk on Entropy Stable Schemes @VT
Finite Element Stiffness Matrix Action: to precompute or not to precompute
Finite Element Stiffness Matrix Action: to BLAS or not to BLAS, that is the question.
Finite Element Stiffness Matrix Action: monolithic kernel optimization on Titan V
Basic GPU optimization strategies
Titan-V @ V-Tech: initial benchmarking results