LOW DOWN ON HIGH-ORDER BLOG

 

March 14, 2018

Q: does it make sense to partially assembled elemental stiffness matrices for affine tetrahedral finite elements when running on a Volta class GPU ?

Background: In the reviews of our recent paper on optimizing FEM operations for hexahedral elements we were ask...

March 13, 2018

BLAS (Basic Linear Algebra Subprograms) is a specification for performing multiple common, basic linear algebra routines. BLAS functions have been implemented and optimized for GPUs,  and packaged in libraries (i.e., cuBLAS, CULA, and (batched) MAGMA) .
 

A si...

March 6, 2018

In this post we demonstrate how to optimize the performance of a specific finite element operation expressed as a GPU kernel. 

One might ask: why are we doing this? Why do we care so much about optimizing performance? After carefully reading this post, it should be...

Please reload

Our Recent Posts

Please reload

Archive

Please reload

Tags

I'm busy working on my blog posts. Watch this space!

Please reload

 

225 Stanger St
Blacksburg, VA 24061
USA.

©2018 BY THE PARALLEL NUMERICAL ALGORITHMS  RESEARCH GROUP @VT.