top of page
LOW DOWN ON HIGH-ORDER BLOG
Blog: Headliner
Our Recent Posts
Archive
Tags
Basic GPU optimization strategies
When I started writing GPU code, I often heard that using shared memory is the only way to get good performance out of my code. As I kept...
Titan-V @ V-Tech: initial benchmarking results
A new Titan V arrived at Virginia Tech today. Installation went relatively smoothly thanks to the patience of Bill Reilly. The Titan V...
Rough-n-Ready Roofline: NVIDIA V100 edition
In this post we discuss rules of thumb for performance limiters when using shared memory in a NVIDIA V100 CUDA compute kernel. The V100...
Concurrent Cloud Computing: installing occaBench for V100
Overview: This week we have been experimenting with instances on Amazon AWS and Paperspace that come equipped with NVIDIA V100 GPUs....
Vaunted Volta Verified: initial comparison of the NVIDIA V100 & P100 GPUs
We created an Amazon EC2 instance with NVIDIA V100 GPU. We will discuss that process in more detail in a future posting. As usual this is...
CEED Code Competition: VT software release
VT CEED BP Software Release: the VT Parallel Numerical Algorithms team has released GPU optimized implementations for the Center for...
Blog: Blog
bottom of page