Development

Google's datacenter applications: Invited talk at Intel Labs

CUDA Programming

Discussed CUDA programming concepts as part of a guest lecture in a course on High Performance Computing.

Multi-node multi-GPU programming

Created open-source tutorials and bootcamps on scalable multi-GPU programming for HPC applications.

Highly Scalable FFT on GPUs

Extended Tarang, a parallel computational fluid dynamics simulator, to enable GPU-based FFTs.