GPUs

High-Throughput Mixture-of-Experts Serving: Intern Talk at NVIDIA

Memory Systems for Scalable LLM Training: Intern Talk at AMD

Scalable Multi-Node Fast Fourier Transform on GPUs

CUDA Programming

Discussed CUDA programming concepts in a guest lecture for a course on High Performance Computing