r/MachineLearning · 1 min read

Kicking off GPU Mode [D]


Hey!
I’m starting a series to document my work on GPU infrastructure, LLMs, and CV.

Stop #1 is up: a brief look at why GPUs sit at the center of the industry, how CPUs and GPUs differ, and why nvidia-smi is the first place to check when things break.
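If you want to script that first nvidia-smi check, here is a minimal sketch rather than anything official from the series. It shells out to nvidia-smi with its `--query-gpu` and `--format=csv,noheader,nounits` options and turns each line into a dict. The helper names (`FIELDS`, `parse_gpu_csv`, `query_gpus`) and the choice of fields are my own assumptions for illustration, and values are kept as strings because some GPUs report entries like `[N/A]`.

```python
import shutil
import subprocess

# Fields to request from nvidia-smi (an illustrative selection, not a required set).
FIELDS = ["name", "memory.used", "memory.total", "utilization.gpu", "temperature.gpu"]


def parse_gpu_csv(text):
    """Parse `--format=csv,noheader,nounits` output into one dict per GPU."""
    rows = []
    for line in text.strip().splitlines():
        values = [v.strip() for v in line.split(",")]
        if len(values) != len(FIELDS):
            continue  # skip lines that do not match the requested field count
        rows.append(dict(zip(FIELDS, values)))
    return rows


def query_gpus():
    """Return a list of per-GPU dicts, or [] if nvidia-smi is missing or fails."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        ["nvidia-smi", f"--query-gpu={','.join(FIELDS)}", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return []
    return parse_gpu_csv(result.stdout)


if __name__ == "__main__":
    for gpu in query_gpus():
        print(gpu)
```

An empty list here usually means the driver or the tool itself is the problem, which is worth knowing before you start debugging the framework on top.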

We’ll move past the basics quickly to focus on:

  • Empirical architecture differences (Ampere vs. Hopper vs. Blackwell).
  • Handling register pressure in custom kernels.
  • Asynchronous memory and compute paradigms (TMA/wgmma).

#CUDA #GPU #KernelOptimization #SystemsProgramming

submitted by /u/Positive_Canary1723