Kicking off GPU Mode [D]
Mirrored from r/MachineLearning for archival readability. Support the source by reading on the original site.
Hey !
I’m starting a series to document my work on GPU infrastructure, LLMs, and CV.
Stop #1 is up: A brief look at why GPUs are the center of the industry, the CPU/GPU divide, and why nvidia-smi is the first place you check when things break.
We’ll move past the basics quickly to focus on:
- Empirical architecture differences (Ampere vs. Hopper vs. Blackwell).
- Handling register pressure in custom kernels.
- Asynchronous memory paradigms (TMA/wgmma).
#CUDA #GPU #KernelOptimization #SystemsProgramming
[link] [comments]
More from r/MachineLearning
-
Loss functions in Instance Representation Learning [R]
Jun 29
-
Price elasticity model [R]
Jun 29
-
Rejected MICCAI paper: workshop -> journal/conference or directly journal/conference [R]
Jun 29
-
I built a demo agricultural planning system with an AI advisor for small-scale farmers in Nicaragua using NASA data [p]
Jun 29
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.