Modern GPU Programming for MLSys

(mlc.ai)

36 points | by crowwork 3 days ago

2 comments

  • hazard 21 minutes ago
    This looks great, but I'd really like to see associated exercises (and solutions) to make it useful for self-study.
  • mathisfun123 2 hours ago
    "Modern [NVIDIA GPU] Programming for ..."

    Everything after "Pipelining GEMM with TMA" (inclusive) is specific to NVIDIA. Which is fine, but the title (of the guide itself) is clearly misleading.

    • nh23423fefe 0 minutes ago
      > Our main target is the Blackwell generation,

      misleading?