NVIDIA CUDA Tutorial 5: Memory Overview
The GPU has a complicated but flexible set of different memories. It's sometimes called a memory hierarchy. Each type of memory has various characteristics which we'll look into individually later. This tutorial is a short introduction to all the memories. These memories can be confusing. In many instances, they are the same stuff, for instance, the L1 cache and shared memory are the same bytes. The texture, global, constant and local memory are the same bytes as well. At the end I've put a small demonstration of the NVidia Visual Profiler, a program which I think everybody that codes CUDA should definitely have a look at. The website with the Device Query program is: http://docs.nvidia.com/cuda/cuda-samp...

▶︎
NVIDIA CUDA Tutorial 6: An Embarrassingly Parallel Algorithm 1

▶︎
Intro to CUDA (part 5): Memory Model

▶︎
How CUDA Programming Works | GTC 2022

▶︎
Intro to CUDA (part 1): High Level Concepts

▶︎
Stanford Seminar - NVIDIA GPU Computing: A Journey from PC Gaming to Deep Learning

▶︎
CUDA Crash Course: GPU Performance Optimizations Part 1

▶︎
How Huawei Just Built an Impossible Chip

▶︎
China Just Built What TSMC Said Was Impossible

▶︎
NVIDIA CUDA Tutorial 1: Introduction

▶︎
Nvidia GPU Architecture

▶︎
Intro to CUDA - An introduction, how-to, to NVIDIA's GPU parallel programming architecture

▶︎
What Are CUDA Cores?

▶︎
CppCon 2016: “Bringing Clang and C++ to GPUs: An Open-Source, CUDA-Compatible GPU C++ Compiler"

▶︎
Inside the Mind of Anthropic CEO Dario Amodei | The Circuit | Extended Interview

▶︎
Introduction to programming in CUDA C

▶︎
Who Is Really Winning the AI Race? Follow the Money

▶︎
NVIDIA CUDA Tutorial 8: Intro to Shared Memory

▶︎
Introduction to GPU Programming with CUDA and Thrust

▶︎
