Homeโ€บ Knowledge Baseโ€บ GPU Memory Hierarchy

GPU Memory Hierarchy is the multi-level, bandwidth-stratified storage system combining registers, caches, shared memory, and DRAM, with fundamentally different access latencies and throughputs that dominate GPU application performance.

GPU Memory Hierarchy Levels

Bandwidth Characteristics at Each Level

Coalescing Rules for Global Memory

Bank Conflict in Shared Memory

L2 Cache Policies and Control

Unified Memory and Page Migration

Prefetching Strategies

Memory Access Optimization Case Studies

heterogeneous memory hbm gddrmemory bandwidth gpu hierarchyl1 l2 shared memory hierarchyunified memory page migrationmemory access pattern coalescing

Explore 500+ Semiconductor & AI Topics

From EUV lithography to CUDA optimization โ€” search the full knowledge base or chat with our AI assistant.