3D NAND Flash Scaling encompasses the technology for manufacturing vertically stacked flash memory cells — now exceeding 200 layers — where charge-trap flash cells are formed along vertical channels etched through alternating oxide/nitride or oxide/poly layers delivering exponentially increasing bit density per die area without requiring the extreme lithographic resolution that limited planar NAND scaling.
From 2D to 3D NAND:
``
Planar NAND hit fundamental limits at ~15nm half-pitch:
- Cell-to-cell interference (capacitive coupling)
- Too few electrons per cell (unreliable storage)
- Extreme lithography cost for small features
3D NAND solution: stack cells vertically
Instead of shrinking horizontally → build taller
Feature sizes RELAX to ~30-40nm (vertical pitch)
Density increase from stacking more layers
`
3D NAND Architecture:
```
Cross-section of 3D NAND:
─── Metal bitline ───
│
┌────┴────┐
Layer 200+ │ WL 200 │ ← Wordline (gate)
│ WL 199 │
│ ... │
│ WL 2 │
│ WL 1 │ ← Each WL = one cell
└────┬────┘
│
─── Source line ───
Vertical channel (poly-Si, ~50-80nm diameter)
Surrounding each channel:
Tunnel oxide (SiO₂) │ Charge trap (Si₃N₄) │ Blocking oxide │ WL metal
Key Technologies:
- High-aspect-ratio etch: Etching memory holes through 200+ layers of alternating oxide/nitride stack. AR exceeds 60:1 at the most advanced nodes. The deepest etches in semiconductor manufacturing (~10-15μm deep, 50-80nm diameter holes).
- Channel formation: Deposit thin poly-Si film lining the memory hole → forms the transistor channel. Must be continuous and uniform through the full stack height.
- Charge trap flash: Si₃N₄ trapping layer (instead of floating gate) stores charge. Electrons tunnel from channel through thin SiO₂ into Si₃N₄. Advantage: more tolerant of cell-to-cell variation than floating gate.
- Gate replacement (TCAT process): Samsung's approach — deposit O/N stack, etch holes + channels, then replace the nitride with tungsten metal gates through access slits.
- Metal gate: Tungsten (W) deposited by CVD fills the thin gate regions between oxide layers. ALD TiN is used as barrier/adhesion.
Scaling Generations:
| Generation | Layers | Vendors | Key Innovation |
|-----------|--------|---------|----------------|
| 1st gen | 24-32L | ~2013-2015 | Basic 3D concept proven |
| 4th gen | 96-128L | ~2019-2020 | String stacking (two stacks bonded) |
| 6th gen | 176L | ~2021 | CMOS-under-array (CuA) |
| 8th gen | 232-236L | ~2023 | Double-stack bonding |
| 9th gen | 280-321L | 2024-2025 | Triple-stack, >300 layers |
| Future | 400-500L | 2026+ | Quad-stack, molybdenum gates |
String Stacking: Instead of etching through the full stack at once (impossible at >200 layers), process the stack in 2-3 segments, each ~100 layers, and bond them using inter-stack connection vias. This relaxes the AR requirement for each individual etch.
CMOS-under-Array (CuA): Place peripheral logic (page buffer, decoder, control) underneath the memory array instead of beside it, dramatically reducing die area. Enabled by wafer-to-wafer bonding: fabricate CMOS logic wafer → fabricate memory array wafer → bond face-to-face.
QLC/PLC Multi-Level:
| Level | Bits/Cell | Levels | Endurance |
|-------|----------|--------|----------|
| SLC | 1 | 2 | 100K cycles |
| MLC | 2 | 4 | 10K cycles |
| TLC | 3 | 8 | 1-3K cycles |
| QLC | 4 | 16 | 500-1K cycles |
| PLC | 5 | 32 | ~100 cycles |
3D NAND is the most successful semiconductor scaling strategy of the past decade — by decoupling density scaling from lithographic resolution and instead scaling vertically, the NAND industry has continued to deliver exponential capacity growth, enabling the data-intensive AI era with ever-cheaper, denser solid-state storage.