Home Knowledge Base Neural Network Pruning Techniques (Unstructured, Structured, Lottery Ticket)

Neural Network Pruning Techniques (Unstructured, Structured, Lottery Ticket) is the systematic removal of redundant or low-importance parameters from trained neural networks to reduce model size, computational cost, and memory footprint — enabling deployment of large models on resource-constrained devices while maintaining accuracy within acceptable tolerances.

Pruning Motivation and Theory

Modern neural networks are vastly overparameterized: GPT-3 has 175B parameters, but empirical evidence suggests that 60-90% of weights can be removed with minimal accuracy loss. The lottery ticket hypothesis (Frankle and Carlin, 2019) provides theoretical grounding—dense networks contain sparse subnetworks (winning tickets) that, when trained in isolation from their original initialization, match the full network's accuracy. Pruning identifies and preserves these critical subnetworks.

Unstructured Pruning

Structured Pruning

Lottery Ticket Hypothesis

Advanced Pruning Methods

Pruning-Aware Training and Deployment

Neural network pruning has matured from an academic curiosity to a practical deployment necessity, with methods like SparseGPT and Wanda enabling compression of the largest language models to fit within constrained inference budgets while preserving the knowledge acquired during expensive pretraining.

neural network pruning techniquesunstructured pruning lottery ticketstructured pruning channelsweight pruning sparse neural networklottery ticket hypothesis

Explore 500+ Semiconductor & AI Topics

From EUV lithography to CUDA optimization — search the full knowledge base or chat with our AI assistant.