Home Knowledge Base GPU Multi-Instance GPU (MIG)

GPU Multi-Instance GPU (MIG) is a hardware partitioning feature introduced with NVIDIA's A100 (Ampere) architecture that divides a single physical GPU into up to seven independent instances, each with dedicated compute resources, memory bandwidth, and memory capacity — MIG enables multiple users or workloads to share a GPU with hardware-level isolation, guaranteed quality of service, and no performance interference.

MIG Architecture:

A100 MIG Configurations:

MIG Setup and Management:

Use Cases and Deployment:

Performance Characteristics:

Comparison with Other GPU Sharing:

MIG has fundamentally changed GPU datacenter economics — by enabling safe multi-tenancy with hardware-enforced isolation, a single A100 can serve 7 independent inference workloads simultaneously, reducing per-workload GPU cost by up to 7× while maintaining predictable performance.

gpu multi instance gpu mignvidia mig partitioninggpu isolation mig slicesmig compute instance profilea100 mig configuration gpu

Explore 500+ Semiconductor & AI Topics

From EUV lithography to CUDA optimization — search the full knowledge base or chat with our AI assistant.