Home Knowledge Base CUDA and Compute Capability

CUDA and Compute Capability

What is CUDA? CUDA (Compute Unified Device Architecture) is NVIDIA's parallel computing platform and API that enables GPUs to be used for general-purpose computing. It is the foundation for all modern GPU-accelerated AI/ML workloads.

Compute Capability Explained Compute Capability is a version number indicating which hardware features a GPU supports. Higher versions unlock newer optimizations and instruction sets.

Compute Capability by Architecture

CCArchitectureYearExample GPUsKey AI Features
7.0Volta2017V1001st gen Tensor Cores
7.5Turing2018RTX 2080, T4INT8 inference
8.0Ampere2020A1003rd gen Tensor Cores, TF32
8.6Ampere2021RTX 3090Consumer Ampere
8.9Ada Lovelace2022RTX 4090, L40SFP8, Transformer Engine
9.0Hopper2023H100, H2004th gen Tensor Cores

Why CC Matters for AI

Checking Your Compute Capability

import torch
device = torch.cuda.current_device()
cc = torch.cuda.get_device_capability(device)
print(f"Compute Capability: {cc[0]}.{cc[1]}")
cudacompute capabilitynvidia

Related Topics

Explore 500+ Semiconductor & AI Topics

From EUV lithography to CUDA optimization — search the full knowledge base or chat with our AI assistant.