MIG Profile Comparison

Multi-Instance GPU (MIG) profiles are fixed partitions of a physical NVIDIA A100 80 GB GPU. Each profile gives a job a dedicated slice of the GPU's compute and memory resources, fully isolated from other workloads running on the same GPU.

On Wulver, the following MIG profiles are supported:

  • 10gb – 10 GB memory
  • 20gb – 20 GB memory
  • 40gb – 40 GB memory
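As a rough illustration of how this list might be used, the sketch below picks the smallest option whose memory covers a job's estimated need. The `full` label for the whole 80 GB GPU, the function name, and the 10% headroom are assumptions made for this example, not Wulver settings. The headroom is there because the memory actually available to applications is somewhat lower than the nominal size.

```python
# Nominal memory per option in GB. "full" is a hypothetical label
# used here for the unpartitioned A100 80 GB GPU.
NOMINAL_MEMORY_GB = {"10gb": 10, "20gb": 20, "40gb": 40, "full": 80}


def smallest_fitting_profile(required_gb, headroom=0.10):
    """Return the smallest option that covers required_gb, or None.

    headroom is the fraction of nominal memory kept in reserve
    (an assumed safety margin, not a documented figure).
    """
    # Walk the options from smallest to largest nominal memory.
    for name, nominal in sorted(NOMINAL_MEMORY_GB.items(), key=lambda kv: kv[1]):
        if required_gb <= nominal * (1.0 - headroom):
            return name
    return None


if __name__ == "__main__":
    for need in (8, 15, 30, 60, 100):
        print(f"{need} GB needed -> {smallest_fitting_profile(need)}")
```

Choosing the smallest option that fits keeps the rest of the GPU free for other jobs. A result of `None` means the estimate exceeds even the full GPU under the assumed headroom.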

The table below summarizes the hardware characteristics of each MIG profile available on Wulver, alongside the full NVIDIA A100 80 GB GPU. It lists memory capacity, compute resources, and other architectural limits so users can quickly compare performance and capability across profiles.

| Specification | 10gb | 20gb | 40gb | Full 80GB | Notes |
|---|---|---|---|---|---|
| Device Name | A100 MIG 10gb | A100 MIG 20gb | A100 MIG 40gb | A100-SXM4-80GB | GPU profile |
| SU Usage Factor | 2 | 4 | 8 | 16 | Service units |
| Global Memory | 10.2 GB | 20.9 GB | 42.4 GB | 85.2 GB | Raw hardware memory |
| Usable Memory | ~9.5 GB | ~20 GB | ~40 GB | ~80 GB | Available for applications |
| Multiprocessors (SMs) | 14 | 28 | 42 | 108 | Parallel compute units |
| Relative Compute Power | 1x | 2x | 3x | 7.7x | SM count relative to the 10gb profile |
| Total Parallel Threads | 28672 | 57344 | 86016 | 221184 | SMs × max threads per SM |
| Memory Bus Width | 640 bits | 1280 bits | 2560 bits | 5120 bits | Memory interface width |
| Memory Bandwidth | ~0.25 TB/s | ~0.51 TB/s | ~1.02 TB/s | ~2.04 TB/s | Theoretical peak: bandwidth (B/s) = 2 × memory clock (Hz) × bus width (bits) ÷ 8, where the factor 2 accounts for double-data-rate memory |
| L2 Cache Size | 5 MB | 10 MB | 20 MB | 41 MB | Fast memory cache |
| Async Engines | 1 | 2 | 3 | 5 | Concurrent operations |
| Max Threads per Block | 1024 | 1024 | 1024 | 1024 | CUDA block limit |
| Max Threads per SM | 2048 | 2048 | 2048 | 2048 | Per multiprocessor |
| Memory Clock | 1593 MHz | 1593 MHz | 1593 MHz | 1593 MHz | Memory frequency |
| Clock Rate | 1410 MHz | 1410 MHz | 1410 MHz | 1410 MHz | GPU core frequency |
| Shared Memory/Block | 49 KB | 49 KB | 49 KB | 49 KB | Per CUDA block |
| Registers per Block | 65536 | 65536 | 65536 | 65536 | Per CUDA block |
| Constant Memory | 64 KB | 64 KB | 64 KB | 64 KB | Read-only memory |
| Warp Size | 32 | 32 | 32 | 32 | SIMD execution width |
| ECC Support | Yes | Yes | Yes | Yes | Error correction |
| Unified Memory | Yes | Yes | Yes | Yes | CPU-GPU memory sharing |
| Concurrent Kernels | Yes | Yes | Yes | Yes | Multiple kernel execution |
| Max Grid Dimensions | 2³¹-1 × 65535 × 65535 | 2³¹-1 × 65535 × 65535 | 2³¹-1 × 65535 × 65535 | 2³¹-1 × 65535 × 65535 | CUDA grid limits |
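
The thread-count and relative-compute rows follow directly from the SM counts, and a short sketch makes the arithmetic explicit. The dictionary and function names are illustrative, and `full` is a hypothetical label for the unpartitioned GPU. The last helper, SMs per unit of SU usage factor, is only a rough comparison aid derived from the table, not an official metric.

```python
# Values copied from the table above; "full" is a hypothetical label
# for the unpartitioned A100 80 GB GPU.
SM_COUNT = {"10gb": 14, "20gb": 28, "40gb": 42, "full": 108}
SU_FACTOR = {"10gb": 2, "20gb": 4, "40gb": 8, "full": 16}
MAX_THREADS_PER_SM = 2048


def total_threads(profile):
    """Total resident threads: SM count times max threads per SM."""
    return SM_COUNT[profile] * MAX_THREADS_PER_SM


def relative_compute(profile):
    """SM count relative to the smallest (10gb) profile."""
    return SM_COUNT[profile] / SM_COUNT["10gb"]


def sms_per_su_factor(profile):
    """SMs obtained per unit of SU usage factor (rough comparison only)."""
    return SM_COUNT[profile] / SU_FACTOR[profile]


if __name__ == "__main__":
    for name in SM_COUNT:
        print(
            f"{name}: {total_threads(name)} threads, "
            f"{relative_compute(name):.1f}x compute, "
            f"{sms_per_su_factor(name):.2f} SMs per SU-factor unit"
        )
```

Under these numbers, the 10gb and 20gb profiles both provide 7 SMs per unit of SU usage factor, while the 40gb profile provides 5.25. The larger profile is therefore mainly worth choosing when a job needs the extra memory.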