NVIDIA Tesla M4 vs NVIDIA Tesla M40
Graphics Processor
| GPU model | GM206 | GM200 |
|---|---|---|
| Architecture | Maxwell 2.0 | Maxwell 2.0 |
| Foundry | TSMC | TSMC |
| Manufacturing process | 28 nm | 28 nm |
| Number of transistors | 2,940 million | 8,000 million |
| Die size | 228 mm² | 601 mm² |
| GPU variant | — | GM200-895-A1 |
Graphics Card
| Release date | Nov 10th, 2015 | Nov 10th, 2015 |
|---|---|---|
| Generation | Tesla | Tesla |
| Production | End-of-life | End-of-life |
| Interface | PCIe 3.0 x16 | PCIe 3.0 x16 |
Clocks
| Base clock | 872 MHz | 948 MHz |
|---|---|---|
| Boost clock | 1072 MHz | 1112 MHz |
| Memory clock | 1375 MHz 5.5 Gbps effective | 1502 MHz 6 Gbps effective |
Memory Configuration
| Memory size | 4 GB | 12 GB |
|---|---|---|
| Memory type | GDDR5 | GDDR5 |
| Memory bus width | 128 bit | 384 bit |
| Bandwidth | 88.00 GB/s | 288.4 GB/s |
Render Configuration
| Shading units | 1024 | 3072 |
|---|---|---|
| TMUs | 64 | 192 |
| ROPs | 32 | 96 |
| SMM count | 8 | 24 |
| Cache L1 | 48 KB (per SMM) | 48 KB (per SMM) |
| Cache L2 | 1024 KB | 3 MB |
Performance
| Pixel rate | 34.30 GPixel/s | 106.8 GPixel/s |
|---|---|---|
| Texture rate | 68.61 GTexel/s | 213.5 GTexel/s |
| FP32 (float) performance | 2.195 TFLOPS | 6.832 TFLOPS |
| FP64 (double) performance | 68.61 GFLOPS (1:32) | 213.5 GFLOPS (1:32) |
Dimensions & Outputs
| Slot width | Single-slot | Dual-slot |
|---|---|---|
| TDP | 50 W | 250 W |
| Suggested PSU | 250 W | 600 W |
| Outputs | No outputs | No outputs |
| Length | — | 267 mm 10.5 inches |
| Power connectors | — | 1x 6-pin + 1x 8-pin |
| Board number | — | PG600 SKU 202 |
API Support & Features
| DirectX | 12 (12_1) | 12 (12_1) |
|---|---|---|
| OpenGL | 4.6 | 4.6 |
| OpenCL | 3.0 | 3.0 |
| Vulkan | 1.1 | 1.1 |
| CUDA | 5.2 | 5.2 |
| Shader model | 6.4 | 6.4 |