AMD Radeon R9 380 OEM vs NVIDIA Tesla P100 SXM2
Graphics Processor
| GPU model | Antigua | GP100 |
|---|---|---|
| GPU variant | Antigua PRO (215-0877000) | GP100-890-A1 |
| Architecture | GCN 3.0 | Pascal |
| Foundry | TSMC | TSMC |
| Manufacturing process | 28 nm | 16 nm |
| Number of transistors | 5,000 million | 15,300 million |
| Die size | 366 mm² | 610 mm² |
Graphics Card
| Release date | May 5th, 2015 | Apr 5th, 2016 |
|---|---|---|
| Generation | Pirate Islands | Tesla |
| Production | End-of-life | End-of-life |
| Interface | PCIe 3.0 x16 | PCIe 3.0 x16 |
| Predecessor | Volcanic Islands | — |
| Successor | Arctic Islands | — |
Clocks
| GPU clock | 918 MHz | — |
|---|---|---|
| Memory clock | 1375 MHz 5.5 Gbps effective | 715 MHz 1430 Mbps effective |
| Base clock | — | 1328 MHz |
| Boost clock | — | 1480 MHz |
Memory Configuration
| Memory size | 4 GB | 16 GB |
|---|---|---|
| Memory type | GDDR5 | HBM2 |
| Memory bus width | 256 bit | 4096 bit |
| Bandwidth | 176.0 GB/s | 732.2 GB/s |
Render Configuration
| Shading units | 1792 | 3584 |
|---|---|---|
| TMUs | 112 | 224 |
| ROPs | 32 | 96 |
| Compute units | 28 | — |
| Cache L1 | 16 KB (per CU) | 24 KB (per SM) |
| Cache L2 | 512 KB | 4 MB |
| SM count | — | 56 |
Performance
| Pixel rate | 29.38 GPixel/s | 142.1 GPixel/s |
|---|---|---|
| Texture rate | 102.8 GTexel/s | 331.5 GTexel/s |
| FP16 (half) performance | 3.290 TFLOPS (1:1) | 21.22 TFLOPS (2:1) |
| FP32 (float) performance | 3.290 TFLOPS | 10.61 TFLOPS |
| FP64 (double) performance | 205.6 GFLOPS (1:16) | 5.304 TFLOPS (1:2) |
Dimensions & Outputs
| Slot width | Dual-slot | — |
|---|---|---|
| Length | 221 mm 8.7 inches | — |
| Width | 111 mm 4.4 inches | — |
| TDP | 190 W | 300 W |
| Suggested PSU | 450 W | 700 W |
| Outputs | 2x DVI1x HDMI1x DisplayPort | No outputs |
| Power connectors | 2x 6-pin | None |
| Board number | C766 | — |
API Support & Features
| DirectX | 12 (12_0) | 12 (12_1) |
|---|---|---|
| OpenGL | 4.6 | 4.6 |
| OpenCL | 2.0 | 3.0 |
| Vulkan | 1.2 | 1.2 |
| Shader model | 6.3 | 6.4 |
| CUDA | — | 6.0 |