GPU model | G96 | GA102 |
---|---|---|
Architecture | Tesla | Ampere |
Foundry | UMC | TSMC |
Manufacturing process | 65 nm | 7 nm |
Number of transistors | 314 million | 40,000 million |
Die size | 144 mm² | 627 mm² |
Release date | Jul 29th, 2008 | Unknown |
---|---|---|
Generation | GeForce 9 | Quadro RTX |
Production | End-of-life | Active |
Interface | PCIe 2.0 x16 | PCIe 4.0 x16 |
Reviews | 22 in our database | — |
Predecessor | GeForce 8 | — |
Successor | GeForce 200 | — |
GPU clock | 600 MHz | — |
---|---|---|
Shader clock | 1500 MHz | — |
Memory clock | 1000 MHz 2 Gbps effective | 1750 MHz 14000 MHz effective |
Base clock | — | 1110 MHz |
Boost clock | — | 1500 MHz |
Memory size | 512 MB | 48 GB |
---|---|---|
Memory type | GDDR3 | GDDR6 |
Memory bus width | 128 bit | 384 bit |
Bandwidth | 32.00 GB/s | 672.0 GB/s |
Shading units | 32 | 7552 |
---|---|---|
TMUs | 16 | 472 |
ROPs | 8 | 96 |
SM count | 4 | 118 |
Cache L2 | 32 KB | 6 MB |
Tensor cores | — | 472 |
RT cores | — | 118 |
Cache L1 | — | 64 KB (per SM) |
Pixel rate | 4.800 GPixel/s | 144.0 GPixel/s |
---|---|---|
Texture rate | 9.600 GTexel/s | 708.0 GTexel/s |
FP32 (float) performance | 96.00 GFLOPS | 22.66 TFLOPS |
FP16 (half) performance | — | 45.31 TFLOPS (2:1) |
FP64 (double) performance | — | 708.0 GFLOPS (1:32) |
Slot width | Single-slot | Dual-slot |
---|---|---|
TDP | 50 W | 260 W |
Suggested PSU | 250 W | 600 W |
Outputs | 2x DVI1x S-Video | 4x DisplayPort1x USB Type-C |
Power connectors | None | 1x 6-pin + 1x 8-pin |
Length | — | 267 mm 10.5 inches |
DirectX | 11.1 (10_0) | 12 Ultimate (12_2) |
---|---|---|
OpenGL | 3.3 | 4.6 |
OpenCL | 1.1 | 2.0 |
Vulkan | — | 1.2.140 |
CUDA | 1.1 | 8.5 |
Shader model | 4.0 | 6.5 |