Tesla T4 vs Tesla P40
Aggregate performance score
We've compared Tesla P40 and Tesla T4, covering specs and all relevant benchmarks.
P40 outperforms T4 by a moderate 11% based on our aggregate benchmark results.
Primary details
GPU architecture, market segment, value for money and other general parameters compared.
| Place in the ranking | 229 | 254 | 
| Place by popularity | not in top-100 | not in top-100 | 
| Cost-effectiveness evaluation | 0.99 | no data | 
| Power efficiency | 8.63 | 27.68 | 
| Architecture | Pascal (2016−2021) | Turing (2018−2022) | 
| GPU code name | GP102 | TU104 | 
| Market segment | Workstation | Workstation | 
| Release date | 13 September 2016 (9 years ago) | 13 September 2018 (7 years ago) | 
| Launch price (MSRP) | $5,699 | no data | 
Cost-effectiveness evaluation
The higher the ratio, the better. We use the manufacturer's recommended prices.
Performance to price scatter graph
Detailed specifications
General parameters such as number of shaders, GPU core base clock and boost clock speeds, manufacturing process, texturing and calculation speed. Note that power consumption of some graphics cards can well exceed their nominal TDP, especially when overclocked.
| Pipelines / CUDA cores | 3840 | 2560 | 
| Core clock speed | 1303 MHz | 585 MHz | 
| Boost clock speed | 1531 MHz | 1590 MHz | 
| Number of transistors | 11,800 million | 13,600 million | 
| Manufacturing process technology | 16 nm | 12 nm | 
| Power consumption (TDP) | 250 Watt | 70 Watt | 
| Texture fill rate | 367.4 | 254.4 | 
| Floating-point processing power | 11.76 TFLOPS | 8.141 TFLOPS | 
| ROPs | 96 | 64 | 
| TMUs | 240 | 160 | 
| Tensor Cores | no data | 320 | 
| Ray Tracing Cores | no data | 40 | 
| L1 Cache | 1.4 MB | 2.5 MB | 
| L2 Cache | 3 MB | 4 MB | 
Form factor & compatibility
Information on compatibility with other computer components. Useful when choosing a future computer configuration or upgrading an existing one. For desktop graphics cards it's interface and bus (motherboard compatibility), additional power connectors (power supply compatibility).
| Interface | PCIe 3.0 x16 | PCIe 3.0 x16 | 
| Length | 267 mm | 168 mm | 
| Width | 2-slot | 1-slot | 
| Supplementary power connectors | 8-pin EPS | None | 
VRAM capacity and type
Parameters of VRAM installed: its type, size, bus, clock and resulting bandwidth. Integrated GPUs have no dedicated video RAM and use a shared part of system RAM.
| Memory type | GDDR5 | GDDR6 | 
| Maximum RAM amount | 24 GB | 16 GB | 
| Memory bus width | 384 Bit | 256 Bit | 
| Memory clock speed | 1808 MHz | 1250 MHz | 
| Memory bandwidth | 347.1 GB/s | 320.0 GB/s | 
Connectivity and outputs
This section shows the types and number of video connectors on each GPU. The data applies specifically to desktop reference models (for example, NVIDIA’s Founders Edition). OEM partners often modify both the number and types of ports. On notebook GPUs, video‐output options are determined by the laptop’s design rather than the graphics chip itself.
| Display Connectors | No outputs | No outputs | 
API and SDK support
List of supported 3D and general-purpose computing APIs, including their specific versions.
| DirectX | 12 (12_1) | 12 Ultimate (12_1) | 
| Shader Model | 6.7 | 6.5 | 
| OpenGL | 4.6 | 4.6 | 
| OpenCL | 3.0 | 1.2 | 
| Vulkan | 1.3 | 1.2.131 | 
| CUDA | 6.1 | 7.5 | 
| DLSS | - | + | 
Synthetic benchmarks
Non-gaming benchmark results comparison. The combined score is measured on a 0-100 point scale.
Combined synthetic benchmark score
This is our combined benchmark score.
Passmark
This is the most ubiquitous GPU benchmark. It gives the graphics card a thorough evaluation under various types of load, providing four separate benchmarks for Direct3D versions 9, 10, 11 and 12 (the last being done in 4K resolution if possible), and few more tests engaging DirectCompute capabilities.
Gaming performance
Let's see how good the compared graphics cards are for gaming. Particular gaming benchmark results are measured in FPS.
Pros & cons summary
| Performance score | 26.71 | 23.99 | 
| Recency | 13 September 2016 | 13 September 2018 | 
| Maximum RAM amount | 24 GB | 16 GB | 
| Chip lithography | 16 nm | 12 nm | 
| Power consumption (TDP) | 250 Watt | 70 Watt | 
Tesla P40 has a 11.3% higher aggregate performance score, and a 50% higher maximum VRAM amount.
Tesla T4, on the other hand, has an age advantage of 2 years, a 33.3% more advanced lithography process, and 257.1% lower power consumption.
The Tesla P40 is our recommended choice as it beats the Tesla T4 in performance tests.
Other comparisons
We selected several comparisons of graphics cards with performance close to those reviewed, providing you with more options to consider.



