Tesla T4 vs Tesla M40
Aggregate performance score
We've compared Tesla M40 and Tesla T4, covering specs and all relevant benchmarks. In every table below, the first value column is Tesla M40 and the second is Tesla T4.
Tesla T4 outperforms Tesla M40 by a minimal 2.7% based on our aggregate benchmark results.
Primary details
GPU architecture, market segment, value for money and other general parameters compared.
Place in the ranking | 204 | 200 |
Place by popularity | not in top-100 | not in top-100 |
Power efficiency | 7.44 | 27.29 |
Architecture | Maxwell 2.0 (2014−2019) | Turing (2018−2022) |
GPU code name | GM200 | TU104 |
Market segment | Workstation | Workstation |
Release date | 10 November 2015 (9 years ago) | 13 September 2018 (6 years ago) |
Detailed specifications
General parameters such as number of shaders, GPU core base clock and boost clock speeds, manufacturing process, texturing and calculation speed. Note that power consumption of some graphics cards can well exceed their nominal TDP, especially when overclocked.
Pipelines / CUDA cores | 3072 | 2560 |
Core clock speed | 948 MHz | 585 MHz |
Boost clock speed | 1112 MHz | 1590 MHz |
Number of transistors | 8,000 million | 13,600 million |
Manufacturing process technology | 28 nm | 12 nm |
Power consumption (TDP) | 250 Watt | 70 Watt |
Texture fill rate | 213.5 GTexel/s | 254.4 GTexel/s |
Floating-point processing power | 6.832 TFLOPS | 8.141 TFLOPS |
ROPs | 96 | 64 |
TMUs | 192 | 160 |
Tensor Cores | no data | 320 |
Ray Tracing Cores | no data | 40 |
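The texture fill rate and floating-point figures above are consistent with the usual theoretical-peak formulas: one texel per TMU per clock, and two floating-point operations (one fused multiply-add) per CUDA core per clock, both at boost clock. The sketch below is a minimal illustration under that assumption; the function names are hypothetical, and GTexel/s is assumed as the fill-rate unit.

```python
def texture_fill_rate_gtexels(tmus: int, boost_mhz: float) -> float:
    """Peak texture fill rate in GTexel/s: one texel per TMU per clock."""
    return tmus * boost_mhz / 1000.0


def fp32_tflops(cuda_cores: int, boost_mhz: float) -> float:
    """Peak FP32 throughput in TFLOPS: 2 FLOPs (one FMA) per core per clock."""
    return 2 * cuda_cores * boost_mhz / 1_000_000.0


# Shader counts, TMU counts and boost clocks taken from the table above.
cards = {
    "Tesla M40": {"cores": 3072, "tmus": 192, "boost_mhz": 1112},
    "Tesla T4": {"cores": 2560, "tmus": 160, "boost_mhz": 1590},
}

for name, c in cards.items():
    fill = texture_fill_rate_gtexels(c["tmus"], c["boost_mhz"])
    tflops = fp32_tflops(c["cores"], c["boost_mhz"])
    print(f"{name}: {fill:.1f} GTexel/s, {tflops:.3f} TFLOPS")
```

These are theoretical peaks at boost clock; sustained throughput in real workloads is typically lower.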
Form factor & compatibility
Information on compatibility with other computer components. Useful when choosing a future computer configuration or upgrading an existing one. For desktop graphics cards, this covers the interface and bus (motherboard compatibility) and additional power connectors (power supply compatibility).
Interface | PCIe 3.0 x16 | PCIe 3.0 x16 |
Length | 267 mm | 168 mm |
Width | 2-slot | 1-slot |
Supplementary power connectors | 8-pin EPS | None |
VRAM capacity and type
Parameters of VRAM installed: its type, size, bus, clock and resulting bandwidth. Integrated GPUs have no dedicated video RAM and use a shared part of system RAM.
Memory type | GDDR5 | GDDR6 |
Maximum RAM amount | 12 GB | 16 GB |
Memory bus width | 384 Bit | 256 Bit |
Memory clock speed | 1502 MHz | 1250 MHz |
Memory bandwidth | 288.4 GB/s | 320.0 GB/s |
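The bandwidth row can be reproduced from the bus width and memory clock. The sketch below assumes the listed clocks follow the common convention in which GDDR5 moves 4 transfers per listed clock and GDDR6 moves 8; those multipliers and the function name are assumptions for illustration, not values stated on this page.

```python
# Transfers per listed memory clock (assumed convention, see lead-in).
TRANSFERS_PER_CLOCK = {"GDDR5": 4, "GDDR6": 8}


def memory_bandwidth_gbs(bus_width_bits: int, clock_mhz: float, mem_type: str) -> float:
    """Peak memory bandwidth in GB/s.

    bytes per transfer = bus width / 8
    transfers per second = clock (MHz) * transfers per clock * 1e6
    """
    bytes_per_transfer = bus_width_bits / 8
    mega_transfers = clock_mhz * TRANSFERS_PER_CLOCK[mem_type]
    return bytes_per_transfer * mega_transfers / 1000.0


# Values taken from the table above.
print(f"Tesla M40: {memory_bandwidth_gbs(384, 1502, 'GDDR5'):.1f} GB/s")
print(f"Tesla T4:  {memory_bandwidth_gbs(256, 1250, 'GDDR6'):.1f} GB/s")
```

Under these assumptions the T4 ends up with more bandwidth despite its narrower bus, because GDDR6 transfers more data per listed clock.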
Connectivity and outputs
Types and number of video connectors present on the reviewed GPUs. As a rule, data in this section is precise only for desktop reference cards (the so-called Founders Edition for NVIDIA chips). OEM manufacturers may change the number and type of output ports, while for notebook cards the availability of certain video output ports depends on the laptop model rather than on the card itself.
Display Connectors | No outputs | No outputs |
API compatibility
List of supported 3D and general-purpose computing APIs, including their specific versions.
DirectX | 12 (12_1) | 12 Ultimate (12_1) |
Shader Model | 6.7 | 6.5 |
OpenGL | 4.6 | 4.6 |
OpenCL | 3.0 | 1.2 |
Vulkan | 1.3 | 1.2.131 |
CUDA compute capability | 5.2 | 7.5 |
Synthetic benchmark performance
Non-gaming benchmark results comparison. The combined score is measured on a 0-100 point scale.
Combined synthetic benchmark score
This is our combined benchmark score. We regularly improve our combining algorithms, but if you find any inconsistencies, feel free to speak up in the comments section; we usually fix problems quickly.
Passmark
This is the most ubiquitous GPU benchmark. It gives the graphics card a thorough evaluation under various types of load, providing four separate benchmarks for Direct3D versions 9, 10, 11 and 12 (the last being done in 4K resolution if possible), and a few more tests engaging DirectCompute capabilities.
GeekBench 5 CUDA
Geekbench 5 is a widespread graphics card benchmark that combines 11 different test scenarios. All of these scenarios rely on direct usage of the GPU's processing power; no 3D rendering is involved. This variation uses NVIDIA's CUDA API.
Gaming performance
Let's see how good the compared graphics cards are for gaming. Particular gaming benchmark results are measured in FPS.
Pros & cons summary
Performance score | 27.16 | 27.88 |
Recency | 10 November 2015 | 13 September 2018 |
Maximum RAM amount | 12 GB | 16 GB |
Chip lithography | 28 nm | 12 nm |
Power consumption (TDP) | 250 Watt | 70 Watt |
Tesla T4 has a 2.7% higher aggregate performance score, is nearly 3 years newer, has a 33.3% higher maximum VRAM amount, is built on a 12 nm process versus 28 nm for Tesla M40, and has 72% lower power consumption (70 Watt versus 250 Watt, so Tesla M40 draws 257.1% more).
Given the minimal performance differences, no clear winner can be declared between Tesla M40 and Tesla T4.
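The relative figures in the summary follow directly from the table values. The sketch below, using hypothetical helper names, shows the arithmetic and why "percent lower" and "percent higher" are not interchangeable: going from 250 Watt to 70 Watt is a 72% reduction, while 250 Watt is 257.1% more than 70 Watt.

```python
def pct_higher(a: float, b: float) -> float:
    """How much larger a is than b, as a percentage of b."""
    return (a - b) / b * 100.0


def pct_lower(a: float, b: float) -> float:
    """How much smaller a is than b, as a percentage of b."""
    return (b - a) / b * 100.0


# Values taken from the pros & cons table above.
m40 = {"score": 27.16, "vram_gb": 12, "tdp_w": 250}
t4 = {"score": 27.88, "vram_gb": 16, "tdp_w": 70}

print(f"Score: T4 is {pct_higher(t4['score'], m40['score']):.1f}% higher")
print(f"VRAM:  T4 has {pct_higher(t4['vram_gb'], m40['vram_gb']):.1f}% more")
print(f"TDP:   T4 is {pct_lower(t4['tdp_w'], m40['tdp_w']):.1f}% lower")
print(f"TDP:   M40 is {pct_higher(m40['tdp_w'], t4['tdp_w']):.1f}% higher")
```

A "percent lower" figure can never exceed 100%, which is a quick sanity check for any comparison of this kind.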
Should you still have questions about choosing between the reviewed GPUs, ask them in the comments section, and we will answer.