NVIDIA today announced the Tesla P100 with a PCI-Express interface, a slightly less powerful variant of the NVLink-based model optimized for servers. The new GPU will be available in two versions, with 16 GB and 12 GB of HBM2 memory, bringing the total number of Tesla P100 models to three.

NVIDIA launches PCI Express version of the Tesla P100 accelerator

The NVIDIA Tesla P100 accelerator is built on the flagship Pascal GP100 GPU. The full chip offers 3840 CUDA cores with 240 TMUs, although the Tesla P100 models listed in the specification table have 3584 of those cores enabled. It comes with up to 16 GB of HBM2 memory attached through eight 512-bit memory controllers, which gives a 4096-bit memory interface and a bandwidth of up to 720 GB/s. The silicon is manufactured on a 16nm FinFET process, so we can expect a dramatic improvement in performance while also reducing power consumption.
The PCIe Tesla P100 delivers 9.3 TFLOPS of single-precision performance (vs. 10.6 TFLOPS with NVLink) and 4.7 TFLOPS of double-precision performance (vs. 5.3 TFLOPS). The model with 16 GB of HBM2 memory reaches a bandwidth of 720 GB/s, while the 12 GB model reaches 540 GB/s. Both cards are passively cooled despite a TDP of 250W. NVIDIA has not said a word about the price of these two new models.
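The bandwidth figures above follow from the bus width and the per-pin memory data rate listed in the specification table. The short Python sketch below reproduces that arithmetic; the function name and dictionary layout are illustrative choices, not anything published by NVIDIA, and it assumes the quoted 720 GB/s and 540 GB/s are rounded values.

```python
def memory_bandwidth_gbs(bus_width_bits: int, data_rate_gbps: float) -> float:
    """Peak bandwidth in GB/s: bus width (bits) x per-pin rate (Gbit/s) / 8 bits per byte."""
    return bus_width_bits * data_rate_gbps / 8


# (bus width in bits, per-pin data rate in Gbps), taken from the specification table
configs = {
    "Tesla P100 16GB (HBM2)": (4096, 1.4),
    "Tesla P100 12GB (HBM2)": (3072, 1.4),
    "Tesla M40 (GDDR5)": (384, 6.0),
}

bandwidths = {name: memory_bandwidth_gbs(*cfg) for name, cfg in configs.items()}

for name, bw in bandwidths.items():
    print(f"{name}: {bw:.1f} GB/s")
```

The two P100 results come out at about 716.8 GB/s and 537.6 GB/s, slightly below the quoted 720 GB/s and 540 GB/s, which is consistent with the marketing figures being rounded. The 12 GB model loses a quarter of the bus width (3072 vs. 4096 bits), and its bandwidth drops by the same proportion.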

NVIDIA Tesla Series Specification

| Specification | Tesla P100 (NVLink) | Tesla P100 (PCIe, 16GB) | Tesla P100 (PCIe, 12GB) | Tesla M40 |
|---|---|---|---|---|
| Stream Processors | 3584 | 3584 | 3584 | 3072 |
| Core Clock | 1328MHz | ? | ? | 948MHz |
| Boost Clock(s) | 1480MHz | 1300MHz | 1300MHz | 1114MHz |
| Memory Clock | 1.4Gbps HBM2 | 1.4Gbps HBM2 | 1.4Gbps HBM2 | 6Gbps GDDR5 |
| Memory Bus Width | 4096-bit | 4096-bit | 3072-bit | 384-bit |
| Memory Bandwidth | 720GB/sec | 720GB/sec | 540GB/sec | 288GB/sec |
| VRAM | 16GB | 16GB | 12GB | 12GB |
| Half Precision | 21.2 TFLOPS | 18.7 TFLOPS | 18.7 TFLOPS | 6.8 TFLOPS |
| Single Precision | 10.6 TFLOPS | 9.3 TFLOPS | 9.3 TFLOPS | 6.8 TFLOPS |
| Double Precision | 5.3 TFLOPS (1/2 rate) | 4.7 TFLOPS (1/2 rate) | 4.7 TFLOPS (1/2 rate) | (1/32 rate) |
| GPU | GP100 | GP100 | GP100 | GM200 |
| Transistor Count | 15.3B | 15.3B | 15.3B | 8B |
| TDP | 300W | 250W | 250W | 250W |
| Form Factor | Mezzanine | PCIe | PCIe | PCIe |
| Cooling | N/A | Passive | Passive | Passive |
| Process Node | TSMC 16nm FinFET | TSMC 16nm FinFET | TSMC 16nm FinFET | TSMC 28nm |
| Architecture | Pascal | Pascal | Pascal | Maxwell 2 |
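
The single- and double-precision figures in the table can be approximated from the stream processor count and boost clock. The Python sketch below assumes each stream processor retires one fused multiply-add (counted as 2 floating-point operations) per clock at the boost frequency, and applies the double-precision rate from the table. The function and variable names are illustrative only.

```python
def peak_tflops(stream_processors: int, boost_mhz: float,
                flops_per_clock: float = 2.0) -> float:
    """Theoretical peak in TFLOPS, assuming one FMA (2 FLOPs) per processor per clock."""
    return stream_processors * flops_per_clock * boost_mhz * 1e6 / 1e12


# (stream processors, boost clock in MHz, double-precision rate), from the table
cards = {
    "Tesla P100 (NVLink)": (3584, 1480, 1 / 2),
    "Tesla P100 (PCIe)": (3584, 1300, 1 / 2),
    "Tesla M40": (3072, 1114, 1 / 32),
}

results = {}
for name, (sps, boost, fp64_rate) in cards.items():
    fp32 = peak_tflops(sps, boost)
    # Double precision is derived by scaling the single-precision peak
    results[name] = (fp32, fp32 * fp64_rate)

for name, (fp32, fp64) in results.items():
    print(f"{name}: FP32 {fp32:.1f} TFLOPS, FP64 {fp64:.1f} TFLOPS")
```

Under these assumptions the estimates round to the table's 10.6 and 9.3 TFLOPS single precision and 5.3 and 4.7 TFLOPS double precision for the two P100 configurations. The Tesla M40's double-precision estimate is derived here from its 1/32 rate and is not a figure quoted in the article.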