NVIDIA® TESLA® P100 GPU ACCELERATOR
The NVIDIA Tesla P100 is the most advanced data center accelerator ever built, leveraging the groundbreaking NVIDIA Pascal™ GPU architecture to deliver the world’s fastest compute node. It’s powered by four innovative technologies with huge jumps in performance for HPC and deep learning workloads.

The Tesla P100 also features NVIDIA NVLink™ technology that enables superior strong-scaling performance for HPC and hyperscale applications. Up to eight Tesla P100 GPUs interconnected in a single node can deliver the performance of racks of commodity CPU servers.

TESLA P100 AND NVLINK DELIVERS UP TO 50X PERFORMANCE BOOST FOR DATA CENTER APPLICATIONS

[Figure: NVIDIA Tesla P100 Performance. Vertical axis: Application Speed-up over ... (0X to 50X). Series: 2x K80 (M40 for Alexnet), 2X P100, 4X P100, 8X P100.]

SPECIFICATIONS

GPU Architecture                NVIDIA Pascal
NVIDIA CUDA® Cores              3584
Double-Precision Performance    5.3 TeraFLOPS
Single-Precision Performance    10.6 TeraFLOPS
Half-Precision Performance      21.2 TeraFLOPS
GPU Memory                      16 GB CoWoS HBM2
Memory Bandwidth                732 GB/s
Interconnect                    NVIDIA NVLink
Max Power Consumption           300 W
ECC                             Native support with no capacity or performance overhead
Form Factor                     SXM2
Compute APIs                    NVIDIA CUDA, DirectCompute, OpenCL™, OpenACC

TeraFLOPS measurements with NVIDIA GPU Boost™ technology
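
The three peak-performance figures in the specifications are related by simple arithmetic. The sketch below is an illustration, not NVIDIA's measurement method. It assumes each CUDA core completes one fused multiply-add (counted as 2 floating-point operations) per clock; that assumption is not stated in this datasheet. Under it, the quoted single-precision peak implies a boost clock, and the double- and half-precision figures follow as fixed ratios of the single-precision one. The function and constant names are hypothetical.

```python
# Figures taken from the specification table.
CUDA_CORES = 3584
PEAK_TFLOPS = {"double": 5.3, "single": 10.6, "half": 21.2}

# Assumption (not stated in the datasheet): each CUDA core retires one
# fused multiply-add per clock, counted as 2 floating-point operations.
FLOPS_PER_CORE_PER_CLOCK = 2


def implied_boost_clock_ghz(single_tflops: float, cores: int) -> float:
    """Clock (GHz) that would yield the quoted single-precision peak."""
    flops_per_clock = cores * FLOPS_PER_CORE_PER_CLOCK
    return single_tflops * 1e12 / flops_per_clock / 1e9


def throughput_ratios(peaks: dict) -> dict:
    """Each precision's peak relative to the single-precision peak."""
    return {name: value / peaks["single"] for name, value in peaks.items()}


if __name__ == "__main__":
    clock = implied_boost_clock_ghz(PEAK_TFLOPS["single"], CUDA_CORES)
    print(f"Implied boost clock: {clock:.2f} GHz")
    for name, ratio in throughput_ratios(PEAK_TFLOPS).items():
        print(f"{name:>6}: {ratio:.1f}x single precision")
```

With the table's numbers, the implied clock comes out just under 1.5 GHz, and the double- and half-precision peaks are one half and twice the single-precision peak respectively.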
[Figures: bar charts comparing K40, M40, and P100. Axes: Teraflops (FP32/FP16), with separate P100 (FP32) and P100 (FP16) bars; Bi-directional BW (GB/Sec); Addressable Memory (GB).]