Question 1

Which is better for AI inference: K80 or P4?

Accepted Answer

The K80 has 12 GB of VRAM, 4 GB more than the P4's 8 GB, which makes it better suited to large models and long context windows. For compute-bound workloads such as training, the K80 delivers about 1.6× the FP16 throughput of the P4. At $0.18/hr versus $0.60/hr, the K80 is also the more cost-efficient choice for inference.

Question 2

How much VRAM does the K80 have compared to the P4?

Accepted Answer

The K80 has 12 GB of VRAM (GDDR5), while the P4 has 8 GB (GDDR5).

Question 3

Which GPU is cheaper to rent in the cloud, the K80 or P4?

Accepted Answer

The cheapest on-demand price for the K80 is $0.18/hr, while the P4 starts at $0.60/hr. The K80 is the more affordable option, at less than a third of the P4's hourly rate.

Specification	K80	P4
VRAM	12 GB	8 GB
VRAM Type	GDDR5	GDDR5
Memory Bandwidth	0.2 TB/s	0.2 TB/s
FP16 Performance	9 TFLOPS	6 TFLOPS
Manufacturer	NVIDIA	NVIDIA
FP8 Support	No	No
FP4 Support	No	No

Metric	K80	P4
$/hr (cheapest)	$0.18 (best)	$0.60
$/TFLOP (compute value)	$0.0206 (best)	$0.1091
$/GB VRAM (memory value)	$0.0150 (best)	$0.0750
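
The value metrics in the table above are simple ratios of the hourly price to a spec figure. The sketch below recomputes them from the numbers listed on this page. The dictionary layout and the function names `value_metrics` and `implied_tflops` are illustrative, not part of any pricing API. It assumes each metric is the cheapest hourly price divided by VRAM or by FP16 throughput.

```python
# Figures copied from the tables on this page; the structure and names are illustrative.
GPUS = {
    "K80": {"price_per_hr": 0.18, "vram_gb": 12, "fp16_tflops": 9},
    "P4": {"price_per_hr": 0.60, "vram_gb": 8, "fp16_tflops": 6},
}


def value_metrics(spec):
    """Return (dollars per GB of VRAM, dollars per FP16 TFLOP) for one GPU."""
    per_gb = spec["price_per_hr"] / spec["vram_gb"]
    per_tflop = spec["price_per_hr"] / spec["fp16_tflops"]
    return per_gb, per_tflop


def implied_tflops(price_per_hr, dollars_per_tflop):
    """Back out the throughput that a listed $/TFLOP figure implies."""
    return price_per_hr / dollars_per_tflop


for name, spec in GPUS.items():
    per_gb, per_tflop = value_metrics(spec)
    print(f"{name}: ${per_gb:.4f}/GB VRAM, ${per_tflop:.4f}/TFLOP")
```

With these inputs the $/GB figures come out to $0.0150 and $0.0750, matching the table. The $/TFLOP figures come out to $0.0200 and $0.1000, slightly below the listed $0.0206 and $0.1091. A plausible explanation, though only an assumption, is that the page divides by unrounded throughput: `implied_tflops` gives roughly 8.7 TFLOPS for the K80 and 5.5 TFLOPS for the P4, whose ratio is about 1.6×, consistent with the figure quoted in the first answer.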

K80 vs P4
