A10 vs L40


Side-by-side GPU comparison: specs, memory, compute performance, and live cloud pricing.

Verdict

The L40 has 24 GB more VRAM, making it better suited to large models and long context windows. For compute-bound workloads such as training, it delivers 2.9× higher FP16 throughput. At $0.99/hr versus $1.29/hr, it is also the more cost-efficient choice for inference. Finally, the L40 fits a broader slice of this catalog (624 models vs. 566), giving more flexibility.

Specifications

                   A10         L40
VRAM               24 GB       48 GB
VRAM Type          GDDR6       GDDR6
Memory Bandwidth   0.6 TB/s    0.9 TB/s
FP16 Performance   63 TFLOPS   181 TFLOPS
Manufacturer       NVIDIA      NVIDIA
FP8 Support        No          Yes
FP4 Support        No          No

Price / Performance

Based on cheapest single-GPU on-demand pricing. Lower $/TFLOP = better compute value; lower $/GB = better memory value.

                           A10       L40
$/hr (cheapest)            $1.29     $0.99 ✓ best
$/TFLOP (compute value)    $0.0206   $0.0055 ✓ best
$/GB VRAM (memory value)   $0.0537   $0.0206 ✓ best
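The two value metrics above are simple ratios of hourly price to the specs table: price divided by FP16 TFLOPS, and price divided by VRAM capacity. A minimal sketch (function and variable names are illustrative, not part of any advisor API):

```python
def dollars_per_tflop(price_per_hr: float, fp16_tflops: float) -> float:
    """Hourly price per unit of FP16 throughput; lower = better compute value."""
    return price_per_hr / fp16_tflops

def dollars_per_gb(price_per_hr: float, vram_gb: float) -> float:
    """Hourly price per GB of VRAM; lower = better memory value."""
    return price_per_hr / vram_gb

# L40 figures from the tables above: $0.99/hr, 181 TFLOPS FP16, 48 GB VRAM
print(round(dollars_per_tflop(0.99, 181), 4))  # 0.0055
print(round(dollars_per_gb(0.99, 48), 4))      # 0.0206
```

Note the metrics answer different questions: $/TFLOP favors the cheaper-per-compute card for training-style workloads, while $/GB favors the card that holds larger models per dollar.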

Cloud Pricing

Cheapest on-demand price per provider (single GPU).

A10

Provider          On-demand   Spot
Lambda            $1.29/hr    —
Microsoft Azure   $3.20/hr    $0.80/hr

L40

Provider   On-demand   Spot
RunPod     $0.99/hr    $0.50/hr

Model Compatibility

Models from the catalog that fit on each GPU, grouped by required precision.

A10 (566 models)

L40 (624 models)
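Whether a model "fits" on a GPU comes down mostly to VRAM: weight memory is roughly parameter count times bytes per parameter for the required precision, plus headroom for KV cache and activations. A rough sketch of that rule of thumb (the ~20% overhead factor is an assumption for illustration, not the catalog's exact fit criterion):

```python
# Approximate bytes per parameter at common inference precisions
BYTES_PER_PARAM = {"fp16": 2.0, "fp8": 1.0, "int4": 0.5}

def fits_in_vram(params_billions: float, precision: str, vram_gb: float,
                 overhead: float = 1.2) -> bool:
    """Rough fit check: weight size (1B params at 1 byte/param ~= 1 GB),
    padded ~20% for KV cache and activations, compared against VRAM."""
    weight_gb = params_billions * BYTES_PER_PARAM[precision]
    return weight_gb * overhead <= vram_gb

# A 13B model at FP16 needs ~26 GB of weights alone:
print(fits_in_vram(13, "fp16", 24))  # False on a 24 GB A10
print(fits_in_vram(13, "fp16", 48))  # True on a 48 GB L40
```

This also hints at why the L40's count is higher: its extra 24 GB admits larger models, and its FP8 support (which the A10 lacks, per the specs table) lets some models run at half the weight footprint of FP16.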


Pricing data refreshed hourly · Last updated April 11, 2026