Question 1

Which is better for AI inference: L40 or L40S?

Accepted Answer

For compute-bound workloads like training, L40S delivers 2.0× higher FP16 throughput. At $0.60/hr vs $0.82/hr, L40S is the more cost-efficient choice for inference.

Question 2

How much VRAM does the L40 have compared to the L40S?

Accepted Answer

The L40 has 48 GB of VRAM (GDDR6), while the L40S has 48 GB (GDDR6).

Question 3

Which GPU is cheaper to rent in the cloud, the L40 or L40S?

Accepted Answer

The cheapest on-demand price for the L40 is $0.82/hr, while the L40S starts at $0.60/hr. L40S is the more affordable option.

	L40	L40S
VRAM	48 GB	48 GB
VRAM Type	GDDR6	GDDR6
Memory Bandwidth	0.9 TB/s	0.9 TB/s
FP16 Performance	181 TFLOPS	366 TFLOPS
Manufacturer	NVIDIA	NVIDIA
FP8 Support	Yes	Yes
FP4 Support	No	No

	L40	L40S
$/hr (cheapest)	$0.82	$0.60✓ best
$/TFLOP (compute value)	$0.0045	$0.0016✓ best
$/GB VRAM (memory value)	$0.0171	$0.0125✓ best

Provider	On-demand	Spot	Rent
Vast.ai	$0.60/hr	—	Rent
RunPod	$0.99/hr	$0.86/hr	Rent
Nebius	$1.55/hr	$0.75/hr
Amazon Web Services	$1.86/hr	$0.53/hr

L40 vs L40S

Verdict

Specifications

Price / Performance

Cloud Pricing

L40

L40S

Model Compatibility

L40 (1000 models)

L40S (1000 models)

You might also compare…