Question 1

Which is better for AI inference: H100 SXM or L40S?

Accepted Answer

H100 SXM has 32 GB more VRAM, making it better suited for large models and long context windows. For compute-bound workloads like training, H100 SXM delivers 2.7× higher FP16 throughput. At $0.60/hr vs $1.80/hr, L40S is the more cost-efficient choice for inference.

Question 2

How much VRAM does the H100 SXM have compared to the L40S?

Accepted Answer

The H100 SXM has 80 GB of VRAM (HBM3), while the L40S has 48 GB (GDDR6).

Question 3

Which GPU is cheaper to rent in the cloud, the H100 SXM or L40S?

Accepted Answer

The cheapest on-demand price for the H100 SXM is $1.80/hr, while the L40S starts at $0.60/hr. L40S is the more affordable option.

	H100 SXM	L40S
VRAM	80 GB	48 GB
VRAM Type	HBM3	GDDR6
Memory Bandwidth	3.4 TB/s	0.9 TB/s
FP16 Performance	990 TFLOPS	366 TFLOPS
Manufacturer	NVIDIA	NVIDIA
FP8 Support	Yes	Yes
FP4 Support	No	No

	H100 SXM	L40S
$/hr (cheapest)	$1.80	$0.60✓ best
$/TFLOP (compute value)	$0.0018	$0.0016✓ best
$/GB VRAM (memory value)	$0.0225	$0.0125✓ best

Provider	On-demand	Spot	Rent
Vast.ai	$1.80/hr	—	Rent
RunPod	$2.89/hr	$2.39/hr	Rent
Nebius	$2.95/hr	$1.25/hr
Lambda	$3.29/hr	—
Google Cloud	$4.20/hr	$0.98/hr
Amazon Web Services	$6.88/hr	$2.81/hr
Microsoft Azure	$6.98/hr	$6.98/hr

Provider	On-demand	Spot	Rent
Vast.ai	$0.60/hr	—	Rent
RunPod	$0.86/hr	$0.86/hr	Rent
Nebius	$1.55/hr	$0.75/hr
Amazon Web Services	$1.86/hr	$0.36/hr

H100 SXM vs L40S

Verdict

Specifications

Price / Performance

Cloud Pricing

H100 SXM

L40S

Model Compatibility

H100 SXM (1000 models)

L40S (1000 models)

You might also compare…