H100 SXM vs L40S

Side-by-side GPU comparison: specs, memory, compute performance, and live cloud pricing.

Verdict

The H100 SXM has 32 GB more VRAM (80 GB vs 48 GB), which makes it better suited to large models and long context windows. For compute-bound workloads such as training, it delivers about 2.7× the FP16 throughput of the L40S (990 vs 366 TFLOPS). At $0.60/hr versus $1.80/hr, the L40S is the more cost-efficient choice for inference. The H100 SXM also fits slightly more models from this catalog (636 vs 624), which gives a little more flexibility.

Specifications

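| Spec | H100 SXM | L40S |
| --- | --- | --- |
| VRAM | 80 GB | 48 GB |
| VRAM Type | HBM3 | GDDR6 |
| Memory Bandwidth | 3.4 TB/s | 0.9 TB/s |
| FP16 Performance | 990 TFLOPS | 366 TFLOPS |
| Manufacturer | NVIDIA | NVIDIA |
| FP8 Support | Yes | Yes |
| FP4 Support | No | No |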

Price / Performance

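Based on the cheapest single-GPU on-demand price. $/TFLOP is that hourly price divided by FP16 TFLOPS, and $/GB is the hourly price divided by VRAM in GB. Lower $/TFLOP = better compute value; lower $/GB = better memory value.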

| Metric | H100 SXM | L40S |
| --- | --- | --- |
| $/hr (cheapest) | $1.80 | $0.60 ✓ best |
| $/TFLOP (compute value) | $0.0018 | $0.0016 ✓ best |
| $/GB VRAM (memory value) | $0.0225 | $0.0125 ✓ best |
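
The value figures above can be reproduced from the spec and price numbers on this page. The following is a minimal sketch, assuming each ratio is simply the cheapest hourly on-demand price divided by the spec value; the `Gpu` class and its property names are illustrative and not part of any tool referenced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gpu:
    name: str
    vram_gb: float
    fp16_tflops: float
    price_per_hr: float  # cheapest single-GPU on-demand price, USD

    @property
    def usd_per_tflop(self) -> float:
        # Hourly price divided by FP16 throughput (assumed definition).
        return self.price_per_hr / self.fp16_tflops

    @property
    def usd_per_gb(self) -> float:
        # Hourly price divided by VRAM capacity (assumed definition).
        return self.price_per_hr / self.vram_gb


# Figures taken from the tables on this page.
h100 = Gpu("H100 SXM", vram_gb=80, fp16_tflops=990, price_per_hr=1.80)
l40s = Gpu("L40S", vram_gb=48, fp16_tflops=366, price_per_hr=0.60)

for gpu in (h100, l40s):
    print(f"{gpu.name}: ${gpu.usd_per_tflop:.4f}/TFLOP, ${gpu.usd_per_gb:.4f}/GB")
```

Rounded to four decimals, the results match the table. The gap in $/TFLOP is small compared with the gap in $/GB, so the L40S's advantage is larger on memory value than on compute value.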

Cloud Pricing

Cheapest on-demand price per provider (single GPU).

H100 SXM

| Provider | On-demand | Spot |
| --- | --- | --- |
| Vast.ai | $1.80/hr | |
| RunPod | $2.39/hr | $1.25/hr |
| Nebius | $2.95/hr | $1.25/hr |
| Lambda | $3.29/hr | |
| Google Cloud | $4.20/hr | $1.14/hr |
| Amazon Web Services | $6.88/hr | $2.81/hr |
| Microsoft Azure | $6.98/hr | $6.98/hr |

L40S

| Provider | On-demand | Spot |
| --- | --- | --- |
| Vast.ai | $0.60/hr | |
| RunPod | $0.86/hr | $0.26/hr |
| Nebius | $1.55/hr | $0.75/hr |
| Amazon Web Services | $1.86/hr | $0.36/hr |
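
To pick the cheapest provider from the listings above, a small lookup is enough. This sketch assumes that the first price in each row is on-demand and the second, where present, is spot; rows with a single price are treated as having no spot offer. The `cheapest` helper is hypothetical.

```python
from typing import Optional

# provider -> (on-demand $/hr, spot $/hr or None when no spot price is listed)
Prices = dict[str, tuple[float, Optional[float]]]

H100_SXM: Prices = {
    "Vast.ai": (1.80, None),
    "RunPod": (2.39, 1.25),
    "Nebius": (2.95, 1.25),
    "Lambda": (3.29, None),
    "Google Cloud": (4.20, 1.14),
    "Amazon Web Services": (6.88, 2.81),
    "Microsoft Azure": (6.98, 6.98),
}
L40S: Prices = {
    "Vast.ai": (0.60, None),
    "RunPod": (0.86, 0.26),
    "Nebius": (1.55, 0.75),
    "Amazon Web Services": (1.86, 0.36),
}


def cheapest(prices: Prices, spot: bool = False) -> tuple[str, float]:
    """Return (provider, price) for the lowest on-demand or spot rate."""
    idx = 1 if spot else 0
    # Skip providers that do not list the requested price type.
    candidates = {p: v[idx] for p, v in prices.items() if v[idx] is not None}
    provider = min(candidates, key=candidates.get)
    return provider, candidates[provider]


for name, table in (("H100 SXM", H100_SXM), ("L40S", L40S)):
    print(name, "on-demand:", cheapest(table), "spot:", cheapest(table, spot=True))
```

Under these assumptions the cheapest on-demand option is Vast.ai for both GPUs, while the cheapest spot option is Google Cloud for the H100 SXM and RunPod for the L40S.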

Model Compatibility

Models from the catalog that fit on each GPU, grouped by required precision.
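
The page does not state how "fit" is determined. As a rough illustration only, the sketch below assumes a model fits when its weights, at a given precision, plus a flat 20% allowance for activations and cache stay within the GPU's VRAM. The byte sizes per parameter are standard, but the `overhead` factor, the `fits` helper, and the 30B example model are all assumptions and not the catalog's actual rule.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"fp16": 2.0, "fp8": 1.0}

# VRAM capacities from the specification table on this page.
VRAM_GB = {"H100 SXM": 80, "L40S": 48}


def weights_gb(params_billion: float, precision: str) -> float:
    # 1e9 parameters * bytes per parameter / 1e9 bytes per GB.
    return params_billion * BYTES_PER_PARAM[precision]


def fits(params_billion: float, precision: str, gpu: str,
         overhead: float = 1.2) -> bool:
    # overhead is an assumed allowance for activations and KV cache.
    return weights_gb(params_billion, precision) * overhead <= VRAM_GB[gpu]


# Hypothetical 30B-parameter model.
for precision in ("fp16", "fp8"):
    for gpu in VRAM_GB:
        print(f"30B @ {precision} on {gpu}: {fits(30, precision, gpu)}")
```

With these assumptions, a 30B model at FP16 fits on the H100 SXM but not on the L40S, and fits on both at FP8. That is the kind of difference that would produce the small gap in per-GPU model counts.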

H100 SXM (636 models)

L40S (624 models)

Pricing data refreshed hourly · Last updated April 11, 2026