A100 40GB vs L40S


Side-by-side GPU comparison: specs, memory, compute performance, and live cloud pricing.

Verdict

The L40S has 8 GB more VRAM (48 GB vs 40 GB), making it better suited for large models and long context windows. For compute-bound workloads such as training, the L40S delivers roughly 1.2× the FP16 throughput (366 vs 312 TFLOPS). At $0.60/hr versus $0.93/hr, the L40S is also the more cost-efficient choice for inference, though note the A100's HBM2 retains a ~1.8× memory-bandwidth edge that can matter for memory-bound serving. Finally, the L40S fits a broader slice of this catalog (624 models vs 612), giving it more flexibility.

Specifications

| Spec | A100 40GB | L40S |
|---|---|---|
| VRAM | 40 GB | 48 GB |
| VRAM Type | HBM2 | GDDR6 |
| Memory Bandwidth | 1.6 TB/s | 0.9 TB/s |
| FP16 Performance | 312 TFLOPS | 366 TFLOPS |
| Manufacturer | NVIDIA | NVIDIA |
| FP8 Support | No | Yes |
| FP4 Support | No | No |
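
To see where the verdict's headline ratios come from, here is a minimal sketch (Python; the dict layout and variable names are ours, and the numbers are the spec-table values above):

```python
# Spec values from the table above (FP16 throughput in TFLOPS,
# bandwidth in TB/s); this dict is illustrative, not from any real API.
specs = {
    "A100 40GB": {"vram_gb": 40, "bandwidth_tbs": 1.6, "fp16_tflops": 312},
    "L40S":      {"vram_gb": 48, "bandwidth_tbs": 0.9, "fp16_tflops": 366},
}

fp16_ratio = specs["L40S"]["fp16_tflops"] / specs["A100 40GB"]["fp16_tflops"]
bw_ratio = specs["A100 40GB"]["bandwidth_tbs"] / specs["L40S"]["bandwidth_tbs"]

print(f"L40S FP16 advantage: {fp16_ratio:.2f}x")    # ~1.17x, the verdict's "1.2x"
print(f"A100 bandwidth advantage: {bw_ratio:.2f}x")  # ~1.78x, in the A100's favor
```

The FP16 ratio rounds to the 1.2× quoted in the verdict; the bandwidth ratio runs the other way, which is why the A100 can still win on memory-bound serving.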

Price / Performance

Based on cheapest single-GPU on-demand pricing. Lower $/TFLOP = better compute value; lower $/GB = better memory value.

| Metric | A100 40GB | L40S |
|---|---|---|
| $/hr (cheapest) | $0.93 | $0.60 ✓ best |
| $/TFLOP (compute value) | $0.0030 | $0.0016 ✓ best |
| $/GB VRAM (memory value) | $0.0232 | $0.0125 ✓ best |
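
The $/TFLOP and $/GB figures are simply the cheapest hourly rate divided by the spec values. A small sketch using only the numbers shown above (the last digit can differ from the table by one depending on rounding):

```python
# Cheapest on-demand rate ($/hr), FP16 throughput, and VRAM per card,
# taken from the tables above; the dict layout here is illustrative only.
gpus = {
    "A100 40GB": {"usd_hr": 0.93, "fp16_tflops": 312, "vram_gb": 40},
    "L40S":      {"usd_hr": 0.60, "fp16_tflops": 366, "vram_gb": 48},
}

for name, g in gpus.items():
    per_tflop = g["usd_hr"] / g["fp16_tflops"]  # compute value
    per_gb = g["usd_hr"] / g["vram_gb"]         # memory value
    print(f"{name}: ${per_tflop:.4f}/TFLOP, ${per_gb:.4f}/GB VRAM")
```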

Cloud Pricing

Cheapest on-demand price per provider (single GPU).

A100 40GB

| Provider | On-demand | Spot |
|---|---|---|
| Vast.ai | $0.93/hr | — |
| Google Cloud | $1.61/hr | $1.17/hr |
| Lambda | $1.99/hr | — |

L40S

| Provider | On-demand | Spot |
|---|---|---|
| Vast.ai | $0.60/hr | — |
| RunPod | $0.86/hr | $0.26/hr |
| Nebius | $1.55/hr | $0.75/hr |
| Amazon Web Services | $1.86/hr | $0.36/hr |
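
As an illustration of how the spot column translates into savings, a short sketch using the L40S rates above (the dict is hand-built from the table, not pulled from any live API; `None` marks providers with no spot tier listed):

```python
# Per-provider L40S rates from the table above: (on-demand $/hr, spot $/hr or None).
l40s = {
    "Vast.ai": (0.60, None),
    "RunPod": (0.86, 0.26),
    "Nebius": (1.55, 0.75),
    "Amazon Web Services": (1.86, 0.36),
}

# Cheapest on-demand provider (how the "cheapest" rows above are chosen).
best = min(l40s.items(), key=lambda kv: kv[1][0])
print(f"cheapest on-demand: {best[0]} at ${best[1][0]:.2f}/hr")

# Spot discount relative to the same provider's on-demand rate.
for provider, (od, spot) in l40s.items():
    if spot is not None:
        print(f"{provider}: spot is {100 * (1 - spot / od):.0f}% cheaper")
```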

Model Compatibility

Models from the catalog that fit on each GPU, grouped by required precision.

A100 40GB (612 models)

L40S (624 models)
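
The catalog's exact fit criterion isn't shown on this page, but a common rule of thumb is parameter count × bytes-per-parameter, plus headroom for activations and KV cache. A rough sketch under those assumptions (the 1.2× overhead factor and the `fits` helper are our invention, not the catalog's rule); note too that since the A100 lacks FP8 support per the spec table, FP8-only models would drop out of its list, which plausibly accounts for part of the 624-vs-612 gap:

```python
# Heuristic VRAM-fit check: weight memory = params * bytes-per-param, with a
# fudge factor for activations/KV cache. Illustrative only; not the catalog's
# actual fit criterion, which this page does not document.
BYTES_PER_PARAM = {"fp16": 2.0, "fp8": 1.0, "int4": 0.5}

def fits(params_b: float, precision: str, vram_gb: int, overhead: float = 1.2) -> bool:
    """True if a params_b-billion-parameter model plausibly fits in vram_gb."""
    weights_gb = params_b * BYTES_PER_PARAM[precision]  # 1B params @ fp16 ~= 2 GB
    return weights_gb * overhead <= vram_gb

# A 13B fp16 model (~26 GB weights, ~31 GB with headroom) fits on both cards;
# a 30B fp16 model (~72 GB with headroom) fits on neither.
print(fits(13, "fp16", 40), fits(13, "fp16", 48))  # True True
print(fits(30, "fp16", 40), fits(30, "fp16", 48))  # False False
```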


Pricing data refreshed hourly · Last updated April 11, 2026