A100 40GB vs L4

Open Advisor →

Side-by-side GPU comparison: specs, memory, compute performance, and live cloud pricing.

Verdict

A100 40GB has 16 GB more VRAM, making it better suited for large models and long context windows. For compute-bound workloads like training, A100 40GB delivers 2.6× higher FP16 throughput. At $0.39/hr vs $0.93/hr, L4 is the more cost-efficient choice for inference. A100 40GB supports a broader range of models (612 vs 566 from this catalog), giving more flexibility.

Specifications

A100 40GBL4
VRAM40 GB24 GB
VRAM TypeHBM2GDDR6
Memory Bandwidth1.6 TB/s0.3 TB/s
FP16 Performance312 TFLOPS121 TFLOPS
ManufacturerNVIDIANVIDIA
FP8 SupportNoYes
FP4 SupportNoNo

Price / Performance

Based on cheapest single-GPU on-demand pricing. Lower $/TFLOP = better compute value; lower $/GB = better memory value.

A100 40GBL4
$/hr (cheapest)$0.93$0.39✓ best
$/TFLOP (compute value)$0.0030✓ best$0.0032
$/GB VRAM (memory value)$0.0232$0.0163✓ best

Cloud Pricing

Cheapest on-demand price per provider (single GPU).

A100 40GB

ProviderOn-demandSpotRent
Vast.ai$0.93/hrRent
Google Cloud$1.61/hr$1.17/hr
Lambda$1.99/hr

L4

ProviderOn-demandSpotRent
RunPod$0.39/hr$0.22/hrRent
Google Cloud$0.56/hr$0.16/hr
Amazon Web Services$0.80/hr$0.13/hr

Model Compatibility

Models from the catalog that fit on each GPU, grouped by required precision.

A100 40GB (612 models)

L4 (566 models)

You might also compare…

Pricing data refreshed hourly · Last updated April 11, 2026 · Browse all comparisons