Side-by-side GPU comparison: specs, memory, compute performance, and live cloud pricing.
Verdict
L40S has 24 GB more VRAM, making it better suited for large models and long context windows. For compute-bound workloads like training, L40S delivers 10.3× higher FP16 throughput. At $0.60/hr vs $1.01/hr, L40S is the more cost-efficient choice for inference.
Specifications
| A10G | L40S | |
|---|---|---|
| VRAM | 24 GB | 48 GB |
| VRAM Type | GDDR6 | GDDR6 |
| Memory Bandwidth | 0.6 TB/s | 0.9 TB/s |
| FP16 Performance | 36 TFLOPS | 366 TFLOPS |
| Manufacturer | NVIDIA | NVIDIA |
| FP8 Support | No | Yes |
| FP4 Support | No | No |
Price / Performance
Based on cheapest single-GPU on-demand pricing. Lower $/TFLOP = better compute value; lower $/GB = better memory value.
| A10G | L40S | |
|---|---|---|
| $/hr (cheapest) | $1.01 | $0.60✓ best |
| $/TFLOP (compute value) | $0.0283 | $0.0016✓ best |
| $/GB VRAM (memory value) | $0.0419 | $0.0125✓ best |
Cloud Pricing
Cheapest on-demand price per provider (single GPU).
Model Compatibility
Models from the catalog that fit on each GPU, grouped by required precision.
A10G (1000 models)
L40S (1000 models)
You might also compare…
Pricing data refreshed hourly · Last updated June 1, 2026 · Browse all comparisons