Side-by-side GPU comparison: specs, memory, compute performance, and live cloud pricing.
Verdict
H100 NVL has 46 GB more VRAM, making it better suited for large models and long context windows. For compute-bound workloads like training, H100 NVL delivers 5.6× higher FP16 throughput. At $0.44/hr vs $1.80/hr, A40 is the more cost-efficient choice for inference. H100 NVL supports a broader range of models (637 vs 624 from this catalog), giving more flexibility.
Specifications
| A40 | H100 NVL | |
|---|---|---|
| VRAM | 48 GB | 94 GB |
| VRAM Type | GDDR6 | HBM3 |
| Memory Bandwidth | 0.7 TB/s | 3.9 TB/s |
| FP16 Performance | 150 TFLOPS | 835 TFLOPS |
| Manufacturer | NVIDIA | NVIDIA |
| FP8 Support | No | Yes |
| FP4 Support | No | No |
Price / Performance
Based on cheapest single-GPU on-demand pricing. Lower $/TFLOP = better compute value; lower $/GB = better memory value.
| A40 | H100 NVL | |
|---|---|---|
| $/hr (cheapest) | $0.44✓ best | $1.80 |
| $/TFLOP (compute value) | $0.0029 | $0.0022✓ best |
| $/GB VRAM (memory value) | $0.0092✓ best | $0.0191 |
Cloud Pricing
Cheapest on-demand price per provider (single GPU).
Model Compatibility
Models from the catalog that fit on each GPU, grouped by required precision.
A40 (624 models)
H100 NVL (637 models)
You might also compare…
Pricing data refreshed hourly · Last updated April 11, 2026 · Browse all comparisons