Side-by-side GPU comparison: specs, memory, compute performance, and live cloud pricing.
Verdict
B200 has 51 GB more VRAM, making it better suited for large models and long context windows. For compute-bound workloads like training, B200 delivers 2.3× higher FP16 throughput. At $2.35/hr vs $5.49/hr, H200 SXM is the more cost-efficient choice for inference. B200 supports a broader range of models (659 vs 657 from this catalog), giving more flexibility.
Specifications
| B200 | H200 SXM | |
|---|---|---|
| VRAM | 192 GB | 141 GB |
| VRAM Type | HBM3e | HBM3e |
| Memory Bandwidth | 8.0 TB/s | 4.8 TB/s |
| FP16 Performance | 2250 TFLOPS | 990 TFLOPS |
| Manufacturer | NVIDIA | NVIDIA |
| FP8 Support | Yes | Yes |
| FP4 Support | Yes | No |
Price / Performance
Based on cheapest single-GPU on-demand pricing. Lower $/TFLOP = better compute value; lower $/GB = better memory value.
| B200 | H200 SXM | |
|---|---|---|
| $/hr (cheapest) | $5.49 | $2.35✓ best |
| $/TFLOP (compute value) | $0.0024 | $0.0024✓ best |
| $/GB VRAM (memory value) | $0.0286 | $0.0167✓ best |
Cloud Pricing
Cheapest on-demand price per provider (single GPU).
Model Compatibility
Models from the catalog that fit on each GPU, grouped by required precision.
B200 (659 models)
H200 SXM (657 models)
You might also compare…
Pricing data refreshed hourly · Last updated April 11, 2026 · Browse all comparisons