granite-20b-code-instruct-8k
Open Advisor →20.1BGPTBigCodeForCausalLMgpt_bigcode
VRAM Requirements
| Quantization | VRAM Required |
|---|---|
| FP16 | 44.9 GB |
| Q8 | 22.4 GB |
| Q6 | 16.8 GB |
| Q4 | 11.2 GB |
Compatible GPUs
| GPU | VRAM | Best Quant | From | Rent |
|---|---|---|---|---|
| A40 | 48 GB | Q8 | $0.44/hr | Rent |
| L40S | 48 GB | Q8 | $0.60/hr | Rent |
| L40 | 48 GB | Q8 | $0.82/hr | Rent |
| RTX A6000 | 48 GB | Q8 | $1.09/hr | |
| A100 80GB PCIe | 80 GB | FP16 | $1.39/hr | Rent |
| A100 80GB | 80 GB | FP16 | $1.39/hr | Rent |
| MI300X | 192 GB | FP16 | $1.99/hr | Rent |
| RTX Pro 6000 Blackwell | 96 GB | FP16 | $2.09/hr | Rent |
| GH200 | 96 GB | FP16 | $2.29/hr | |
| H100 MEGA | 80 GB | FP16 | $2.89/hr | Rent |
Showing top 10. Open Advisor for full results.