glm-4-9b
9.4B parameters · ChatGLMModel · chatglm
8,232 downloads
VRAM Requirements
| Quantization | VRAM Required |
|---|---|
| FP16 | 21 GB |
| Q8 | 10.5 GB |
| Q6 | 7.9 GB |
| Q4 | 5.3 GB |
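The quantized figures above look like the FP16 figure scaled linearly by bits per weight. The listing does not say how it derives these numbers, so the following is only a sketch under that assumption. The names `estimate_vram_gb` and `BITS_PER_WEIGHT` are hypothetical, and the bit widths are nominal values assumed from the quantization labels.

```python
# FP16 requirement in GB, copied from the table; everything else is assumed.
FP16_VRAM_GB = 21.0

# Nominal bits per weight for each quantization label (assumption).
BITS_PER_WEIGHT = {"FP16": 16, "Q8": 8, "Q6": 6, "Q4": 4}


def estimate_vram_gb(quant: str) -> float:
    """Scale the FP16 requirement linearly by bits per weight."""
    return FP16_VRAM_GB * BITS_PER_WEIGHT[quant] / 16


if __name__ == "__main__":
    for quant in BITS_PER_WEIGHT:
        print(f"{quant}: {estimate_vram_gb(quant):.3f} GB")
```

Under this assumption Q8 comes to 10.5 GB, Q6 to 7.875 GB, and Q4 to 5.25 GB, which agree with the table after rounding. The sketch ignores context length and KV cache size, both of which affect real memory use.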
Compatible GPUs
| GPU | VRAM | Best Quant | Price From | Rent |
|---|---|---|---|---|
| L4 | 24 GB | Q8 | $0.20/hr | |
| A40 | 48 GB | FP16 | $0.44/hr | Rent |
| A100 40GB | 40 GB | FP16 | $0.93/hr | Rent |
| A10G | 24 GB | Q8 | $1.01/hr | |
| A10 | 24 GB | Q8 | $1.29/hr | |
| A100 80GB PCIe | 80 GB | FP16 | $1.39/hr | Rent |
| A100 80GB | 80 GB | FP16 | $1.39/hr | Rent |
| H100 MEGA | 80 GB | FP16 | $1.80/hr | Rent |
| H100 SXM | 80 GB | FP16 | $1.80/hr | Rent |
| H100 NVL | 94 GB | FP16 | $1.80/hr | Rent |
Showing top 10.
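The Best Quant column can be approximated by picking the highest-precision quantization whose VRAM requirement fits on the GPU with some margin. The listing does not state its selection rule. The 20% headroom below is a hypothetical value chosen only because it happens to reproduce the column (24 GB cards get Q8, cards with 40 GB or more get FP16). `best_quant` and `HEADROOM` are illustrative names, not part of any tool.

```python
from typing import Optional

# VRAM requirements in GB, copied from the VRAM Requirements table.
VRAM_REQUIRED_GB = {"FP16": 21.0, "Q8": 10.5, "Q6": 7.9, "Q4": 5.3}

# Hypothetical safety margin; the listing does not document its rule.
HEADROOM = 1.2


def best_quant(gpu_vram_gb: float, headroom: float = HEADROOM) -> Optional[str]:
    """Return the highest-precision quantization that fits with headroom."""
    # Dicts preserve insertion order, so this walks from FP16 down to Q4.
    for quant, required_gb in VRAM_REQUIRED_GB.items():
        if required_gb * headroom <= gpu_vram_gb:
            return quant
    return None  # Nothing fits, even at Q4.


if __name__ == "__main__":
    for name, vram in [("L4", 24), ("A100 40GB", 40), ("H100 NVL", 94)]:
        print(f"{name}: {best_quant(vram)}")
```

Any headroom factor that rules out FP16 on 24 GB while allowing it on 40 GB gives the same result for these ten GPUs, so the exact value should be treated as a tunable guess.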