Qwen3-4B
Open Advisor →4BQwen3ForCausalLMqwen3
8,255,392 downloads
VRAM Requirements
| Quantization | VRAM Required |
|---|---|
| FP16 | 9 GB |
| Q8 | 4.5 GB |
| Q6 | 3.4 GB |
| Q4 | 2.2 GB |
Compatible GPUs
| GPU | VRAM | Best Quant | From | Rent |
|---|---|---|---|---|
| L4 | 24 GB | FP16 | $0.20/hr | |
| RTX A4000 | 16 GB | FP16 | $0.25/hr | Rent |
| T4 | 16 GB | FP16 | $0.35/hr | |
| A40 | 48 GB | FP16 | $0.44/hr | Rent |
| RTX 3090 | 24 GB | FP16 | $0.46/hr | Rent |
| RTX A4500 | 20 GB | FP16 | $0.50/hr | Rent |
| L40S | 48 GB | FP16 | $0.86/hr | Rent |
| RTX 5090 | 32 GB | FP16 | $0.99/hr | Rent |
| A10G | 24 GB | FP16 | $1.01/hr | |
| A10 | 24 GB | FP16 | $1.29/hr |
Showing top 10. Open Advisor for full results.