Llama-3.2-1B-Instruct
Open Advisor →1.2B
4,172,246 downloads
VRAM Requirements
| Quantization | VRAM Required |
|---|---|
| FP16 | 2.8 GB |
| Q8 | 1.4 GB |
| Q6 | 1 GB |
| Q4 | 0.7 GB |
Compatible GPUs
| GPU | VRAM | Best Quant | From | Rent |
|---|---|---|---|---|
| L4 | 24 GB | FP16 | $0.20/hr | |
| M60 | 8 GB | FP16 | $0.23/hr | |
| RTX A4000 | 16 GB | FP16 | $0.25/hr | Rent |
| T4 | 16 GB | FP16 | $0.35/hr | |
| A40 | 48 GB | FP16 | $0.44/hr | Rent |
| RTX 3090 | 24 GB | FP16 | $0.46/hr | Rent |
| RTX A4500 | 20 GB | FP16 | $0.50/hr | Rent |
| P4 | 8 GB | FP16 | $0.60/hr | |
| L40S | 48 GB | FP16 | $0.86/hr | Rent |
| RTX 5090 | 32 GB | FP16 | $0.99/hr | Rent |
Showing top 10. Open Advisor for full results.