Llama-3.1-Nemotron-Nano-4B-v1.1

Open Advisor →
4.5BLlamaForCausalLMllama

25,054 downloads

VRAM Requirements

VRAM requirements for Llama-3.1-Nemotron-Nano-4B-v1.1 at different quantization levels
QuantizationVRAM Required
FP1610.1 GB
Q85 GB
Q63.8 GB
Q42.5 GB

Compatible GPUs

Top 10 compatible GPUs for Llama-3.1-Nemotron-Nano-4B-v1.1, sorted by cheapest price
GPUVRAMBest QuantFromRent
L424 GBFP16$0.20/hr
RTX A400016 GBFP16$0.25/hrRent
T416 GBFP16$0.35/hr
A4048 GBFP16$0.44/hrRent
RTX 309024 GBFP16$0.46/hrRent
RTX A450020 GBFP16$0.50/hrRent
L40S48 GBFP16$0.86/hrRent
RTX 509032 GBFP16$0.99/hrRent
A10G24 GBFP16$1.01/hr
A1024 GBFP16$1.29/hr

Showing top 10. Open Advisor for full results.

Llama-3.1-Nemotron-Nano-4B-v1.1 FAQ