41.8BFalconForCausalLMfalcon

21,196 downloads

VRAM Requirements

VRAM requirements for falcon-40b at different quantization levels
QuantizationVRAM Required
FP1693.5 GB
Q846.8 GB
Q635.1 GB
Q423.4 GB

Compatible GPUs

Top 10 compatible GPUs for falcon-40b, sorted by cheapest price
GPUVRAMBest QuantFromRent
H100 NVL94 GBQ8$1.80/hrRent
RTX Pro 6000 Blackwell96 GBQ8$1.89/hrRent
MI300X192 GBFP16$1.99/hrRent
GH20096 GBQ8$2.29/hr
H200 SXM141 GBFP16$2.35/hrRent
B200192 GBFP16$5.49/hrRent
B100192 GBFP16
MI325X256 GBFP16
MI350X288 GBFP16
Gaudi 3128 GBFP16

Showing top 10. Open Advisor for full results.

falcon-40b FAQ