Phi-3.5-MoE-instruct
41.9B · PhiMoEForCausalLM · MoE 2/16 · phimoe
140,082 downloads
VRAM Requirements
| Quantization | VRAM Required |
|---|---|
| FP16 | 93.6 GB |
| Q8 | 46.8 GB |
| Q6 | 35.1 GB |
| Q4 | 23.4 GB |
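The figures in this table scale linearly with bit width: each row equals the FP16 figure multiplied by bits/16. The sketch below reproduces that arithmetic. It assumes the labels Q8, Q6, and Q4 mean exactly 8, 6, and 4 bits per weight. The page does not state its formula, and the names `BITS_PER_WEIGHT` and `estimate_vram_gb` are illustrative.

```python
# Bits per weight assumed for each quantization label (an assumption;
# real quantized formats often use slightly more bits per weight).
BITS_PER_WEIGHT = {"FP16": 16, "Q8": 8, "Q6": 6, "Q4": 4}

FP16_VRAM_GB = 93.6  # FP16 figure taken from the table above


def estimate_vram_gb(quant: str, fp16_gb: float = FP16_VRAM_GB) -> float:
    """Scale the FP16 footprint linearly by bits per weight."""
    return round(fp16_gb * BITS_PER_WEIGHT[quant] / 16, 1)


for quant in BITS_PER_WEIGHT:
    print(f"{quant}: {estimate_vram_gb(quant)} GB")
```

For comparison, 41.9B parameters at 2 bytes each come to about 83.8 GB, so the 93.6 GB FP16 figure appears to include some overhead. The page does not say what that overhead covers.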
Compatible GPUs
| GPU | VRAM | Best Quant | Price From | Rent |
|---|---|---|---|---|
| H100 NVL | 94 GB | Q8 | $1.80/hr | Rent |
| MI300X | 192 GB | FP16 | $1.99/hr | Rent |
| RTX Pro 6000 Blackwell | 96 GB | Q8 | $2.09/hr | Rent |
| GH200 | 96 GB | Q8 | $2.29/hr | |
| H200 SXM | 141 GB | FP16 | $2.35/hr | Rent |
| B200 | 192 GB | FP16 | $5.50/hr | |
| B100 | 192 GB | FP16 | — | |
| MI325X | 256 GB | FP16 | — | |
| MI350X | 288 GB | FP16 | — | |
| Gaudi 3 | 128 GB | FP16 | — | |
Showing the top 10 compatible GPUs.
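The Best Quant column can be read as "the highest-precision quantization that fits on the card with some room to spare". The 94 GB and 96 GB cards are listed as Q8 even though FP16 nominally needs 93.6 GB. The sketch below is one way to express that selection. The 10% `HEADROOM` margin is a hypothetical value, chosen only because it is consistent with the rows shown; the page does not document its actual rule, and `best_quant` is an illustrative name.

```python
from typing import Optional

# VRAM requirements in GB, copied from the VRAM Requirements table,
# ordered from highest to lowest precision.
VRAM_REQUIRED_GB = [("FP16", 93.6), ("Q8", 46.8), ("Q6", 35.1), ("Q4", 23.4)]

# Hypothetical safety margin; the page does not state its actual rule.
HEADROOM = 1.10


def best_quant(gpu_vram_gb: float, headroom: float = HEADROOM) -> Optional[str]:
    """Return the highest-precision quantization that fits with headroom."""
    for quant, required_gb in VRAM_REQUIRED_GB:
        if required_gb * headroom <= gpu_vram_gb:
            return quant
    return None  # nothing fits, even at Q4


print(best_quant(94))   # H100 NVL
print(best_quant(192))  # MI300X
```

Under this assumed margin, a 24 GB card falls just short of Q4 (23.4 GB × 1.10 is about 25.7 GB), so `best_quant(24)` returns `None`.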