switch-c-2048_qmoe
This is thegoogle/switch-c-2048 model quantized with the QMoE framework to ternary precision and stored in the custom further compressed QMoE format.
Please see theQMoE repository for how to use this model.
- Downloads last month
- 5
Inference ProvidersNEW
This model isn't deployed by any Inference Provider.🙋Ask for provider support
HF Inference deployability: The model has no pipeline_tag.