Disclaimer:
The model is reproduced based on the paperVPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Modelsgithub andarXiv
The model itself is sourced from a community release.
It is intended only for experimental purposes.
Users are responsible for any consequences arising from the use of this model.
Note:
The PPL test results are for reference only and were collected using GPTQ testing script.
{"ctx_2048":{"wikitext2":7.414072513580322},"ctx_4096":{"wikitext2":6.940601348876953},"ctx_8192":{"wikitext2":6.678436756134033}}
- Downloads last month
- 25