ipex-llm/python
Xin Qiu cd7a980ec4 Transformer int4 add qtype, support q4_1 q5_0 q5_1 q8_0 (#8481)
* quant in Q4 5 8

* meet code review

* update readme

* style

* update

* fix error

* fix error

* update

* fix style

* update

* Update README.md

* Add load_in_low_bit
2023-07-12 08:23:08 +08:00
..
llm Transformer int4 add qtype, support q4_1 q5_0 q5_1 q8_0 (#8481) 2023-07-12 08:23:08 +08:00