ipex-llm/python
Ruonan Wang d61f4905ac LLM: 2bit quantization initial support (#10042)
* basic quantize support

* fix new module name

* small update

* add mixed int4 with iq2_xxs

* remove print

* code refactor

* fix style

* meet code review
2024-02-06 14:58:32 +08:00
llm LLM: 2bit quantization initial support (#10042) 2024-02-06 14:58:32 +08:00