ipex-llm/python/llm/src/ipex_llm/ggml/model
Ruonan Wang 4b6c3160be
Support imatrix-guided quantization for NPU CW (#12468)
* init commit

* remove print

* add interface

* fix

* fix

* fix style
2024-12-02 11:31:26 +08:00
bloom        Refactor bigdl.llm to ipex_llm (#24)                     2024-03-22 15:41:21 +08:00
generation   Refactor bigdl.llm to ipex_llm (#24)                     2024-03-22 15:41:21 +08:00
gptneox      Refactor bigdl.llm to ipex_llm (#24)                     2024-03-22 15:41:21 +08:00
llama        Support imatrix-guided quantization for NPU CW (#12468)  2024-12-02 11:31:26 +08:00
starcoder    Refactor bigdl.llm to ipex_llm (#24)                     2024-03-22 15:41:21 +08:00
__init__.py  Refactor bigdl.llm to ipex_llm (#24)                     2024-03-22 15:41:21 +08:00