ipex-llm/python/llm/src/ipex_llm/ggml/model
Zhao Changmin cf8eb7b128
Init NPU quantize method and support q8_0_rtn (#11452)
* q8_0_rtn

* fix float point
2024-07-01 13:45:07 +08:00
bloom        Refactor bigdl.llm to ipex_llm (#24)                    2024-03-22 15:41:21 +08:00
generation   Refactor bigdl.llm to ipex_llm (#24)                    2024-03-22 15:41:21 +08:00
gptneox      Refactor bigdl.llm to ipex_llm (#24)                    2024-03-22 15:41:21 +08:00
llama        Init NPU quantize method and support q8_0_rtn (#11452)  2024-07-01 13:45:07 +08:00
starcoder    Refactor bigdl.llm to ipex_llm (#24)                    2024-03-22 15:41:21 +08:00
__init__.py  Refactor bigdl.llm to ipex_llm (#24)                    2024-03-22 15:41:21 +08:00