ipex-llm/python/llm/src/ipex_llm/transformers/npu_models
Zhao Changmin cf8eb7b128
Init NPU quantize method and support q8_0_rtn (#11452)
* q8_0_rtn

* fix float point
2024-07-01 13:45:07 +08:00
__init__.py   optimize llama npu perf (#11426)                          2024-06-25 17:43:20 +08:00
common.py     optimize npu llama perf again (#11431)                    2024-06-26 10:52:54 +08:00
convert.py    Init NPU quantize method and support q8_0_rtn (#11452)    2024-07-01 13:45:07 +08:00
llama.py      fix npu llama2 (#11471)                                   2024-07-01 10:14:11 +08:00