ipex-llm

History

Yuwen Hu 828fa01ad3 [NPU] Add `mixed_precision` for Qwen2 7B (#12098 ) * Add mix_precision argument to control whether use INT8 lm_head for Qwen2-7B-Instruct * Small fix * Fixed on load low bit with mixed precision * Small fix * Update example accordingly * Update for default prompt * Update base on comments * Final fix		2024-09-20 16:36:21 +08:00
..
dev	add `transpose_value_cache` for NPU benchmark (#12092 )	2024-09-19 18:45:05 +08:00
example	[NPU] Add `mixed_precision` for Qwen2 7B (#12098 )	2024-09-20 16:36:21 +08:00
portable-zip	Fix null pointer dereferences error. (#11125 )	2024-05-30 16:16:10 +08:00
scripts	fix typo in python/llm/scripts/README.md (#11536 )	2024-07-09 09:53:14 +08:00
src/ipex_llm	[NPU] Add `mixed_precision` for Qwen2 7B (#12098 )	2024-09-20 16:36:21 +08:00
test	fix UT (#12005 )	2024-09-04 18:02:49 +08:00
tpp	OSPDT: add tpp licenses (#11165 )	2024-06-06 10:59:06 +08:00
.gitignore	[LLM] add chatglm pybinding binary file release (#8677 )	2023-08-04 11:45:27 +08:00
setup.py	upgrade OneAPI version for cpp Windows (#12063 )	2024-09-12 11:12:12 +08:00
version.txt	Update pypi tag to 2.2.0.dev0 (#11895 )	2024-08-22 16:48:09 +08:00