ipex-llm

Yuwen Hu ef4028ac2d [NPU] Support split `lm_head` for Qwen2 with CPP (#12491)	2024-12-04 14:41:08 +08:00
* Use split for Qwen2 lm_head instead of slice in optimize_pre
* Support split lm_head in Qwen2 python cpp backend
* Fit with Python acc lib pipeline
* Removed default mixed_precision=True in all-in-one and related examples
* Small fix
* Style fix
* Fix based on comments
* Fix based on comments
* Stype fix
benchmark	[NPU] Support split `lm_head` for Qwen2 with CPP (#12491)	2024-12-04 14:41:08 +08:00
test	Add benchmark_util for `transformers >= 4.44.0` (#12171)	2024-10-14 15:40:12 +08:00
print_glib_requirement.py	Fix null pointer dereferences error. (#11125)	2024-05-30 16:16:10 +08:00
release.sh	remove (#11527)	2024-07-08 15:49:52 +08:00
release_default_linux.sh	Update_document by heyang (#30)	2024-03-25 10:06:02 +08:00
release_default_windows.sh	Update_document by heyang (#30)	2024-03-25 10:06:02 +08:00