ipex-llm

History

Yina Chen 841dbcdf3a Fix compresskv with lookahead issue (#11767 ) * fix compresskv + lookahead attn_mask qwen2 * support llama chatglm * support mistral & chatglm * address comments * revert run.py		2024-08-12 18:53:55 +08:00
..
dev	Update all-in-one benchmark for `continuation` task input preparation (#11760 )	2024-08-12 17:49:45 +08:00
example	optimize lookahead init time (#11769 )	2024-08-12 17:19:12 +08:00
portable-zip	Fix null pointer dereferences error. (#11125 )	2024-05-30 16:16:10 +08:00
scripts	fix typo in python/llm/scripts/README.md (#11536 )	2024-07-09 09:53:14 +08:00
src/ipex_llm	Fix compresskv with lookahead issue (#11767 )	2024-08-12 18:53:55 +08:00
test	Add `qwen2-1.5b-instruct` into igpu performance (#11735 )	2024-08-08 16:42:18 +08:00
tpp	OSPDT: add tpp licenses (#11165 )	2024-06-06 10:59:06 +08:00
.gitignore
setup.py	update doc/setup to use onednn gemm for cpp (#11598 )	2024-07-18 13:04:38 +08:00
version.txt