ipex-llm/python/llm
Yina Chen 841dbcdf3a
Fix compresskv with lookahead issue (#11767)
* fix compresskv + lookahead attn_mask qwen2
* support llama chatglm
* support mistral & chatglm
* address comments
* revert run.py
2024-08-12 18:53:55 +08:00
dev Update all-in-one benchmark for continuation task input preparation (#11760) 2024-08-12 17:49:45 +08:00
example optimize lookahead init time (#11769) 2024-08-12 17:19:12 +08:00
portable-zip Fix null pointer dereferences error. (#11125) 2024-05-30 16:16:10 +08:00
scripts fix typo in python/llm/scripts/README.md (#11536) 2024-07-09 09:53:14 +08:00
src/ipex_llm Fix compresskv with lookahead issue (#11767) 2024-08-12 18:53:55 +08:00
test Add qwen2-1.5b-instruct into igpu performance (#11735) 2024-08-08 16:42:18 +08:00
tpp OSPDT: add tpp licenses (#11165) 2024-06-06 10:59:06 +08:00
.gitignore [LLM] add chatglm pybinding binary file release (#8677) 2023-08-04 11:45:27 +08:00
setup.py update doc/setup to use onednn gemm for cpp (#11598) 2024-07-18 13:04:38 +08:00
version.txt Update setup.py and add new actions and add compatible mode (#25) 2024-03-22 15:44:59 +08:00