ipex-llm/python/llm
Yina Chen 70037ad55f
Groupwise prefill optimization (#12291)
* except lm_head

* remove

* support gw lm_head

* update

* fix

* remove run.bat

* fix style

* support llama3

* slice -> split

* remove debug

* fix style

* add dpu
2024-10-30 14:59:45 +08:00
..
dev [NPU] Reuse prefill of acc lib for pipeline (#12279) 2024-10-28 16:05:49 +08:00
example [NPU pipeline] Support save & load and update examples (#12293) 2024-10-30 10:02:00 +08:00
portable-zip Fix null pointer dereferences error. (#11125) 2024-05-30 16:16:10 +08:00
scripts fix typo in python/llm/scripts/README.md (#11536) 2024-07-09 09:53:14 +08:00
src/ipex_llm Groupwise prefill optimization (#12291) 2024-10-30 14:59:45 +08:00
test fix UT (#12247) 2024-10-23 14:13:06 +08:00
tpp OSPDT: add tpp licenses (#11165) 2024-06-06 10:59:06 +08:00
.gitignore [LLM] add chatglm pybinding binary file release (#8677) 2023-08-04 11:45:27 +08:00
setup.py Support cpp release for ARL on Windows (#12189) 2024-10-14 17:20:31 +08:00
version.txt Update pypi tag to 2.2.0.dev0 (#11895) 2024-08-22 16:48:09 +08:00