ipex-llm/python/llm
Yuwen Hu 5a15098835
Initial support for quantized forward on CPU when quantization_group_size=0 (#12282)
* Initial support for quantized forward on CPU when quantization_group_size=0

* Style fix

* Style fix

* Small fix

* Small fix
2024-10-29 19:40:17 +08:00
..
dev [NPU] Reuse prefill of acc lib for pipeline (#12279) 2024-10-28 16:05:49 +08:00
example Support baichuan2 for level0 pipeline (#12289) 2024-10-29 19:24:16 +08:00
portable-zip Fix null pointer dereferences error. (#11125) 2024-05-30 16:16:10 +08:00
scripts fix typo in python/llm/scripts/README.md (#11536) 2024-07-09 09:53:14 +08:00
src/ipex_llm Initial support for quantized forward on CPU when quantization_group_size=0 (#12282) 2024-10-29 19:40:17 +08:00
test fix UT (#12247) 2024-10-23 14:13:06 +08:00
tpp OSPDT: add tpp licenses (#11165) 2024-06-06 10:59:06 +08:00
.gitignore [LLM] add chatglm pybinding binary file release (#8677) 2023-08-04 11:45:27 +08:00
setup.py Support cpp release for ARL on Windows (#12189) 2024-10-14 17:20:31 +08:00
version.txt Update pypi tag to 2.2.0.dev0 (#11895) 2024-08-22 16:48:09 +08:00