ipex-llm/python/llm
Cengguang Zhang 9930351112
LLM: add new qtype woq_int4 to support gemm int4 temporary. (#12706)
This PR adds a temporary qtype, woq_int4, to avoid affecting other qtypes and models.

Co-authored-by: leonardozcm <leonardo1997zcm@gmail.com>
2025-01-15 14:41:33 +08:00
dev Remove all ipex usage (#12666) 2025-01-08 10:31:18 +08:00
example Add Qwen2-VL HF GPU example with ModelScope Support (#12606) 2025-01-13 15:42:04 +08:00
portable-zip Fix null pointer dereferences error. (#11125) 2024-05-30 16:16:10 +08:00
scripts fix typo in python/llm/scripts/README.md (#11536) 2024-07-09 09:53:14 +08:00
src/ipex_llm LLM: add new qtype woq_int4 to support gemm int4 temporary. (#12706) 2025-01-15 14:41:33 +08:00
test Remove dummy model from performance tests (#12682) 2025-01-09 14:50:17 +08:00
tpp OSPDT: add tpp licenses (#11165) 2024-06-06 10:59:06 +08:00
.gitignore [LLM] add chatglm pybinding binary file release (#8677) 2023-08-04 11:45:27 +08:00
setup.py Add option with PyTorch 2.6 RC version for testing purposes (#12668) 2025-01-07 18:28:55 +08:00
version.txt Update pypi tag to 2.2.0.dev0 (#11895) 2024-08-22 16:48:09 +08:00