ipex-llm/python/llm
Yuwen Hu 68f2873bd3
[NPU] Support repetition penalty for simple generate, Python (cpp backend) (#12522)
* Initial support of repetition penalty on NPU (cpp backend) for simple generate

* Bug fix for generation config and others

* Remove unnecessary print and style fix

* Remove unnecessary print

* Fix based on comments
2024-12-11 14:55:25 +08:00
..
dev [NPU] Fix load-low-bit benchmark script (#12502) 2024-12-05 10:01:32 +08:00
example [NPU] Support glm-edge models (#12511) 2024-12-09 14:06:27 +08:00
portable-zip
scripts fix typo in python/llm/scripts/README.md (#11536) 2024-07-09 09:53:14 +08:00
src/ipex_llm [NPU] Support repetition penalty for simple generate, Python (cpp backend) (#12522) 2024-12-11 14:55:25 +08:00
test Add MiniCPM-V-2_6 to arc perf test (#12349) 2024-11-06 16:32:28 +08:00
tpp
.gitignore
setup.py Add release support for option xpu_arc (#12422) 2024-12-02 17:16:04 +08:00
version.txt Update pypi tag to 2.2.0.dev0 (#11895) 2024-08-22 16:48:09 +08:00