ipex-llm/python/llm/example
Ruonan Wang 3fe2ea3081
[NPU] Reuse prefill of acc lib for pipeline (#12279)
* first commit

* update example

* fix style

* update example

* embedding as const

* fix generate

* code  refactor

* meet code review

* fix style

* change max_output_len to max_context_len

* fix all-in-one

* fix example

* add check for new tokens
2024-10-28 16:05:49 +08:00
..
CPU refactor ot remove old rope usage (#12224) 2024-10-17 17:06:09 +08:00
GPU refactor ot remove old rope usage (#12224) 2024-10-17 17:06:09 +08:00
NPU/HF-Transformers-AutoModels [NPU] Reuse prefill of acc lib for pipeline (#12279) 2024-10-28 16:05:49 +08:00