ipex-llm/python/llm/example/NPU
Ruonan Wang 3fe2ea3081
[NPU] Reuse prefill of acc lib for pipeline (#12279)
* first commit

* update example

* fix style

* update example

* embedding as const

* fix generate

* code  refactor

* meet code review

* fix style

* change max_output_len to max_context_len

* fix all-in-one

* fix example

* add check for new tokens
2024-10-28 16:05:49 +08:00
..
HF-Transformers-AutoModels [NPU] Reuse prefill of acc lib for pipeline (#12279) 2024-10-28 16:05:49 +08:00