ipex-llm/python
Yina Chen b38fb67bec
[NPU] lm head to cpu (#11943)
* lm head to cpu

* qwen2

* mv logic and add param to disable cpu_lm_head

* use env and lm_head opt to mp file

* fix

* update

* remove print
2024-08-28 16:34:07 +08:00
llm [NPU] lm head to cpu (#11943) 2024-08-28 16:34:07 +08:00