ipex-llm/python
Yina Chen 70037ad55f
Groupwise prefill optimization (#12291)
* except lm_head

* remove

* support gw lm_head

* update

* fix

* remove run.bat

* fix style

* support llama3

* slice -> split

* remove debug

* fix style

* add dpu
2024-10-30 14:59:45 +08:00
..
llm Groupwise prefill optimization (#12291) 2024-10-30 14:59:45 +08:00