ipex-llm/python
Ruonan Wang b8aee7bb1b LLM: Fix Qwen kv_cache optimization (#9148)
* first commit

* ut pass

* accelerate rotate half by using common util function

* fix style
2023-10-12 15:49:42 +08:00
llm LLM: Fix Qwen kv_cache optimization (#9148) 2023-10-12 15:49:42 +08:00