ipex-llm/python
Xin Qiu 28c4a8cf5c Qwen fused qkv (#10368)
* fused qkv + rope for qwen

* quantized kv cache

* fix

* update qwen

* fixed quantized qkv

* fix

* meet code review

* update split

* convert.py

* extend when no enough kv

* fix
2024-03-12 17:39:00 +08:00
..
llm Qwen fused qkv (#10368) 2024-03-12 17:39:00 +08:00