ipex-llm/python
SONG Ge 3f79128ed7 [LLM] Enable kv_cache optimization for Qwen2 on transformers-v4.37.0 (#10131)
* add support for kv_cache optimization on transformers-v4.37.0

* enable attention forward

* style fix

* disable rotary for now
2024-02-08 14:20:26 +08:00
..
llm [LLM] Enable kv_cache optimization for Qwen2 on transformers-v4.37.0 (#10131) 2024-02-08 14:20:26 +08:00