ipex-llm

History

SONG Ge 3f79128ed7 [LLM] Enable kv_cache optimization for Qwen2 on transformers-v4.37.0 (#10131 ) * add support for kv_cache optimization on transformers-v4.37.0 * enable attention forward * style fix * disable rotary for now		2024-02-08 14:20:26 +08:00
..
llm	[LLM] Enable kv_cache optimization for Qwen2 on transformers-v4.37.0 (#10131 )	2024-02-08 14:20:26 +08:00