ipex-llm

History

Yina Chen 3cd4e87168 Support compress KV with quantize KV (#11812 ) * update llama * support llama 4.41 * fix style * support minicpm * support qwen2 * support minicpm & update * support chatglm4 * support chatglm * remove print * add DynamicCompressFp8Cache & support qwen * support llama * support minicpm phi3 * update chatglm2/4 * small fix & support qwen 4.42 * remove print	2024-08-19 15:32:32 +08:00
..
llm	Support compress KV with quantize KV (#11812 )	2024-08-19 15:32:32 +08:00

Support compress KV with quantize KV (#11812 )

* update llama

* support llama 4.41

* fix style

* support minicpm

* support qwen2

* support minicpm & update

* support chatglm4

* support chatglm

* remove print

* add DynamicCompressFp8Cache & support qwen

* support llama

* support minicpm phi3

* update chatglm2/4

* small fix & support qwen 4.42

* remove print

2024-08-19 15:32:32 +08:00

llm

Support compress KV with quantize KV (#11812 )

2024-08-19 15:32:32 +08:00