* Enable kv cache quantization by default for flex when 1 < batch <= 8. * Change up bound from <8 to <=8. |
||
|---|---|---|
| .. | ||
| ipex_llm | ||
* Enable kv cache quantization by default for flex when 1 < batch <= 8. * Change up bound from <8 to <=8. |
||
|---|---|---|
| .. | ||
| ipex_llm | ||