ipex-llm/python
Yuwen Hu f0ff0eebe1 [LLM] Support quantize kv cache for Baichuan2 7B (#10280)
* Add quatized kv cache framework for Baichuan2 7B

* Support quantize kv cache for baichuan2

* Small fix

* Fix python style
2024-03-01 13:35:42 +08:00
..
llm [LLM] Support quantize kv cache for Baichuan2 7B (#10280) 2024-03-01 13:35:42 +08:00