ipex-llm/python
Ruonan Wang 004c45c2be LLM: Support optimized kv_cache for baichuan family (#8997)
* add initial support for baichuan attantion

* support baichuan1

* update based on comment

* update based on comment

* support baichuan2

* update link, change how to jusge baichuan2

* fix style

* add model parameter for pob emb

* update based on comment
2023-09-19 15:38:54 +08:00
..
llm LLM: Support optimized kv_cache for baichuan family (#8997) 2023-09-19 15:38:54 +08:00