ipex-llm/python
Cengguang Zhang 735a17f7b4 LLM: add kv cache to falcon family. (#8995)
* add kv cache to falcon family.

* fix: import error.

* refactor

* update comments.

* add two version falcon attention forward.

* fix

* fix.

* fix.

* fix.

* fix style.

* fix style.
2023-09-20 15:36:30 +08:00
..
llm LLM: add kv cache to falcon family. (#8995) 2023-09-20 15:36:30 +08:00