ipex-llm/python
Yina Chen 9ea499ca68 Optimize speculative decoding PVC memory usage (#10329)
* optimize memory

* update

* update

* update

* support other models

* update

* fix style
2024-03-06 09:54:21 +08:00
llm Optimize speculative decoding PVC memory usage (#10329) 2024-03-06 09:54:21 +08:00