ipex-llm/python
Yina Chen 23c91cdce6 [LLM] Add min_step_draft in speculative decoding (#10142)
* Fix gptj kvcache & position id

* Add min_draft_tokens in speculative decoding

* fix style

* update
2024-02-19 14:31:41 +08:00
..
llm [LLM] Add min_step_draft in speculative decoding (#10142) 2024-02-19 14:31:41 +08:00