ipex-llm/python
Ruonan Wang bf37b3a670 LLM: optimize CPU speculative decoding of chatglm3 (#9928)
* update

* fix style

* meet code review
2024-01-19 14:10:22 +08:00
llm LLM: optimize CPU speculative decoding of chatglm3 (#9928) 2024-01-19 14:10:22 +08:00