ipex-llm/python/llm/src
Ruonan Wang bf37b3a670 LLM: optimize CPU speculative decoding of chatglm3 (#9928)
* update

* fix style

* meet code review
2024-01-19 14:10:22 +08:00
..
bigdl LLM: optimize CPU speculative decoding of chatglm3 (#9928) 2024-01-19 14:10:22 +08:00