ipex-llm/python/llm/example
Heyang Sun 36a9e88104 Speculative Starcoder on CPU (#10138)
* Speculative Starcoder on CPU

* enable kv-cache pre-allocation

* refine codes

* refine

* fix style

* fix style

* fix style

* refine

* refine

* Update speculative.py

* Update gptbigcode.py

* fix style

* Update speculative.py

* enable mixed-datatype layernorm on top of torch API

* adaptive dtype

* Update README.md
2024-02-27 09:57:29 +08:00
CPU    Speculative Starcoder on CPU (#10138)    2024-02-27 09:57:29 +08:00
GPU    update Gemma readme (#10229)             2024-02-23 16:57:08 +08:00