Wang, Jian4
|
fb53b994f8
|
LLM : Add llama ipex optimized (#10046)
* init ipex
* remove padding
|
2024-01-31 10:38:46 +08:00 |
|
Heyang Sun
|
b1ff28ceb6
|
LLama2 CPU example of speculative decoding (#9962)
* LLama2 example of speculative decoding
* add docs
* Update speculative.py
* Update README.md
* Update README.md
* Update speculative.py
* remove autocast
|
2024-01-31 09:45:20 +08:00 |
|
Xiangyu Tian
|
9978089796
|
[LLM] Enable BIGDL_OPT_IPEX in speculative baichuan2 13b example (#10028)
|
2024-01-30 17:11:37 +08:00 |
|
Heyang Sun
|
cc3f122f6a
|
Baichuan2 CPU example of speculative decoding (#10003)
* Baichuan2 CPU example of speculative decoding
* Update generate.py
* Update README.md
* Update generate.py
* Update generate.py
* Update generate.py
* fix default model
* fix wrong Chinese coding
* Update generate.py
* update prompt
* update sample outputs
* baichuan 7b needs transformers==4.31.0
* rename example file's name
|
2024-01-29 14:21:09 +08:00 |
|
Wang, Jian4
|
093e6f8f73
|
LLM: Add qwen CPU speculative example (#9985)
* init from gpu
* update for cpu
* update
* update
* fix xpu readme
* update
* update example prompt
* update prompt and add 72b
* update
* update
|
2024-01-25 17:01:34 +08:00 |
|