| Author | Commit | Message | Date |
|---|---|---|---|
| Jiao Wang | 93146b9433 | Reconstruct Speculative Decoding example directory (#11136): update; update; update | 2024-05-29 13:15:27 -07:00 |
| Wang, Jian4 | 23c6a52fb0 | LLM: Fix ipex torchscript=True error (#10832): remove; update; remove torchscript | 2024-04-22 15:53:09 +08:00 |
| ZehuaCao | 0646e2c062 | Fix short prompt for IPEX_CPU speculative decoding cause no_attr error (#10783) | 2024-04-17 16:19:57 +08:00 |
| Wang, Jian4 | 9df70d95eb | Refactor bigdl.llm to ipex_llm (#24): Rename bigdl/llm to ipex_llm; rm python/llm/src/bigdl; from bigdl.llm to from ipex_llm | 2024-03-22 15:41:21 +08:00 |
| Heyang Sun | 36a9e88104 | Speculative Starcoder on CPU (#10138): Speculative Starcoder on CPU; enable kv-cache pre-allocation; refine codes; refine; fix style; fix style; fix style; refine; refine; Update speculative.py; Update gptbigcode.py; fix style; Update speculative.py; enable mixed-datatype layernorm on top of torch API; adaptive dtype; Update README.md | 2024-02-27 09:57:29 +08:00 |