| Author | Commit | Message | Date |
|---|---|---|---|
| ZehuaCao | 0646e2c062 | Fix short prompt for IPEX_CPU speculative decoding cause no_attr error (#10783) | 2024-04-17 16:19:57 +08:00 |
| Shaojun Liu | f37a1f2a81 | Upgrade to python 3.11 (#10711)<br>* create conda env with python 3.11<br>* recommend to use Python 3.11<br>* update | 2024-04-09 17:41:17 +08:00 |
| Wang, Jian4 | 16b2ef49c6 | Update_document by heyang (#30) | 2024-03-25 10:06:02 +08:00 |
| Wang, Jian4 | 9df70d95eb | Refactor bigdl.llm to ipex_llm (#24)<br>* Rename bigdl/llm to ipex_llm<br>* rm python/llm/src/bigdl<br>* from bigdl.llm to from ipex_llm | 2024-03-22 15:41:21 +08:00 |
| Xiangyu Tian | f445217d02 | LLM: Update IPEX to 2.2.0+cpu and Refactor for _ipex_optimize (#10189)<br>Update IPEX to 2.2.0+cpu and refactor for _ipex_optimize. | 2024-02-22 16:01:11 +08:00 |
| Wang, Jian4 | fb53b994f8 | LLM : Add llama ipex optimized (#10046)<br>* init ipex<br>* remove padding | 2024-01-31 10:38:46 +08:00 |
| Heyang Sun | b1ff28ceb6 | LLama2 CPU example of speculative decoding (#9962)<br>* LLama2 example of speculative decoding<br>* add docs<br>* Update speculative.py<br>* Update README.md<br>* Update README.md<br>* Update speculative.py<br>* remove autocast | 2024-01-31 09:45:20 +08:00 |