Wang, Jian4
|
9df70d95eb
|
Refactor bigdl.llm to ipex_llm (#24)
* Rename bigdl/llm to ipex_llm
* rm python/llm/src/bigdl
* from bigdl.llm to from ipex_llm
|
2024-03-22 15:41:21 +08:00 |
|
Wang, Jian4
|
093e6f8f73
|
LLM: Add qwen CPU speculative example (#9985)
* init from gpu
* update for cpu
* update
* update
* fix xpu readme
* update
* update example prompt
* update prompt and add 72b
* update
* update
|
2024-01-25 17:01:34 +08:00 |
|
Yina Chen
|
99ff6cf048
|
Update gpu spec decoding baichuan2 example dependency (#9990)
* add dependency
* update
* update
|
2024-01-25 11:05:04 +08:00 |
|
Jason Dai
|
3bc3d0bbcd
|
Update self-speculative readme (#9986)
|
2024-01-24 22:37:32 +08:00 |
|
Yina Chen
|
b176cad75a
|
LLM: Add baichuan2 gpu spec example (#9973)
* add baichuan2 gpu spec example
* update readme & example
* remove print
* fix typo
* meet comments
* revert
* update
|
2024-01-24 16:40:16 +08:00 |
|
Yina Chen
|
5aa4b32c1b
|
LLM: Add qwen spec gpu example (#9965)
* add qwen spec gpu example
* update readme
---------
Co-authored-by: rnwang04 <ruonan1.wang@intel.com>
|
2024-01-23 15:59:43 +08:00 |
|