Yina Chen
|
ea5b373a97
|
Add lookahead GPU example (#10785)
* Add lookahead example
* fix style & attn mask
* fix typo
* address comments
|
2024-04-17 17:41:55 +08:00 |
|
Yina Chen
|
766fe45222
|
Fix spec error caused by lookup pr (#10777)
* Fix spec error
* remove
* fix style
|
2024-04-17 11:27:35 +08:00 |
|
Yina Chen
|
899d392e2f
|
Support prompt lookup in ipex-llm (#10768)
* lookup init
* add lookup
* fix style
* remove redundant code
* change param name
* fix style
|
2024-04-16 16:52:38 +08:00 |
|
Wang, Jian4
|
47cabe8fcc
|
LLM: Fix no return_last_logit running bigdl_ipex chatglm3 (#10678)
* fix no return_last_logits
* update only for chatglm
|
2024-04-07 15:27:58 +08:00 |
|
ZehuaCao
|
52a2135d83
|
Replace ipex with ipex-llm (#10554)
* fix ipex with ipex_llm
* fix ipex with ipex_llm
* update
* update
* update
* update
* update
* update
* update
* update
|
2024-03-28 13:54:40 +08:00 |
|
Xiangyu Tian
|
51d34ca68e
|
Fix wrong import in speculative (#10562)
|
2024-03-27 18:21:07 +08:00 |
|
Xiangyu Tian
|
11550d3f25
|
LLM: Add length check for IPEX-CPU speculative decoding (#10529)
Add length check for IPEX-CPU speculative decoding.
|
2024-03-26 17:47:10 +08:00 |
|
Wang, Jian4
|
9df70d95eb
|
Refactor bigdl.llm to ipex_llm (#24)
* Rename bigdl/llm to ipex_llm
* rm python/llm/src/bigdl
* from bigdl.llm to from ipex_llm
|
2024-03-22 15:41:21 +08:00 |
|