Yishuo Wang
|
ccf618ff4a
|
Remove all ipex usage (#12666)
|
2025-01-08 10:31:18 +08:00 |
|
Guancheng Fu
|
7f772c5a4f
|
Add half precision for fastchat models (#11130)
|
2024-05-24 15:41:14 +08:00 |
|
Guancheng Fu
|
74997a3ed1
|
Adding load_low_bit interface for ipex_llm_worker (#11000)
* initial implementation, need tests
* fix
* fix baichuan issue
* fix typo
|
2024-05-13 15:30:19 +08:00 |
|
Wang, Jian4
|
0e0bd309e2
|
LLM: Enable Speculative on Fastchat (#10909)
* init
* enable streamer
* update
* update
* remove deprecated
* update
* update
* add gpu example
|
2024-05-06 10:06:20 +08:00 |
|
Guancheng Fu
|
fbcd7bc737
|
Fix Loader issue with dtype fp16 (#10907)
|
2024-04-29 10:16:02 +08:00 |
|
Wang, Jian4
|
9df70d95eb
|
Refactor bigdl.llm to ipex_llm (#24)
* Rename bigdl/llm to ipex_llm
* rm python/llm/src/bigdl
* from bigdl.llm to from ipex_llm
|
2024-03-22 15:41:21 +08:00 |
|