Yishuo Wang
|
1dc680341b
|
fix phi-3-vision import (#11129)
|
2024-05-24 15:57:15 +08:00 |
|
Ruonan Wang
|
f1156e6b20
|
support gguf_q4k_m / gguf_q4k_s (#10887)
* initial commit
* UPDATE
* fix style
* fix style
* add gguf_q4k_s
* update comment
* fix
|
2024-05-17 14:30:09 +08:00 |
|
Yina Chen
|
893197434d
|
Add fp6 support on gpu (#11008)
* add fp6 support
* fix style
|
2024-05-14 16:31:44 +08:00 |
|
Zhao Changmin
|
0d6e12036f
|
Disable fast_init_ in load_low_bit (#10945)
* fast_init_ disable
|
2024-05-08 10:46:19 +08:00 |
|
Yang Wang
|
1ce8d7bcd9
|
Support the desc_act feature in GPTQ model (#10851)
* support act_order
* update versions
* fix style
* fix bug
* clean up
|
2024-04-24 10:17:13 -07:00 |
|
Ruonan Wang
|
439c834ed3
|
LLM: add mixed precision for lm_head (#10795)
* add mixed_quantization
* meet code review
* update
* fix style
* meet review
|
2024-04-18 19:11:31 +08:00 |
|
Yina Chen
|
8796401b08
|
Support q4k in ipex-llm (#10796)
* support q4k
* update
|
2024-04-18 18:55:28 +08:00 |
|
Ruonan Wang
|
0e8aac19e3
|
add q6k precision in ipex-llm (#10792)
* add q6k
* add initial 16k
* update
* fix style
|
2024-04-18 16:52:09 +08:00 |
|
Yina Chen
|
899d392e2f
|
Support prompt lookup in ipex-llm (#10768)
* lookup init
* add lookup
* fix style
* remove redundant code
* change param name
* fix style
|
2024-04-16 16:52:38 +08:00 |
|
Zhicun
|
b4147a97bb
|
Fix dtype mismatch error (#10609)
* fix llama
* fix
* fix code style
* add torch type in model.py
---------
Co-authored-by: arda <arda@arda-arc19.sh.intel.com>
|
2024-04-09 17:50:33 +08:00 |
|
Shaojun Liu
|
a10f5a1b8d
|
add python style check (#10620)
* add python style check
* fix style checks
* update runner
* add ipex-llm-finetune-qlora-cpu-k8s to manually_build workflow
* update tag to 2.1.0-SNAPSHOT
|
2024-04-02 16:17:56 +08:00 |
|
Ruonan Wang
|
0136fad1d4
|
LLM: support iq1_s (#10564)
* init version
* update utils
* remove unused code
|
2024-03-29 09:43:55 +08:00 |
|
Wang, Jian4
|
9df70d95eb
|
Refactor bigdl.llm to ipex_llm (#24)
* Rename bigdl/llm to ipex_llm
* rm python/llm/src/bigdl
* from bigdl.llm to from ipex_llm
|
2024-03-22 15:41:21 +08:00 |
|