binbin Deng
|
66f6ffe4b2
|
Update GPU HF-Transformers example structure (#11526)
|
2024-07-08 17:58:06 +08:00 |
|
Jin Qiao
|
10ee786920
|
Replace with IPEX-LLM in example comments (#10671)
* Replace with IPEX-LLM in example comments
* More replacement
* revert some changes
|
2024-04-07 13:29:51 +08:00 |
|
ZehuaCao
|
52a2135d83
|
Replace ipex with ipex-llm (#10554)
* fix ipex with ipex_llm
* fix ipex with ipex_llm
* update
* update
* update
* update
* update
* update
* update
* update
|
2024-03-28 13:54:40 +08:00 |
|
Wang, Jian4
|
9df70d95eb
|
Refactor bigdl.llm to ipex_llm (#24)
* Rename bigdl/llm to ipex_llm
* rm python/llm/src/bigdl
* from bigdl.llm to from ipex_llm
|
2024-03-22 15:41:21 +08:00 |
|
Ruonan Wang
|
a9fd20b6ba
|
LLM: Update qkv fusion for GGUF-IQ2 (#10271)
* first commit
* update mistral
* fix transformers==4.36.0
* fix
* disable qk for mixtral now
* fix style
|
2024-02-29 12:49:53 +08:00 |
|
Ruonan Wang
|
5e1fee5e05
|
LLM: add GGUF-IQ2 examples (#10207)
* add iq2 examples
* small fix
* meet code review
* fix
* meet review
* small fix
|
2024-02-22 14:18:45 +08:00 |
|