| cli | Refactor bigdl.llm to ipex_llm (#24) | 2024-03-22 15:41:21 +08:00 |
| gptq | Refactor bigdl.llm to ipex_llm (#24) | 2024-03-22 15:41:21 +08:00 |
| serving | Remove tgi parameter validation (#11688) | 2024-07-30 16:37:44 +08:00 |
| transformers | Qwen support compress kv (#11680) | 2024-07-30 11:16:42 +08:00 |
| vllm | Enable ipex-llm optimization for lm head (#11589) | 2024-07-16 16:48:44 +08:00 |
| __init__.py | IPEX Duplicate importer V2 (#11310) | 2024-06-19 16:29:19 +08:00 |
| convert_model.py | Refactor bigdl.llm to ipex_llm (#24) | 2024-03-22 15:41:21 +08:00 |
| format.sh | Refactor bigdl.llm to ipex_llm (#24) | 2024-03-22 15:41:21 +08:00 |
| optimize.py | Update tests for transformers 4.36 (#10858) | 2024-05-24 10:26:38 +08:00 |