|
cli
|
Refactor bigdl.llm to ipex_llm (#24)
|
2024-03-22 15:41:21 +08:00 |
|
ggml
|
Support q4k in ipex-llm (#10796)
|
2024-04-18 18:55:28 +08:00 |
|
gptq
|
Refactor bigdl.llm to ipex_llm (#24)
|
2024-03-22 15:41:21 +08:00 |
|
langchain
|
Add tokenizer_id in Langchain (#10588)
|
2024-04-03 14:25:35 +08:00 |
|
serving
|
Update README.md (#11003)
|
2024-05-13 16:44:48 +08:00 |
|
utils
|
Refactor bigdl.llm to ipex_llm (#24)
|
2024-03-22 15:41:21 +08:00 |
|
vllm
|
Add tensor parallel for vLLM (#10879)
|
2024-04-26 17:10:49 +08:00 |
|
convert_model.py
|
Refactor bigdl.llm to ipex_llm (#24)
|
2024-03-22 15:41:21 +08:00 |
|
format.sh
|
Refactor bigdl.llm to ipex_llm (#24)
|
2024-03-22 15:41:21 +08:00 |
|
models.py
|
Refactor bigdl.llm to ipex_llm (#24)
|
2024-03-22 15:41:21 +08:00 |
|
optimize.py
|
Add vLLM[xpu] related code (#10779)
|
2024-04-18 15:29:20 +08:00 |