|
cli
|
Refactor bigdl.llm to ipex_llm (#24)
|
2024-03-22 15:41:21 +08:00 |
|
ggml
|
support gguf_q4k_m / gguf_q4k_s (#10887)
|
2024-05-17 14:30:09 +08:00 |
|
gptq
|
Refactor bigdl.llm to ipex_llm (#24)
|
2024-03-22 15:41:21 +08:00 |
|
langchain
|
Add tokenizer_id in Langchain (#10588)
|
2024-04-03 14:25:35 +08:00 |
|
serving
|
Fix tgi_api_server error file name (#11075)
|
2024-05-20 16:48:40 +08:00 |
|
transformers
|
fix qwen vl (#11090)
|
2024-05-21 18:40:29 +08:00 |
|
utils
|
Update benchmark util for example using (#11027)
|
2024-05-15 14:16:35 +08:00 |
|
vllm
|
Add tensor parallel for vLLM (#10879)
|
2024-04-26 17:10:49 +08:00 |
|
convert_model.py
|
Refactor bigdl.llm to ipex_llm (#24)
|
2024-03-22 15:41:21 +08:00 |
|
format.sh
|
Refactor bigdl.llm to ipex_llm (#24)
|
2024-03-22 15:41:21 +08:00 |
|
models.py
|
Refactor bigdl.llm to ipex_llm (#24)
|
2024-03-22 15:41:21 +08:00 |
|
optimize.py
|
Add vLLM[xpu] related code (#10779)
|
2024-04-18 15:29:20 +08:00 |