ipex-llm/python/llm/src/ipex_llm
2024-04-22 18:56:47 +08:00
cli
ggml              Support q4k in ipex-llm (#10796)                        2024-04-18 18:55:28 +08:00
gptq
langchain
llamaindex        Llamaindex: add tokenizer_id and support chat (#10590)  2024-04-07 13:51:34 +08:00
serving           [vLLM]Remove vllm-v1, refactor v2 (#10842)              2024-04-22 17:51:32 +08:00
transformers      add phi-2 optimization (#10843)                         2024-04-22 18:56:47 +08:00
utils
vllm              [vLLM]Remove vllm-v1, refactor v2 (#10842)              2024-04-22 17:51:32 +08:00
__init__.py
convert_model.py
format.sh
llm_patching.py   Axolotl v0.4.0 support (#10773)                         2024-04-17 09:49:11 +08:00
models.py
optimize.py       Add vLLM[xpu] related code (#10779)                     2024-04-18 15:29:20 +08:00