ipex-llm/python/llm/src/ipex_llm
2024-06-13 12:22:58 +08:00
Name | Last commit | Date
cli | Refactor bigdl.llm to ipex_llm (#24) | 2024-03-22 15:41:21 +08:00
ggml | Support Fp6 k in ipex-llm (#11222) | 2024-06-05 17:34:36 +08:00
gptq | Refactor bigdl.llm to ipex_llm (#24) | 2024-03-22 15:41:21 +08:00
langchain | Remove chatglm_C Module to Eliminate LGPL Dependency (#11178) | 2024-05-31 17:03:11 +08:00
llamaindex | Llamaindex: add tokenizer_id and support chat (#10590) | 2024-04-07 13:51:34 +08:00
serving | Expose timeout for streamer for fastchat worker (#11288) | 2024-06-12 17:02:40 +08:00
transformers | refactor chatglm2/3 (#11290) | 2024-06-13 12:22:58 +08:00
utils | Remove duplicate check for ipex (#11281) | 2024-06-12 13:52:02 +08:00
vllm | LLM: Fix vLLM CPU model convert mismatch (#11254) | 2024-06-07 15:54:34 +08:00
__init__.py | Update setup.py and add new actions and add compatible mode (#25) | 2024-03-22 15:44:59 +08:00
convert_model.py | Refactor bigdl.llm to ipex_llm (#24) | 2024-03-22 15:41:21 +08:00
format.sh | Refactor bigdl.llm to ipex_llm (#24) | 2024-03-22 15:41:21 +08:00
llm_patching.py | Upgrade Peft version to 0.10.0 for LLM finetune (#10886) | 2024-05-07 15:09:14 +08:00
models.py | Remove chatglm_C Module to Eliminate LGPL Dependency (#11178) | 2024-05-31 17:03:11 +08:00
optimize.py | Update tests for transformers 4.36 (#10858) | 2024-05-24 10:26:38 +08:00