ipex-llm/python/llm/src/ipex_llm
2025-03-12 20:58:04 +08:00
..
cli Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
ggml LLM: add new qtype woq_int4 to support gemm int4 temporary. (#12706) 2025-01-15 14:41:33 +08:00
gptq Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
langchain Remove chatglm_C Module to Eliminate LGPL Dependency (#11178) 2024-05-31 17:03:11 +08:00
llamaindex Llamaindex: add tokenizer_id and support chat (#10590) 2024-04-07 13:51:34 +08:00
serving Upgrade to vllm 0.6.2 (#12338) 2024-11-12 20:35:34 +08:00
transformers optimize moonlight again (#12909) 2025-03-03 09:21:15 +08:00
utils R1 Hybrid: Add Benchmark for DeepSeek R1 transformers example (#12854) 2025-02-19 18:33:21 +08:00
vllm Add vllm api_server input output log (#12962) 2025-03-12 20:58:04 +08:00
__init__.py IPEX Duplicate importer V2 (#11310) 2024-06-19 16:29:19 +08:00
convert_model.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
format.sh Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
llm_patching.py Upgrade Peft version to 0.10.0 for LLM finetune (#10886) 2024-05-07 15:09:14 +08:00
models.py Remove chatglm_C Module to Eliminate LGPL Dependency (#11178) 2024-05-31 17:03:11 +08:00
optimize.py initial implementation for low_bit_loader vLLM (#12838) 2025-02-19 19:45:34 +08:00