ipex-llm/python/llm/src/ipex_llm
2024-12-04 17:14:16 +08:00
..
cli
ggml Support imatrix-guided quantization for NPU CW (#12468) 2024-12-02 11:31:26 +08:00
gptq
langchain
llamaindex
serving Upgrade to vllm 0.6.2 (#12338) 2024-11-12 20:35:34 +08:00
transformers optimize minicpm (#12496) 2024-12-04 17:14:16 +08:00
utils fix ipex 2.3 bug (#12366) 2024-11-08 13:29:15 +08:00
vllm add vLLM glm4 fix (#12474) 2024-12-02 14:05:16 +08:00
__init__.py IPEX Duplicate importer V2 (#11310) 2024-06-19 16:29:19 +08:00
convert_model.py
format.sh
llm_patching.py
models.py
optimize.py fix and optimize sd (#12436) 2024-11-25 14:09:48 +08:00