ipex-llm/python/llm/src/ipex_llm/ggml
Ruonan Wang f1156e6b20
support gguf_q4k_m / gguf_q4k_s (#10887)
* initial commit

* UPDATE

* fix style

* fix style

* add gguf_q4k_s

* update comment

* fix
2024-05-17 14:30:09 +08:00
..
model Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
__init__.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
convert.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
convert_model.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
quantize.py support gguf_q4k_m / gguf_q4k_s (#10887) 2024-05-17 14:30:09 +08:00