ipex-llm/python/llm
Yina Chen 0af0102e61
Add quantization scale search switch (#11326)
* add scale_search switch

* remove llama3 instruct

* remove print
2024-06-14 18:46:52 +08:00
..
dev Fix import error of ds autotp (#11307) 2024-06-13 16:22:52 +08:00
example LLM: Add /generate_stream endpoint for Pipeline-Parallel-FastAPI example (#11187) 2024-06-14 15:15:32 +08:00
portable-zip Fix null pointer dereferences error. (#11125) 2024-05-30 16:16:10 +08:00
scripts Miniconda/Anaconda -> Miniforge update in examples (#11194) 2024-06-04 10:14:02 +08:00
src/ipex_llm Add quantization scale search switch (#11326) 2024-06-14 18:46:52 +08:00
test exclude dolly-v2-12b for arc perf test (#11315) 2024-06-14 15:35:56 +08:00
tpp OSPDT: add tpp licenses (#11165) 2024-06-06 10:59:06 +08:00
.gitignore [LLM] add chatglm pybinding binary file release (#8677) 2023-08-04 11:45:27 +08:00
setup.py Remove chatglm_C Module to Eliminate LGPL Dependency (#11178) 2024-05-31 17:03:11 +08:00
version.txt Update setup.py and add new actions and add compatible mode (#25) 2024-03-22 15:44:59 +08:00