ipex-llm/python/llm
Yina Chen 5dad33e5af
Support fp8_e4m3 scale search (#11339)
* fp8e4m3 switch off

* fix style
2024-06-18 11:47:43 +08:00
..
dev Add lookahead in test_api: transformer_int4_fp16_gpu (#11337) 2024-06-17 17:41:41 +08:00
example Support finishing PP inference once eos_token_id is found (#11336) 2024-06-18 09:55:40 +08:00
portable-zip Fix null pointer dereferences error. (#11125) 2024-05-30 16:16:10 +08:00
scripts Miniconda/Anaconda -> Miniforge update in examples (#11194) 2024-06-04 10:14:02 +08:00
src/ipex_llm Support fp8_e4m3 scale search (#11339) 2024-06-18 11:47:43 +08:00
test Modify arc nightly perf to fp16 (#11275) 2024-06-17 13:47:22 +08:00
tpp OSPDT: add tpp licenses (#11165) 2024-06-06 10:59:06 +08:00
.gitignore [LLM] add chatglm pybinding binary file release (#8677) 2023-08-04 11:45:27 +08:00
setup.py Upgrade accelerate to 0.23.0 (#11331) 2024-06-17 15:03:11 +08:00
version.txt Update setup.py and add new actions and add compatible mode (#25) 2024-03-22 15:44:59 +08:00