Ruonan Wang
|
4b6c3160be
|
Support imatrix-guided quantization for NPU CW (#12468)
* init commit
* remove print
* add interface
* fix
* fix
* fix style
|
2024-12-02 11:31:26 +08:00 |
|
Zhao Changmin
|
cf8eb7b128
|
Init NPU quantize method and support q8_0_rtn (#11452)
* q8_0_rtn
* fix floating point
|
2024-07-01 13:45:07 +08:00 |
|
Yina Chen
|
0af0102e61
|
Add quantization scale search switch (#11326)
* add scale_search switch
* remove llama3 instruct
* remove print
|
2024-06-14 18:46:52 +08:00 |
|
Shaojun Liu
|
401013a630
|
Remove chatglm_C Module to Eliminate LGPL Dependency (#11178)
* remove chatglm_C.**.pyd to solve ngsolve weak copyright vuln
* fix style check error
* remove chatglm native int4 from langchain
|
2024-05-31 17:03:11 +08:00 |
|
Wang, Jian4
|
9df70d95eb
|
Refactor bigdl.llm to ipex_llm (#24)
* Rename bigdl/llm to ipex_llm
* rm python/llm/src/bigdl
* from bigdl.llm to from ipex_llm
|
2024-03-22 15:41:21 +08:00 |
|