xingyuan li
|
610084e3c0
|
[LLM] Complete windows unittest (#8611)
* add windows nightly test workflow
* use github runner to run pr test
* model load should use lowbit
* remove tmp dir after testing
|
2023-08-03 14:48:42 +09:00 |
|
Xin Qiu
|
fccae91461
|
Add load_low_bit save_load_bit to AutoModelForCausalLM (#8531)
* transformers save_low_bit load_low_bit
* update example and add readme
* update
* update
* update
* add ut
* update
|
2023-07-17 15:29:55 +08:00 |
|
Xin Qiu
|
90e3d86bce
|
rename low bit type name (#8512)
* change qx_0 to sym_intx
* update
* fix typo
* update
* fix type
* fix style
* add python doc
* meet code review
* fix style
|
2023-07-13 15:53:31 +08:00 |
|
Xin Qiu
|
cd7a980ec4
|
Transformer int4 add qtype, support q4_1 q5_0 q5_1 q8_0 (#8481)
* quant in Q4 5 8
* meet code review
* update readme
* style
* update
* fix error
* fix error
* update
* fix style
* update
* Update README.md
* Add load_in_low_bit
|
2023-07-12 08:23:08 +08:00 |
|
Zhao Changmin
|
81d655cda9
|
LLM: transformer int4 save and load (#8462)
* LLM: transformer int4 save and load
|
2023-07-10 16:34:41 +08:00 |
|
Ruonan Wang
|
4be784a49d
|
LLM: add UT for starcoder (convert, inference) update examples and readme (#8379)
* first commit to add path
* update example and readme
* update path
* fix
* update based on comment
|
2023-06-27 12:12:11 +08:00 |
|
Zhao Changmin
|
4d177ca0a1
|
LLM: Merge convert pth/gptq model script into one shell script (#8348)
* convert model in one
* model type
* license
* readme and pep8
* ut path
* rename
* readme
* fix docs
* without lines
|
2023-06-19 11:50:05 +08:00 |
|
Yuwen Hu
|
1aa33d35d5
|
[LLM] Refactor LLM Linux tests (#8349)
* Small name fix
* Add convert nightly tests, and for other llm tests, use stable ckpt
* Small fix and ftp fix
* Small fix
* Small fix
|
2023-06-16 15:22:48 +08:00 |
|