| Author | Commit | Message | Date |
|---|---|---|---|
| Heyang Sun | c64e2248ef | fix str returned by get_int_from_str rather than expected int (#9667) | 2023-12-13 11:01:21 +08:00 |
| ZehuaCao | 6eca8a8bb5 | update transformer version (#9631) | 2023-12-08 09:36:00 +08:00 |
| Heyang Sun | 4e70e33934 | [LLM] code and document for distributed qlora (#9585)<br>* [LLM] code and document for distributed qlora<br>* doc<br>* refine for gradient checkpoint<br>* refine<br>* Update alpaca_qlora_finetuning_cpu.py<br>* Update alpaca_qlora_finetuning_cpu.py<br>* Update alpaca_qlora_finetuning_cpu.py<br>* add link in doc | 2023-12-06 09:23:17 +08:00 |
| Wang, Jian4 | ed0dc57c6e | LLM: Add cpu qlora support other models guide (#9567)<br>* use bf16 flag<br>* add using baichuan model<br>* update merge<br>* remove<br>* update | 2023-12-01 11:18:04 +08:00 |
| Wang, Jian4 | a0a80d232e | LLM: Add qlora cpu distributed readme (#9561)<br>* init readme<br>* add distributed guide<br>* update | 2023-11-30 13:42:30 +08:00 |
| Wang, Jian4 | b824754256 | LLM: Update for cpu qlora mpirun (#9548) | 2023-11-29 10:56:17 +08:00 |
| Heyang Sun | 48fbb1eb94 | support ccl (MPI) distributed mode in alpaca_qlora_finetuning_cpu (#9507) | 2023-11-23 10:58:09 +08:00 |
| binbin Deng | 86743fb57b | LLM: fix transformers version in CPU finetuning example (#9511) | 2023-11-22 15:53:07 +08:00 |
| Wang, Jian4 | c5cb3ab82e | LLM : Add CPU alpaca qlora example (#9469)<br>* init<br>* update xpu to cpu<br>* update<br>* update readme<br>* update example<br>* update<br>* add refer<br>* add guide to train different datasets<br>* update readme<br>* update | 2023-11-21 09:19:58 +08:00 |