Ruonan Wang
|
076d106ef5
|
LLM: GPU QLoRA update to bf16 to accelerate gradient checkpointing (#9499)
* update to bf16 to accelerate gradient checkpoint
* add utils and fix ut
|
2023-11-21 17:08:36 +08:00 |
|
Ruonan Wang
|
0f82b8c3a0
|
LLM: update qlora example (#9454)
* update qlora example
* fix loss=0
|
2023-11-15 09:24:15 +08:00 |
|
Yang Wang
|
7a2de00b48
|
Fixes for xpu Bf16 training (#9156)
* Support bf16 training
* Use a stable transformer version
* remove env
* fix style
|
2023-10-14 21:28:59 -07:00 |
|
binbin Deng
|
5e9962b60e
|
LLM: update example layout (#9046)
|
2023-10-09 15:36:39 +08:00 |
|