| Author | Commit | Message | Date |
|---|---|---|---|
| binbin Deng | 2b9c7d2a59 | LLM: quick fix alpaca qlora finetuning script (#9534) | 2023-11-27 11:04:27 +08:00 |
| binbin Deng | 1a2129221d | LLM: support resume from checkpoint in Alpaca QLoRA (#9502) | 2023-11-22 13:49:14 +08:00 |
| Ruonan Wang | 076d106ef5 | LLM: GPU QLoRA update to bf16 to accelerate gradient checkpointing (#9499) * update to bf16 to accelerate gradient checkpoint * add utils and fix ut | 2023-11-21 17:08:36 +08:00 |
| binbin Deng | b7ae572ac3 | LLM: update Alpaca QLoRA finetuning example on GPU (#9492) | 2023-11-21 14:22:19 +08:00 |
| binbin Deng | 3dac21ac7b | LLM: add more example usages about alpaca qlora on different hardware (#9458) | 2023-11-17 09:56:43 +08:00 |
| Ruonan Wang | 0f82b8c3a0 | LLM: update qlora example (#9454) * update qlora example * fix loss=0 | 2023-11-15 09:24:15 +08:00 |
| binbin Deng | 54d95e4907 | LLM: add alpaca qlora finetuning example (#9276) | 2023-11-08 16:25:17 +08:00 |
| Ruonan Wang | d383ee8efb | LLM: update QLoRA example about accelerate version(#9314) | 2023-10-31 13:54:38 +08:00 |
| Yang Wang | 7a2de00b48 | Fixes for xpu Bf16 training (#9156) * Support bf16 training * Use a stable transformer version * remove env * fix style | 2023-10-14 21:28:59 -07:00 |
| binbin Deng | 5e9962b60e | LLM: update example layout (#9046) | 2023-10-09 15:36:39 +08:00 |