Wang, Jian4
|
0193f29411
|
LLM : Enable gguf float16 and Yuan2 model (#10372)
* enable float16
* add yun files
* enable yun
* enable set low_bit on yuan2
* update
* update license
* update generate
* update readme
* update python style
* update
|
2024-03-13 10:19:18 +08:00 |
|
Wang, Jian4
|
496bb2e845
|
LLM: Support load BaiChuan model family gguf model (#9685)
* support baichuan model family gguf model
* update gguf generate.py
* add verify models
* add support model_family
* update
* update style
* update type
* update readme
* update
* remove support model_family
|
2023-12-15 13:34:33 +08:00 |
|
dingbaorong
|
89069d6173
|
Add gpu gguf example (#9603)
* add gpu gguf example
* some fixes
* address kai's comments
* address json's comments
|
2023-12-06 15:17:54 +08:00 |
|