Yina Chen
|
b2e69a896c
|
[NPU] Support Baichuan groupwise & gw code refactor (#12337)
* support minicpm 1b & qwen 1.5b gw
* support minicpm 1b
* baichuan part
* update
* support minicpm 1b & qwen 1.5b gw
* support minicpm 1b
* baichuan part
* update
* update
* update
* baichuan support
* code refactor
* remove code
* fix style
* address comments
* revert
|
2024-11-08 11:42:42 +08:00 |
|
binbin Deng
|
d409d9d0eb
|
[NPU L0] Update streaming mode of example (#12312)
|
2024-11-01 15:38:10 +08:00 |
|
Ruonan Wang
|
2b2cb9c693
|
[NPU pipeline] Support save & load and update examples (#12293)
* support save & load, update llama examples
* update baichuan2 example
* update readme
|
2024-10-30 10:02:00 +08:00 |
|
binbin Deng
|
3feb58d1e4
|
Support baichuan2 for level0 pipeline (#12289)
|
2024-10-29 19:24:16 +08:00 |
|