| Author | Commit | Message | Date |
|---|---|---|---|
| binbin Deng | ab01753b1c | [NPU] update save-load API usage (#12473) | 2024-12-03 09:46:15 +08:00 |
| Yina Chen | d872639395 | [NPU] Llama3, Qwen2 1.5b, MiniCPM 1/2B groupwise support (#12327): support minicpm 1b & qwen 1.5b gw; support minicpm 1b; support minicpm 2b; fix style & error; fix style & update; remove print | 2024-11-05 15:51:31 +08:00 |
| binbin Deng | d409d9d0eb | [NPU L0] Update streaming mode of example (#12312) | 2024-11-01 15:38:10 +08:00 |
| binbin Deng | eda764909c | Add minicpm-2b in L0 pipeline (#12308) | 2024-11-01 09:30:01 +08:00 |
| binbin Deng | 41b8064554 | Support minicpm-1B in level0 pipeline (#12297) | 2024-10-30 17:21:47 +08:00 |