Ruonan Wang
|
f41405368a
|
Support minicpm for NPU C++ (#12434)
* support minicpm-1b
* update
* tune fused_layers
* update readme.md
|
2024-11-25 10:42:02 +08:00 |
|
Ruonan Wang
|
0819fad34e
|
support Llama2-7B / Llama3-8B for NPU C++ (#12431)
* support llama2
* update
* support fused_layers=4 for Llama2-7B
|
2024-11-22 18:47:19 +08:00 |
|
Ruonan Wang
|
4ffa6c752c
|
New convert support for C++ NPU (#12430)
* initial commit
* fix
* fix style
* fix style
* fix
* fix
|
2024-11-22 14:28:30 +08:00 |
|
Ruonan Wang
|
2935e97610
|
small fix of cpp readme (#12425)
|
2024-11-21 18:21:34 +08:00 |
|
Ruonan Wang
|
7288c759ce
|
Initial NPU C++ Example (#12417)
* temp save
* meet review, update
* update
* meet review, add license
* typo
|
2024-11-21 10:09:26 +08:00 |
|