Yuwen Hu
|
f61b1785fb
|
Small update to NPU example readme (#12034)
* Small update to NPU example readme
* Small fix
|
2024-09-06 15:54:23 +08:00 |
|
Ruonan Wang
|
79978e6f36
|
update npu multimodal readme (#11979)
* update npu readme of multimodal
* small fix
* meet comment
|
2024-08-30 19:02:06 +08:00 |
|
Ruonan Wang
|
4811a490ef
|
small fix (#11978)
* fix
* meet comment
|
2024-08-30 17:55:15 +08:00 |
|
Ruonan Wang
|
573c20bae6
|
fix npu lm_head cpu condition (#11976)
* fix
* fix
* fix
* fix stype
* fix style
* fix style
|
2024-08-30 17:11:26 +08:00 |
|
Ruonan Wang
|
60aa1a2c0f
|
Initial NPU support for MiniCPM-V-2_6 (#11966)
* initial pr
* update npu model
* fix
* fix kv cache type
* fix
* small fix
* fix style
* fix model id
* change inter_pp=4
* address comment
* fix
* fix style
* fix
* rebase
|
2024-08-30 16:34:35 +08:00 |
|
SONG Ge
|
158289d205
|
[NPU] Add initial support for minicpm-llama-v2.5 (#11962)
* add initial support for minicpm-llama-v2.5
* update impl
* add minicpm-llama3-v2.5 example
|
2024-08-30 16:00:33 +08:00 |
|
Jin, Qiao
|
c28b3389e6
|
Update npu multimodal example (#11773)
|
2024-08-13 14:14:59 +08:00 |
|
Jin, Qiao
|
a44ab32153
|
Switch to conhost when running on NPU (#11687)
|
2024-07-30 17:08:06 +08:00 |
|
Zhao Changmin
|
06745e5742
|
Add npu benchmark all-in-one script (#11571)
* npu benchmark
|
2024-07-15 10:42:37 +08:00 |
|
Zhao Changmin
|
105e124752
|
optimize phi3-v encoder npu performance and add multimodal example (#11553)
* phi3-v
* readme
|
2024-07-11 13:59:14 +08:00 |
|