binbin Deng
|
ab01753b1c
|
[NPU] update save-load API usage (#12473)
|
2024-12-03 09:46:15 +08:00 |
|
binbin Deng
|
7a97fbb779
|
Support vpm and resampler module of minicpm-v on NPU (#12375)
|
2024-11-12 15:59:55 +08:00 |
|
Ruonan Wang
|
3fe2ea3081
|
[NPU] Reuse prefill of acc lib for pipeline (#12279)
* first commit
* update example
* fix style
* update example
* embedding as const
* fix generate
* code refactor
* meet code review
* fix style
* change max_output_len to max_context_len
* fix all-in-one
* fix example
* add check for new tokens
|
2024-10-28 16:05:49 +08:00 |
|
Ruonan Wang
|
573c20bae6
|
fix npu lm_head cpu condition (#11976)
* fix
* fix
* fix
* fix stype
* fix style
* fix style
|
2024-08-30 17:11:26 +08:00 |
|
SONG Ge
|
158289d205
|
[NPU] Add initial support for minicpm-llama-v2.5 (#11962)
* add initial support for minicpm-llama-v2.5
* update impl
* add minicpm-llama3-v2.5 example
|
2024-08-30 16:00:33 +08:00 |
|