Chu,Youcheng
|
ffa9a9e1b3
|
Update streaming in npu examples (#12495)
* feat: add streaming
* Update readme accordingly
---------
Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>
|
2024-12-04 17:51:10 +08:00 |
|
Jin, Qiao
|
7082844f3f
|
Fix NPU LLM example save/load tokenizer (#12485)
|
2024-12-03 16:30:55 +08:00 |
|
binbin Deng
|
ab01753b1c
|
[NPU] update save-load API usage (#12473)
|
2024-12-03 09:46:15 +08:00 |
|
Ruonan Wang
|
3fe2ea3081
|
[NPU] Reuse prefill of acc lib for pipeline (#12279)
* first commit
* update example
* fix style
* update example
* embedding as const
* fix generate
* code refactor
* meet code review
* fix style
* change max_output_len to max_context_len
* fix all-in-one
* fix example
* add check for new tokens
|
2024-10-28 16:05:49 +08:00 |
|
Ch1y0q
|
73a4360f3f
|
update lowbit path for baichuan2, qwen2, generate.py (#12051)
* update lowbit path for baichuan2, qwen2, `generate.py`
* update readme
|
2024-09-10 15:35:24 +08:00 |
|
Zijie Li
|
90f692937d
|
Update npu baichuan2 (#11939)
|
2024-08-27 16:56:26 +08:00 |
|
Jiao Wang
|
b4b6ddf73c
|
NPU Baichuan2 Multi-Process example (#11928)
|
2024-08-27 15:25:49 +08:00 |
|