binbin Deng | ec362e6133 | Add llama3 level0 example (#12275) | 2024-10-28 09:24:51 +08:00
Ruonan Wang | 854398f6e0 | update example to reduce peak memory usage (#12274) | 2024-10-25 17:09:26 +08:00
Ruonan Wang | ae57e23e4f | fix incompatibility between llama GW & llama pipeline (#12267): fix; fix | 2024-10-25 10:31:44 +08:00
Ruonan Wang | 821fd96367 | Initial integrate our L0 Llama impl into ipex-llm (#12255): temp save; initial support; fix; simplify code; fix style; fix example; make default value of pipeline as False | 2024-10-24 09:49:27 +08:00
Ruonan Wang | 4d93bb81fe | Initial support of NPU level0 Model (#12177): first commit to support load dll and init llm pipeline; add init generate; fix style; small updates; fix style and check tokens number | 2024-10-11 09:45:53 +08:00