ipex-llm/python/llm/dev
Yina Chen e37f951cce
[NPU] Groupwise (#12241)
* dq divide

* fix

* support attn divide

* update qwen2 7b

* divide down_proj & other linear

* use concat & reduce sum

* support scale after

* support qwen2

* w/ mm

* update reshape

* spda

* split

* split 2+

* update

* lm head-> 28

* no scale

* update

* update

* update

* fix style

* fix style

* to split linear

* update

* update code

* address comments

* fix style & remove redundant code & revert benchmark scripts

* fix style & remove code

* update save & load

---------

Co-authored-by: Yang Wang <yang3.wang@intel.com>
2024-10-23 14:10:58 +08:00
..
benchmark [NPU] Groupwise (#12241) 2024-10-23 14:10:58 +08:00
test Add benchmark_util for transformers >= 4.44.0 (#12171) 2024-10-14 15:40:12 +08:00
print_glib_requirement.py Fix null pointer dereferences error. (#11125) 2024-05-30 16:16:10 +08:00
release.sh remove (#11527) 2024-07-08 15:49:52 +08:00
release_default_linux.sh Update_document by heyang (#30) 2024-03-25 10:06:02 +08:00
release_default_windows.sh Update_document by heyang (#30) 2024-03-25 10:06:02 +08:00