Commit graph

454 commits

Author SHA1 Message Date
Yishuo Wang
a596f1ae5f
remove bigdl-llm test to fix langchain UT (#12613) 2024-12-26 10:17:25 +08:00
Yuwen Hu
6278cafc25
Add setuptools as a basic dependency (#12563)
* Add setuptools as a basic dependency

* Remove unnecessary requirements of setuptools in example/unit/nightly tests
2024-12-17 16:56:41 +08:00
Chu,Youcheng
acd77d9e87
Remove env variable BIGDL_LLM_XMX_DISABLED in documentation (#12445)
* fix: remove BIGDL_LLM_XMX_DISABLED in mddocs

* fix: remove set SYCL_CACHE_PERSISTENT=1 in example

* fix: remove BIGDL_LLM_XMX_DISABLED in workflows

* fix: merge igpu and A-series Graphics

* fix: remove set BIGDL_LLM_XMX_DISABLED=1 in example

* fix: remove BIGDL_LLM_XMX_DISABLED in workflows

* fix: merge igpu and A-series Graphics

* fix: textual adjustment

* fix: textual adjustment

* fix: textual adjustment
2024-11-27 11:16:36 +08:00
Ruonan Wang
6c5e8fc70c
fix again (#12407) 2024-11-15 11:57:58 +08:00
Ruonan Wang
fcc0fa7316
fix workflow again (#12406)
* fix again

* fix name
2024-11-15 11:01:35 +08:00
Ruonan Wang
548dec5185
fix npu pipeline workflow (#12404) 2024-11-15 10:01:33 +08:00
Yuwen Hu
923d696854
Small fix to LNL performance tests (#12333) 2024-11-05 13:24:58 +08:00
Yuwen Hu
e2adc974fd
Small fix to LNL performance tests (#12331) 2024-11-04 19:22:41 +08:00
Yuwen Hu
522cdf8e9d
Add initial support for LNL nightly performance tests (#12326)
* Add initial support for LNL nightly performance tests

* Small fix
2024-11-04 18:53:51 +08:00
Yuwen Hu
4644cb640c
Perf test further fix regarding trl version (#12321) 2024-11-04 11:01:25 +08:00
Ruonan Wang
8fe01c9e4d
[NPU pipeline] update cmake usage of pipeline (#12320) 2024-11-04 10:30:03 +08:00
Yuwen Hu
94ce447794
Fix performance tests regarding trl version (#12319)
* Fix performance tests regarding trl version

* Small fix
2024-11-04 09:42:18 +08:00
Yuwen Hu
d8c1287335
Further update for Windows dGPU performance tests (#12244) 2024-10-22 15:07:21 +08:00
Yuwen Hu
ac2dac857c
Disable 4k input test for now for Windows dGPU performance test (#12239) 2024-10-21 15:03:26 +08:00
Yuwen Hu
ea5154d85e
Further update to Windows dGPU perf test (#12237) 2024-10-21 10:27:16 +08:00
Yuwen Hu
da9270be2d
Further update to Windows dGPU perf test (#12233) 2024-10-18 23:20:17 +08:00
Yuwen Hu
5935b25622
Further update windows gpu perf test regarding results integrity check (#12232) 2024-10-18 18:15:13 +08:00
Yuwen Hu
ef659629f3
Small update to Windows dGPU perf test (#12230)
* Small update to Windows dGPU perf test

* Small fix

* Small fixes

* Remove unnecessary file
2024-10-18 16:39:59 +08:00
Yuwen Hu
9d7f42fd0f
Support manually trigger of dGPU perf test on Windows (#12229)
* Support manually trigger of dgpu perf test on Windows

* Small fix

* Small fix

* Small update
2024-10-18 15:38:21 +08:00
Yuwen Hu
b88c1df324
Add Llama 3.1 & 3.2 to Arc Performance test (#12225)
* Add llama3.1 and llama3.2 in arc perf (#12202)

* Add llama3.1 and llama3.2 in arc perf

* Uninstall trl after arc test on transformers>=4.40

* Fix arc llama3 perf (#12212)

* Fix pip uninstall

* Uninstall trl after test on transformers==4.43.1

* Fix llama3 arc perf (#12218)

---------

Co-authored-by: Jin, Qiao <89779290+JinBridger@users.noreply.github.com>
2024-10-17 21:12:45 +08:00
Yuwen Hu
c9ac39fc1e
Add Llama 3.2 to iGPU performance test (transformers 4.45) (#12209)
* Add Llama 3.2 to iGPU Perf (#12200)

* Add Llama 3.2 to iGPU Perf

* Downgrade accelerate after step

* Temporarily disable model for test

* Temporarily change ERRORLEVEL check (#12201)

* Restore llama3.2 perf (#12206)

* Revert "Temporarily change ERRORLEVEL check"

This reverts commit 909dbbc930ab4283737161a55bb32006e6ca1991.

* Revert "Temporarily disable model for test"

This reverts commit 95322dc3c6429aa836f21bda0b5ba8d9b48592f8.

---------

Co-authored-by: Jin, Qiao <89779290+JinBridger@users.noreply.github.com>
2024-10-15 17:44:46 +08:00
Shaojun Liu
724b2ae66d
add npu-level0 pipeline.dll to ipex-llm (#12181)
* add npu-level0 pipeline.dll to ipex-llm

* test

* update runner label

* fix

* update

* fix

* fix
2024-10-11 16:05:20 +08:00
Shaojun Liu
9b4fee8b5b
disable nightly release for finetune images (#12070) 2024-09-12 15:10:50 +08:00
Yuwen Hu
c94032f97e
Try to fix llamaindex ut again (#12061) 2024-09-11 12:11:04 +08:00
Yuwen Hu
94dade9aca
Fix UT of ipex_llm.llamaindex (#12055) 2024-09-11 09:58:43 +08:00
Shaojun Liu
77cb348220
fix dependabot alerts (#12006)
* fix dependabot alerts

* update
2024-09-04 17:13:45 +08:00
Shaojun Liu
e5dc4e9123
disable outdated scheduled workflow (#11915) 2024-08-24 07:17:42 +08:00
Shaojun Liu
4cf640c548
update docker image tag to 2.2.0-SNAPSHOT (#11904) 2024-08-23 13:57:41 +08:00
Shaojun Liu
c5b51d41fb
Update pypi tag to 2.2.0.dev0 (#11895) 2024-08-22 16:48:09 +08:00
Yuwen Hu
bac98baab9
Make performance test install specific ipex-llm version from pypi (#11892) 2024-08-22 11:10:12 +08:00
Yuwen Hu
37106a877c
igpu performance test smal fix (#11872) 2024-08-21 03:09:14 +08:00
Yuwen Hu
0d58c2fdf9
Update performance test regarding updated default transformers==4.37.0 (#11869)
* Update igpu performance from transformers 4.36.2 to 4.37.0 (#11841)

* upgrade arc perf test to transformers 4.37 (#11842)

* fix load low bit com dtype (#11832)

* feat: add mixed_precision argument on ppl longbench evaluation

* fix: delete extra code

* feat: upgrade arc perf test to transformers 4.37

* fix: add missing codes

* fix: keep perf test for qwen-vl-chat in transformers 4.36

* fix: remove extra space

* fix: resolve pr comment

* fix: add empty line

* fix: add pip install for spr and core test

* fix: delete extra comments

* fix: remove python -m for pip

* Revert "fix load low bit com dtype (#11832)"

This reverts commit 6841a9ac8f.

---------

Co-authored-by: Zhao Changmin <changmin.zhao@intel.com>
Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com>

* add transformers==4.36 for qwen vl in igpu-perf (#11846)

* add transformers==4.36.2 for qwen-vl

* Small update

---------

Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>

* fix: remove qwen-7b on core test (#11851)

* fix: remove qwen-7b on core test

* fix: change delete to comment

---------

Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com>

* replce filename (#11854)

* fix: remove qwen-7b on core test

* fix: change delete to comment

* fix: replace filename

---------

Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com>

* fix: delete extra comments (#11863)

* Remove transformers installation for temp test purposes

* Small fix

* Small update

---------

Co-authored-by: Chu,Youcheng <70999398+cranechu0131@users.noreply.github.com>
Co-authored-by: Zhao Changmin <changmin.zhao@intel.com>
Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com>
Co-authored-by: Zijie Li <michael20001122@gmail.com>
Co-authored-by: Chu,Youcheng <1340390339@qq.com>
2024-08-20 17:59:28 +08:00
Yuwen Hu
016e840eed
Fix performance tests (#11802)
* Fix performance tests

* Small fix
2024-08-15 01:37:01 +08:00
Shaojun Liu
e3c1dae619
Fix Windows Unit Test (#11801)
* Update llm_unit_tests.yml

* remove debug information

* Delete .github/actions/llm/cli-test-windows directory
2024-08-14 19:16:48 +08:00
Ruonan Wang
43cca3be27
fix gemma2 runtime error caused by sliding window (#11788)
* fix runtime error

* revert workflow
2024-08-14 10:43:33 +08:00
Yuwen Hu
ec184af243
Add gemma-2-2b-it and gemma-2-9b-it to igpu nightly performance test (#11778)
* add yaml and modify `concat_csv.py` for `transformers` 4.43.1 (#11758)

* add yaml and modify `concat_csv.py` for `transformers` 4.43.1

* remove 4.43 for arc; fix;

* remove 4096-512 for 4.43

* comment some models

* Small fix

* uncomment models (#11777)

---------

Co-authored-by: Ch1y0q <qiyue2001@gmail.com>
2024-08-13 15:39:56 +08:00
hxsz1997
8ef4caaf5d
add 3k and 4k input of nightly perf test on iGPU (#11701)
* Add 3k&4k input in workflow for iGPU (#11685)

* add 3k&4k input in workflow

* comment for test

* comment models for accelarate test

* remove OOM models

* modify typo

* change test model (#11696)

* reverse test models (#11700)
2024-08-01 14:17:46 +08:00
Shaojun Liu
4d56ef5646
Fix openssf issue (#11632) 2024-07-22 14:14:28 +08:00
Yuwen Hu
2478e2c14b
Add check in iGPU perf workflow for results integrity (#11616)
* Add csv check for igpu benchmark workflow (#11610)

* add csv check for igpu benchmark workflow

* ready to test

---------

Co-authored-by: ATMxsp01 <shou.xu@intel.com>

* Restore the temporarily removed models in iGPU-perf (#11615)

Co-authored-by: ATMxsp01 <shou.xu@intel.com>

---------

Co-authored-by: Xu, Shuo <100334393+ATMxsp01@users.noreply.github.com>
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-18 14:13:16 +08:00
Shaojun Liu
2b17536424
Fix python style check: update python version to 3.11 (#11601)
* Update python version to 3.11
2024-07-17 15:39:46 +08:00
Xu, Shuo
13a72dc51d
Test MiniCPM performance on iGPU in a more stable way (#11573)
* Test MiniCPM performance on iGPU in a more stable way

* small fix

---------

Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-12 17:07:41 +08:00
Xu, Shuo
1355b2ce06
Add model Qwen-VL-Chat to iGPU-perf (#11558)
* Add model Qwen-VL-Chat to iGPU-perf

* small fix

---------

Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-11 15:39:02 +08:00
Wang, Jian4
51f2effb05
Add xpu-tgi manually_build (#11556) 2024-07-11 10:35:40 +08:00
Yuwen Hu
8982ab73d5
Add Yi-6B and StableLM to iGPU perf test (#11546)
* Add transformer4.38.2 test to igpu benchmark (#11529)

* add transformer4.38.1 test to igpu benchmark

* use transformers4.38.2 & fix csv name error in 4.38 workflow

* add model Yi-6B-Chat & remove temporarily most models

---------

Co-authored-by: ATMxsp01 <shou.xu@intel.com>

* filter some errorlevel (#11541)

Co-authored-by: ATMxsp01 <shou.xu@intel.com>

* Restore the temporarily removed models in iGPU-perf (#11544)

* filter some errorlevel

* restore the temporarily removed models in iGPU-perf

---------

Co-authored-by: ATMxsp01 <shou.xu@intel.com>

---------

Co-authored-by: Xu, Shuo <100334393+ATMxsp01@users.noreply.github.com>
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-09 18:51:23 +08:00
Xu, Shuo
64cfed602d
Add new models to benchmark (#11505)
* Add new models to benchmark

* remove Qwen/Qwen-VL-Chat to pass the validation

---------

Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-08 10:35:55 +08:00
Yuwen Hu
8f376e5192
Change igpu perf to mainly test int4+fp16 (#11513) 2024-07-05 17:12:33 +08:00
Shaojun Liu
932ef78131
Update Workflow Inputs, Runner, and PR Validation Process (#11501)
* update check-artifact runner label to Shire

* update github.event.inputs to inputs

* update PR template
2024-07-03 16:49:54 +08:00
Jun Wang
18c973dc3e
Wang jun/ipex llm workflow (#11499)
* [update] merge manually build for testing function to manualy build

* [FIX] change public type to string

* [FIX] change public type to string

* [FIX] remove github.event prefix for inputs
2024-07-03 10:13:42 +08:00
Yuwen Hu
e53bd4401c
Small typo fixes in binary build workflow (#11494) 2024-07-02 19:11:43 +08:00
Yuwen Hu
4e32c92979
Further fix for triggering perf test from commit (#11493)
* Further fix for triggering perf test from commit

* Small fix
2024-07-02 18:56:53 +08:00