ipex-llm

Author	SHA1	Message	Date
Chu,Youcheng	acd77d9e87	Remove env variable `BIGDL_LLM_XMX_DISABLED` in documentation (#12445 ) * fix: remove BIGDL_LLM_XMX_DISABLED in mddocs * fix: remove set SYCL_CACHE_PERSISTENT=1 in example * fix: remove BIGDL_LLM_XMX_DISABLED in workflows * fix: merge igpu and A-series Graphics * fix: remove set BIGDL_LLM_XMX_DISABLED=1 in example * fix: remove BIGDL_LLM_XMX_DISABLED in workflows * fix: merge igpu and A-series Graphics * fix: textual adjustment * fix: textual adjustment * fix: textual adjustment	2024-11-27 11:16:36 +08:00
Yuwen Hu	923d696854	Small fix to LNL performance tests (#12333 )	2024-11-05 13:24:58 +08:00
Yuwen Hu	e2adc974fd	Small fix to LNL performance tests (#12331 )	2024-11-04 19:22:41 +08:00
Yuwen Hu	522cdf8e9d	Add initial support for LNL nightly performance tests (#12326 ) * Add initial support for LNL nightly performance tests * Small fix	2024-11-04 18:53:51 +08:00
Yuwen Hu	4644cb640c	Perf test further fix regarding trl version (#12321 )	2024-11-04 11:01:25 +08:00
Yuwen Hu	94ce447794	Fix performance tests regarding `trl` version (#12319 ) * Fix performance tests regarding trl version * Small fix	2024-11-04 09:42:18 +08:00
Yuwen Hu	d8c1287335	Further update for Windows dGPU performance tests (#12244 )	2024-10-22 15:07:21 +08:00
Yuwen Hu	ac2dac857c	Disable 4k input test for now for Windows dGPU performance test (#12239 )	2024-10-21 15:03:26 +08:00
Yuwen Hu	ea5154d85e	Further update to Windows dGPU perf test (#12237 )	2024-10-21 10:27:16 +08:00
Yuwen Hu	da9270be2d	Further update to Windows dGPU perf test (#12233 )	2024-10-18 23:20:17 +08:00
Yuwen Hu	5935b25622	Further update windows gpu perf test regarding results integrity check (#12232 )	2024-10-18 18:15:13 +08:00
Yuwen Hu	ef659629f3	Small update to Windows dGPU perf test (#12230 ) * Small update to Windows dGPU perf test * Small fix * Small fixes * Remove unnecessary file	2024-10-18 16:39:59 +08:00
Yuwen Hu	9d7f42fd0f	Support manually trigger of dGPU perf test on Windows (#12229 ) * Support manually trigger of dgpu perf test on Windows * Small fix * Small fix * Small update	2024-10-18 15:38:21 +08:00
Yuwen Hu	b88c1df324	Add Llama 3.1 & 3.2 to Arc Performance test (#12225 ) * Add llama3.1 and llama3.2 in arc perf (#12202) * Add llama3.1 and llama3.2 in arc perf * Uninstall trl after arc test on transformers>=4.40 * Fix arc llama3 perf (#12212) * Fix pip uninstall * Uninstall trl after test on transformers==4.43.1 * Fix llama3 arc perf (#12218) --------- Co-authored-by: Jin, Qiao <89779290+JinBridger@users.noreply.github.com>	2024-10-17 21:12:45 +08:00
Yuwen Hu	c9ac39fc1e	Add Llama 3.2 to iGPU performance test (`transformers 4.45`) (#12209 ) * Add Llama 3.2 to iGPU Perf (#12200) * Add Llama 3.2 to iGPU Perf * Downgrade accelerate after step * Temporarily disable model for test * Temporarily change ERRORLEVEL check (#12201) * Restore llama3.2 perf (#12206) * Revert "Temporarily change ERRORLEVEL check" This reverts commit 909dbbc930ab4283737161a55bb32006e6ca1991. * Revert "Temporarily disable model for test" This reverts commit 95322dc3c6429aa836f21bda0b5ba8d9b48592f8. --------- Co-authored-by: Jin, Qiao <89779290+JinBridger@users.noreply.github.com>	2024-10-15 17:44:46 +08:00
Shaojun Liu	c5b51d41fb	Update pypi tag to 2.2.0.dev0 (#11895 )	2024-08-22 16:48:09 +08:00
Yuwen Hu	bac98baab9	Make performance test install specific ipex-llm version from pypi (#11892 )	2024-08-22 11:10:12 +08:00
Yuwen Hu	37106a877c	igpu performance test smal fix (#11872 )	2024-08-21 03:09:14 +08:00
Yuwen Hu	0d58c2fdf9	Update performance test regarding updated default `transformers==4.37.0` (#11869 ) * Update igpu performance from transformers 4.36.2 to 4.37.0 (#11841) * upgrade arc perf test to transformers 4.37 (#11842) * fix load low bit com dtype (#11832) * feat: add mixed_precision argument on ppl longbench evaluation * fix: delete extra code * feat: upgrade arc perf test to transformers 4.37 * fix: add missing codes * fix: keep perf test for qwen-vl-chat in transformers 4.36 * fix: remove extra space * fix: resolve pr comment * fix: add empty line * fix: add pip install for spr and core test * fix: delete extra comments * fix: remove python -m for pip * Revert "fix load low bit com dtype (#11832)" This reverts commit `6841a9ac8f`. --------- Co-authored-by: Zhao Changmin <changmin.zhao@intel.com> Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> * add transformers==4.36 for qwen vl in igpu-perf (#11846) * add transformers==4.36.2 for qwen-vl * Small update --------- Co-authored-by: Yuwen Hu <yuwen.hu@intel.com> * fix: remove qwen-7b on core test (#11851) * fix: remove qwen-7b on core test * fix: change delete to comment --------- Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> * replce filename (#11854) * fix: remove qwen-7b on core test * fix: change delete to comment * fix: replace filename --------- Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> * fix: delete extra comments (#11863) * Remove transformers installation for temp test purposes * Small fix * Small update --------- Co-authored-by: Chu,Youcheng <70999398+cranechu0131@users.noreply.github.com> Co-authored-by: Zhao Changmin <changmin.zhao@intel.com> Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> Co-authored-by: Zijie Li <michael20001122@gmail.com> Co-authored-by: Chu,Youcheng <1340390339@qq.com>	2024-08-20 17:59:28 +08:00
Yuwen Hu	016e840eed	Fix performance tests (#11802 ) * Fix performance tests * Small fix	2024-08-15 01:37:01 +08:00
Ruonan Wang	43cca3be27	fix gemma2 runtime error caused by sliding window (#11788 ) * fix runtime error * revert workflow	2024-08-14 10:43:33 +08:00
Yuwen Hu	ec184af243	Add `gemma-2-2b-it` and `gemma-2-9b-it` to igpu nightly performance test (#11778 ) * add yaml and modify `concat_csv.py` for `transformers` 4.43.1 (#11758) * add yaml and modify `concat_csv.py` for `transformers` 4.43.1 * remove 4.43 for arc; fix; * remove 4096-512 for 4.43 * comment some models * Small fix * uncomment models (#11777) --------- Co-authored-by: Ch1y0q <qiyue2001@gmail.com>	2024-08-13 15:39:56 +08:00
hxsz1997	8ef4caaf5d	add 3k and 4k input of nightly perf test on iGPU (#11701 ) * Add 3k&4k input in workflow for iGPU (#11685) * add 3k&4k input in workflow * comment for test * comment models for accelarate test * remove OOM models * modify typo * change test model (#11696) * reverse test models (#11700)	2024-08-01 14:17:46 +08:00
Yuwen Hu	2478e2c14b	Add check in iGPU perf workflow for results integrity (#11616 ) * Add csv check for igpu benchmark workflow (#11610) * add csv check for igpu benchmark workflow * ready to test --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com> * Restore the temporarily removed models in iGPU-perf (#11615) Co-authored-by: ATMxsp01 <shou.xu@intel.com> --------- Co-authored-by: Xu, Shuo <100334393+ATMxsp01@users.noreply.github.com> Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-18 14:13:16 +08:00
Xu, Shuo	13a72dc51d	Test MiniCPM performance on iGPU in a more stable way (#11573 ) * Test MiniCPM performance on iGPU in a more stable way * small fix --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-12 17:07:41 +08:00
Xu, Shuo	1355b2ce06	Add model Qwen-VL-Chat to iGPU-perf (#11558 ) * Add model Qwen-VL-Chat to iGPU-perf * small fix --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-11 15:39:02 +08:00
Yuwen Hu	8982ab73d5	Add Yi-6B and StableLM to iGPU perf test (#11546 ) * Add transformer4.38.2 test to igpu benchmark (#11529) * add transformer4.38.1 test to igpu benchmark * use transformers4.38.2 & fix csv name error in 4.38 workflow * add model Yi-6B-Chat & remove temporarily most models --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com> * filter some errorlevel (#11541) Co-authored-by: ATMxsp01 <shou.xu@intel.com> * Restore the temporarily removed models in iGPU-perf (#11544) * filter some errorlevel * restore the temporarily removed models in iGPU-perf --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com> --------- Co-authored-by: Xu, Shuo <100334393+ATMxsp01@users.noreply.github.com> Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-09 18:51:23 +08:00
Xu, Shuo	64cfed602d	Add new models to benchmark (#11505 ) * Add new models to benchmark * remove Qwen/Qwen-VL-Chat to pass the validation --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-08 10:35:55 +08:00
Yuwen Hu	8f376e5192	Change igpu perf to mainly test int4+fp16 (#11513 )	2024-07-05 17:12:33 +08:00
Shaojun Liu	932ef78131	Update Workflow Inputs, Runner, and PR Validation Process (#11501 ) * update check-artifact runner label to Shire * update github.event.inputs to inputs * update PR template	2024-07-03 16:49:54 +08:00
Yuwen Hu	4e32c92979	Further fix for triggering perf test from commit (#11493 ) * Further fix for triggering perf test from commit * Small fix	2024-07-02 18:56:53 +08:00
Yuwen Hu	986b10e397	Further fix for performance tests triggered by pr (#11488 )	2024-07-02 15:29:42 +08:00
Yuwen Hu	bb6953c19e	Support pr validate perf test (#11486 ) * Support triggering performance tests through commits * Small fix * Small fix * Small fixes	2024-07-02 15:20:42 +08:00
Yuwen Hu	ca24794dd0	Fixes for performance test triggering (#11481 )	2024-07-01 18:39:54 +08:00
Yuwen Hu	6bdc562f4c	Enable triggering nightly tests/performance tests from another repo (#11480 ) * Enable triggering from another workflow for nightly tests and example tests * Enable triggering from another workflow for nightly performance tests	2024-07-01 17:45:42 +08:00
Yuwen Hu	75f836f288	Add extra warmup for THUDM/glm-4-9b-chat in igpu-performance test (#11417 )	2024-06-24 18:08:05 +08:00
Shaojun Liu	5e823ef2ce	Fix nightly arc perf (#11404 ) * pip install pytest for arc perf test * trigger test	2024-06-24 15:58:41 +08:00
Wenjing Margaret Mao	c0e86c523a	Add qwen-moe batch1 to nightly perf (#11369 ) * add moe * reduce 437 models * rename * fix syntax * add moe check result * add 430 + 437 * all modes * 4-37-4 exclud * revert & comment --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-20 14:17:41 +08:00
Wenjing Margaret Mao	b2f62a8561	Add batch 4 perf test (#11355 ) * copy files to this branch * add tasks * comment one model * change the model to test the 4.36 * only test batch-4 * typo * typo * typo * typo * typo * typo * add 4.37-batch4 * change the file name * revet yaml file * no print * add batch4 task * revert --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-20 09:48:52 +08:00
Yuwen Hu	a2a5890b48	Make manually-triggered perf test able to choose which test to run (#11324 )	2024-06-17 10:23:13 +08:00
Yuwen Hu	1978f63f6b	Fix igpu performance guide regarding html generation (#11328 )	2024-06-17 10:21:30 +08:00
Wenjing Margaret Mao	b61f6e3ab1	Add update_parent_folder for nightly_perf_test (#11287 ) * add update_parent_folder and change the workflow file * add update_parent_folder and change the workflow file * move to pr mode and comment the test * use one model per comfig * revert --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-12 17:58:13 +08:00
Wenjing Margaret Mao	70b17c87be	Merge multiple batches (#11264 ) * add merge steps * move to pr mode * remove build + add merge.py * add tohtml and change cp * change test_batch folder path * change merge_temp path * change to html folder * revert * change place * revert 437 * revert space --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-07 18:38:45 +08:00
Wenjing Margaret Mao	c825a7e1e9	change the workflow file to test ftp (#11241 ) * change the workflow to test ftp * comment some models * revert file --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-06 16:53:19 +08:00
Wenjing Margaret Mao	231b968aba	Modify the check_results.py to support batch 2&4 (#11133 ) * add batch 2&4 and exclude to perf_test * modify the perf-test&437 yaml * modify llm_performance_test.yml * remove batch 4 * modify check_results.py to support batch 2&4 * change the batch_size format * remove genxir * add str(batch_size) * change actual_test_casese in check_results file to support batch_size * change html highlight * less models to test html and html_path * delete the moe model * split batch html * split * use installing from pypi * use installing from pypi - batch2 * revert cpp * revert cpp * merge two jobs into one, test batch_size in one job * merge two jobs into one, test batch_size in one job * change file directory in workflow * try catch deal with odd file without batch_size * modify pandas version * change the dir * organize the code * organize the code * remove Qwen-MOE * modify based on feedback * modify based on feedback * modify based on second round of feedback * modify based on second round of feedback + change run-arc.sh mode * modify based on second round of feedback + revert config * modify based on second round of feedback + revert config * modify based on second round of feedback + remove comments * modify based on second round of feedback + remove comments * modify based on second round of feedback + revert arc-perf-test * modify based on third round of feedback * change error type * change error type * modify check_results.html * split batch into two folders * add all models * move csv_name * revert pr test * revert pr test --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-05 15:04:55 +08:00
Yuwen Hu	9f8074c653	Add extra warmup for chatglm3-6b in igpu-performance test (#11197 ) * Add extra warmup for chatglm3-6b to record more stable performance (int4+fp32) * Small updates	2024-06-04 14:06:09 +08:00
Yina Chen	b6b70d1ba0	Divide core-xe packages (#11131 ) * temp * add batch * fix style * update package name * fix style * add workflow * use temp version to run uts * trigger performance test * trigger win igpu perf * revert workflow & setup	2024-05-28 12:00:18 +08:00
Jiao Wang	0a06a6e1d4	Update tests for transformers 4.36 (#10858 ) * update unit test * update * update * update * update * update * fix gpu attention test * update * update * update * update * update * update * update example test * replace replit code * update * update * update * update * set safe_serialization false * perf test * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * delete * update * update * update * update * update * update * revert * update	2024-05-24 10:26:38 +08:00
Yuwen Hu	b3027e2d60	Update for cpu install option in performance tests (#11060 )	2024-05-17 10:33:43 +08:00
Yuwen Hu	8010af700f	Update igpu performance test to use pypi installed oneAPI (#11010 )	2024-05-14 14:05:33 +08:00

1 2 3

105 commits