ipex-llm

Author	SHA1	Message	Date
Yuwen Hu	ec184af243	Add `gemma-2-2b-it` and `gemma-2-9b-it` to igpu nightly performance test (#11778 ) * add yaml and modify `concat_csv.py` for `transformers` 4.43.1 (#11758) * add yaml and modify `concat_csv.py` for `transformers` 4.43.1 * remove 4.43 for arc; fix; * remove 4096-512 for 4.43 * comment some models * Small fix * uncomment models (#11777) --------- Co-authored-by: Ch1y0q <qiyue2001@gmail.com>	2024-08-13 15:39:56 +08:00
hxsz1997	8ef4caaf5d	add 3k and 4k input of nightly perf test on iGPU (#11701 ) * Add 3k&4k input in workflow for iGPU (#11685) * add 3k&4k input in workflow * comment for test * comment models for accelarate test * remove OOM models * modify typo * change test model (#11696) * reverse test models (#11700)	2024-08-01 14:17:46 +08:00
Shaojun Liu	4d56ef5646	Fix openssf issue (#11632 )	2024-07-22 14:14:28 +08:00
Yuwen Hu	2478e2c14b	Add check in iGPU perf workflow for results integrity (#11616 ) * Add csv check for igpu benchmark workflow (#11610) * add csv check for igpu benchmark workflow * ready to test --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com> * Restore the temporarily removed models in iGPU-perf (#11615) Co-authored-by: ATMxsp01 <shou.xu@intel.com> --------- Co-authored-by: Xu, Shuo <100334393+ATMxsp01@users.noreply.github.com> Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-18 14:13:16 +08:00
Shaojun Liu	2b17536424	Fix python style check: update python version to 3.11 (#11601 ) * Update python version to 3.11	2024-07-17 15:39:46 +08:00
Xu, Shuo	13a72dc51d	Test MiniCPM performance on iGPU in a more stable way (#11573 ) * Test MiniCPM performance on iGPU in a more stable way * small fix --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-12 17:07:41 +08:00
Xu, Shuo	1355b2ce06	Add model Qwen-VL-Chat to iGPU-perf (#11558 ) * Add model Qwen-VL-Chat to iGPU-perf * small fix --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-11 15:39:02 +08:00
Wang, Jian4	51f2effb05	Add xpu-tgi manually_build (#11556 )	2024-07-11 10:35:40 +08:00
Yuwen Hu	8982ab73d5	Add Yi-6B and StableLM to iGPU perf test (#11546 ) * Add transformer4.38.2 test to igpu benchmark (#11529) * add transformer4.38.1 test to igpu benchmark * use transformers4.38.2 & fix csv name error in 4.38 workflow * add model Yi-6B-Chat & remove temporarily most models --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com> * filter some errorlevel (#11541) Co-authored-by: ATMxsp01 <shou.xu@intel.com> * Restore the temporarily removed models in iGPU-perf (#11544) * filter some errorlevel * restore the temporarily removed models in iGPU-perf --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com> --------- Co-authored-by: Xu, Shuo <100334393+ATMxsp01@users.noreply.github.com> Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-09 18:51:23 +08:00
Xu, Shuo	64cfed602d	Add new models to benchmark (#11505 ) * Add new models to benchmark * remove Qwen/Qwen-VL-Chat to pass the validation --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-08 10:35:55 +08:00
Yuwen Hu	8f376e5192	Change igpu perf to mainly test int4+fp16 (#11513 )	2024-07-05 17:12:33 +08:00
Shaojun Liu	932ef78131	Update Workflow Inputs, Runner, and PR Validation Process (#11501 ) * update check-artifact runner label to Shire * update github.event.inputs to inputs * update PR template	2024-07-03 16:49:54 +08:00
Shaojun Liu	e7ab93b55c	Update pull_request_template.md (#11484 ) * Update pull_request_template.md * refine	2024-07-03 11:13:16 +08:00
Jun Wang	18c973dc3e	Wang jun/ipex llm workflow (#11499 ) * [update] merge manually build for testing function to manualy build * [FIX] change public type to string * [FIX] change public type to string * [FIX] remove github.event prefix for inputs	2024-07-03 10:13:42 +08:00
Yuwen Hu	e53bd4401c	Small typo fixes in binary build workflow (#11494 )	2024-07-02 19:11:43 +08:00
Yuwen Hu	4e32c92979	Further fix for triggering perf test from commit (#11493 ) * Further fix for triggering perf test from commit * Small fix	2024-07-02 18:56:53 +08:00
Jun Wang	6352c718f3	[update] merge manually build for testing function to manualy build (#11491 )	2024-07-02 16:28:15 +08:00
Yuwen Hu	986b10e397	Further fix for performance tests triggered by pr (#11488 )	2024-07-02 15:29:42 +08:00
Yuwen Hu	bb6953c19e	Support pr validate perf test (#11486 ) * Support triggering performance tests through commits * Small fix * Small fix * Small fixes	2024-07-02 15:20:42 +08:00
Shaojun Liu	a1164e45b6	Enable Release Pypi workflow to be called in another repo (#11483 )	2024-07-01 19:48:21 +08:00
Yuwen Hu	fb4774b076	Update pull request template for manually-triggered Unit tests (#11482 )	2024-07-01 19:06:29 +08:00
Yuwen Hu	ca24794dd0	Fixes for performance test triggering (#11481 )	2024-07-01 18:39:54 +08:00
Yuwen Hu	6bdc562f4c	Enable triggering nightly tests/performance tests from another repo (#11480 ) * Enable triggering from another workflow for nightly tests and example tests * Enable triggering from another workflow for nightly performance tests	2024-07-01 17:45:42 +08:00
Yuwen Hu	dbba51f455	Enable LLM UT workflow to be called in another repo (#11475 ) * Enable LLM UT workflow to be called in another repo * Small fixes * Small fix	2024-07-01 15:26:17 +08:00
Shaojun Liu	13f59ae6b4	Fix llm binary build linux-build-avxvnni failure (#11447 ) * skip gpg check failure * skip gpg check	2024-06-27 14:12:14 +08:00
Yuwen Hu	75f836f288	Add extra warmup for THUDM/glm-4-9b-chat in igpu-performance test (#11417 )	2024-06-24 18:08:05 +08:00
Shaojun Liu	5e823ef2ce	Fix nightly arc perf (#11404 ) * pip install pytest for arc perf test * trigger test	2024-06-24 15:58:41 +08:00
Shaojun Liu	5aa3e427a9	Fix docker images (#11362 ) * Fix docker images * add-apt-repository requires gnupg, gpg-agent, software-properties-common * update * avoid importing ipex again	2024-06-20 15:44:55 +08:00
Wenjing Margaret Mao	c0e86c523a	Add qwen-moe batch1 to nightly perf (#11369 ) * add moe * reduce 437 models * rename * fix syntax * add moe check result * add 430 + 437 * all modes * 4-37-4 exclud * revert & comment --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-20 14:17:41 +08:00
Wenjing Margaret Mao	b2f62a8561	Add batch 4 perf test (#11355 ) * copy files to this branch * add tasks * comment one model * change the model to test the 4.36 * only test batch-4 * typo * typo * typo * typo * typo * typo * add 4.37-batch4 * change the file name * revet yaml file * no print * add batch4 task * revert --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-20 09:48:52 +08:00
Qiyuan Gong	de4bb97b4f	Remove accelerate 0.23.0 install command in readme and docker (#11333 ) * ipex-llm's accelerate has been upgraded to 0.23.0. Remove accelerate 0.23.0 install command in README and docker.	2024-06-17 17:52:12 +08:00
Yuwen Hu	a2a5890b48	Make manually-triggered perf test able to choose which test to run (#11324 )	2024-06-17 10:23:13 +08:00
Yuwen Hu	1978f63f6b	Fix igpu performance guide regarding html generation (#11328 )	2024-06-17 10:21:30 +08:00
Wenjing Margaret Mao	b61f6e3ab1	Add update_parent_folder for nightly_perf_test (#11287 ) * add update_parent_folder and change the workflow file * add update_parent_folder and change the workflow file * move to pr mode and comment the test * use one model per comfig * revert --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-12 17:58:13 +08:00
Wenjing Margaret Mao	70b17c87be	Merge multiple batches (#11264 ) * add merge steps * move to pr mode * remove build + add merge.py * add tohtml and change cp * change test_batch folder path * change merge_temp path * change to html folder * revert * change place * revert 437 * revert space --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-07 18:38:45 +08:00
Shaojun Liu	8aabb5bac7	Enable CodeQL Check for CT39 (#11242 ) * Create codeql.yml * Update codeql.yml * Update codeql.yml * Update codeql.yml * Update codeql.yml	2024-06-06 17:41:12 +08:00
Wenjing Margaret Mao	c825a7e1e9	change the workflow file to test ftp (#11241 ) * change the workflow to test ftp * comment some models * revert file --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-06 16:53:19 +08:00
Wenjing Margaret Mao	231b968aba	Modify the check_results.py to support batch 2&4 (#11133 ) * add batch 2&4 and exclude to perf_test * modify the perf-test&437 yaml * modify llm_performance_test.yml * remove batch 4 * modify check_results.py to support batch 2&4 * change the batch_size format * remove genxir * add str(batch_size) * change actual_test_casese in check_results file to support batch_size * change html highlight * less models to test html and html_path * delete the moe model * split batch html * split * use installing from pypi * use installing from pypi - batch2 * revert cpp * revert cpp * merge two jobs into one, test batch_size in one job * merge two jobs into one, test batch_size in one job * change file directory in workflow * try catch deal with odd file without batch_size * modify pandas version * change the dir * organize the code * organize the code * remove Qwen-MOE * modify based on feedback * modify based on feedback * modify based on second round of feedback * modify based on second round of feedback + change run-arc.sh mode * modify based on second round of feedback + revert config * modify based on second round of feedback + revert config * modify based on second round of feedback + remove comments * modify based on second round of feedback + remove comments * modify based on second round of feedback + revert arc-perf-test * modify based on third round of feedback * change error type * change error type * modify check_results.html * split batch into two folders * add all models * move csv_name * revert pr test * revert pr test --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-05 15:04:55 +08:00
Shaojun Liu	dc4fea7e3f	always cleanup conda env after build (#11211 )	2024-06-05 13:46:30 +08:00
Yuwen Hu	9f8074c653	Add extra warmup for chatglm3-6b in igpu-performance test (#11197 ) * Add extra warmup for chatglm3-6b to record more stable performance (int4+fp32) * Small updates	2024-06-04 14:06:09 +08:00
Shaojun Liu	401013a630	Remove chatglm_C Module to Eliminate LGPL Dependency (#11178 ) * remove chatglm_C.*.pyd to solve ngsolve weak copyright vunl fix style check error * remove chatglm native int4 from langchain	2024-05-31 17:03:11 +08:00
Jin Qiao	25b6402315	Add Windows GPU unit test (#11050 ) * Add Windows GPU UT * temporarily remove ut on arc * retry * retry * retry * fix * retry * retry * fix * retry * retry * retry * retry * retry * retry * retry * retry * retry * retry * retry * retry * retry * fix * retry * retry * retry * retry * retry * retry * merge into single workflow * retry inference test * retry * retrigger * try to fix inference test * retry * retry * retry * retry * retry * retry * retry * retry * retry * retry * retry * check lower_bound * retry * retry * try example test * try fix example test * retry * fix * seperate function into shell script * remove cygpath * try remove all cygpath * retry * retry * Revert "try remove all cygpath" This reverts commit 7ceeff3e48f08429062ecef548c1a3ad3488756f. * Revert "retry" This reverts commit 40ea2457843bff6991b8db24316cde5de1d35418. * Revert "retry" This reverts commit 817d0db3e5aec3bd449d3deaf4fb01d3ecfdc8a3. * enable ut * fix * retrigger * retrigger * update download url * fix * fix * retry * add comment * fix	2024-05-28 13:29:47 +08:00
Yina Chen	b6b70d1ba0	Divide core-xe packages (#11131 ) * temp * add batch * fix style * update package name * fix style * add workflow * use temp version to run uts * trigger performance test * trigger win igpu perf * revert workflow & setup	2024-05-28 12:00:18 +08:00
Jiao Wang	0a06a6e1d4	Update tests for transformers 4.36 (#10858 ) * update unit test * update * update * update * update * update * fix gpu attention test * update * update * update * update * update * update * update example test * replace replit code * update * update * update * update * set safe_serialization false * perf test * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * delete * update * update * update * update * update * update * revert * update	2024-05-24 10:26:38 +08:00
Yuwen Hu	1c5ed9b6cf	Fix arc ut (#11096 )	2024-05-22 14:13:13 +08:00
Yuwen Hu	b3027e2d60	Update for cpu install option in performance tests (#11060 )	2024-05-17 10:33:43 +08:00
Yuwen Hu	fff067d240	Make install ut for cpu exactly the same as what we want for users (#11051 )	2024-05-17 10:11:01 +08:00
Shaojun Liu	c62e828281	Create release-ipex-llm.yaml (#11039 )	2024-05-16 11:10:10 +08:00
Qiyuan Gong	4638682140	Fix xpu finetune image path in action (#11037 ) * Fix xpu finetune image path in action	2024-05-16 10:48:02 +08:00
Xiangyu Tian	612a365479	LLM: Install CPU version torch with extras [all] (#10868 ) Modify setup.py to install CPU version torch with extras [all]	2024-05-16 10:39:55 +08:00

433 commits