ipex-llm

Author	SHA1	Message	Date
Zhicun	88463cbf47	fix transformer version (#10788 ) * fix transformer version * uninstall sentence transformer * uninstall * uninstall	2024-04-18 17:37:21 +08:00
Wenjing Margaret Mao	63a9a736be	Merge branch 'intel-analytics:main' into MargarettMao-parent_folder	2024-04-11 07:18:19 +08:00
Wenjing Margaret Mao	50dfcaa8fa	Update llm-ppl-evaluation.yml -- Update llm-ppl-evaluation.yml -- Update HTML file: change from ppl/update_in_parent_folder into harness/update_in_parent_folder ppl test and harness test are using the same update_in_parent_folder file. To reduce the repetition, change the ppl update HTML file to the same one under the harness folder and delete the HTML file under the ppl folder.	2024-04-11 07:15:18 +08:00
Yuwen Hu	97db2492c8	Update setup.py for `bigdl-core-xe-esimd-21` on Windows (#10705 ) * Support bigdl-core-xe-esimd-21 for windows in setup.py * Update setup-llm-env accordingly	2024-04-09 18:21:21 +08:00
Shaojun Liu	e10040b7f1	upgrade to python 3.11 (#10695 )	2024-04-09 17:04:42 +08:00
Chen, Zhentao	d59e0cce5c	Migrate harness to ipexllm (#10703 ) * migrate to ipexlm * fix workflow * fix run_multi * fix precision map * rename ipexlm to ipexllm * rename bigdl to ipex in comments	2024-04-09 15:48:53 +08:00
Zhicun	f03c029914	pydantic version>=2.0.0 for llamaindex (#10694 ) * pydantic version * pydantic version * upgrade version	2024-04-09 09:48:42 +08:00
Shaojun Liu	db7c5cb78f	update model path for spr perf test (#10687 ) * update model path for spr perf test * revert	2024-04-08 10:21:56 +08:00
Shaojun Liu	d18dbfb097	update spr perf test (#10644 )	2024-04-03 15:53:55 +08:00
Shaojun Liu	0779ca3db0	Bump ossf/scorecard-action to v2.3.1 (#10639 ) * Bump ossf/scorecard-action to v2.3.1 * revert	2024-04-03 11:14:18 +08:00
Shaojun Liu	dfcf08c58a	update ossf/scorecard-action to fix TUF invalid key bug (#10635 )	2024-04-03 09:55:32 +08:00
Shaojun Liu	a10f5a1b8d	add python style check (#10620 ) * add python style check * fix style checks * update runner * add ipex-llm-finetune-qlora-cpu-k8s to manually_build workflow * update tag to 2.1.0-SNAPSHOT	2024-04-02 16:17:56 +08:00
Shaojun Liu	20a5e72da0	refine and verify ipex-llm-serving-xpu docker document (#10615 ) * refine serving on cpu/xpu * minor fix * replace localhost with 0.0.0.0 so that service can be accessed through ip address	2024-04-02 11:45:45 +08:00
Shaojun Liu	c4b533f0e1	nightly build docker images (#10585 ) * nightly build docker images	2024-03-29 16:12:28 +08:00
Cheen Hau, 俊豪	1c5eb14128	Update pip install to use --extra-index-url for ipex package (#10557 ) * Change to 'pip install .. --extra-index-url' for readthedocs * Change to 'pip install .. --extra-index-url' for examples * Change to 'pip install .. --extra-index-url' for remaining files * Fix URL for ipex * Add links for ipex US and CN servers * Update ipex cpu url * remove readme * Update for github actions * Update for dockerfiles	2024-03-28 09:56:23 +08:00
Shaojun Liu	924e01b842	Create scorecard.yml (#10559 )	2024-03-27 16:51:10 +08:00
Shaojun Liu	bb9be70105	replace bigdl-llm with ipex-llm (#10545 )	2024-03-26 15:12:38 +08:00
Shaojun Liu	c563b41491	add nightly_build workflow (#10533 ) * add nightly_build workflow * add create-job-status-badge action * update * update * update * update setup.py * release * revert	2024-03-26 12:47:38 +08:00
Shaojun Liu	93e6804bfe	update nightly test (#10520 ) * trigger nightly test * trigger perf test * update bigdl-llm to ipex-llm * revert	2024-03-25 18:22:05 +08:00
Wang, Jian4	a1048ca7f6	Update setup.py and add new actions and add compatible mode (#25 ) * update setup.py * add new action * add compatible mode	2024-03-22 15:44:59 +08:00
Yuwen Hu	1579ee4421	[LLM] Add nightly igpu perf test for INT4+FP16 1024-128 (#10496 )	2024-03-21 16:07:06 +08:00
Shaojun Liu	a57fd52a5b	pip install notebook (#10444 )	2024-03-18 13:56:34 +08:00
Keyan (Kyrie) Zhang	444b11af22	Add LangChain upstream ut test for ipynb (#10387 ) * Add LangChain upstream ut test for ipynb * Integrate unit test for LangChain upstream ut and ipynb into one file * Modify file name * Remove LangChain version update in unit test * Move Langchain upstream ut job to arc * Modify path in .yml file * Modify path in llm_unit_tests.yml * Avoid create directory repeatedly	2024-03-15 16:31:01 +08:00
Yuxuan Xia	a90e9b6ec2	Fix C-Eval Workflow (#10359 ) * Fix Baichuan2 prompt format * Fix ceval workflow errors * Fix ceval workflow error * Fix ceval error * Fix ceval error * Test ceval * Fix ceval * Fix ceval * Fix ceval * Fix ceval * Fix ceval * Fix ceval * Fix ceval * Fix ceval * Fix ceval * Fix ceval * Fix ceval * Fix ceval * Add ceval dependency test * Fix ceval * Fix ceval * Test full ceval * Test full ceval * Fix ceval * Fix ceval	2024-03-13 17:23:17 +08:00
Keyan (Kyrie) Zhang	7cf01e6ec8	Add LangChain upstream ut test (#10349 ) * Add LangChain upstream ut test * Add LangChain upstream ut test * Specify version numbers in yml script * Correct langchain-community version	2024-03-13 09:52:45 +08:00
binbin Deng	df3bcc0e65	LLM: remove english_quotes dataset (#10370 )	2024-03-12 16:57:40 +08:00
WeiguangHan	17bdb1a60b	LLM: add whisper models into nightly test (#10193 ) * LLM: add whisper models into nightly test * small fix * small fix * add more whisper models * test all cases * test specific cases * collect the csv * store the resut * to html * small fix * small test * test all cases * modify whisper_csv_to_html	2024-03-11 20:00:47 +08:00
Chen, Zhentao	a425eaabfc	fix from_pretrained when device_map=None (#10361 ) * pr trigger * fix error when device_map=None * fix device_map=None	2024-03-11 16:06:12 +08:00
Keyan (Kyrie) Zhang	f1825d7408	Add RMSNorm unit test (#10190 )	2024-03-08 15:51:03 +08:00
Yuxuan Xia	0c8d3c9830	Add C-Eval HTML report (#10294 ) * Add C-Eval HTML report * Fix C-Eval workflow pr trigger path * Fix C-Eval workflow typos * Add permissions to C-Eval workflow * Fix C-Eval workflow typo * Add pandas dependency * Fix C-Eval workflow typo	2024-03-07 16:44:49 +08:00
hxsz1997	b7db21414e	Update llamaindex ut (#10338 ) * add test_llamaindex of gpu * add llamaindex gpu tests bash * add llamaindex cpu tests bash * update name of Run LLM langchain GPU test * import llama_index in llamaindex gpu ut * update the dependency of test_llamaindex * add Run LLM llamaindex GPU test * modify import dependency of llamaindex cpu test * add Run LLM llamaindex test * update llama_model_path * delete unused model path * add LLAMA2_7B_ORIGIN_PATH in llamaindex cpu test	2024-03-07 10:06:16 +08:00
dingbaorong	fc7f10cd12	add langchain gpu example (#10277 ) * first draft * fix * add readme for transformer_int4_gpu * fix doc * check device_map * add arc ut test * fix ut test * fix langchain ut * Refine README * fix gpu mem too high * fix ut test --------- Co-authored-by: Ariadne <wyn2000330@126.com>	2024-03-05 13:33:57 +08:00
Yuwen Hu	5dbbe1a826	[LLM] Support for new arc ut runner (#10311 ) * Support for new arc ut runner * Comment unnecessary OMP_NUM_THREADS related settings for arc uts	2024-03-04 18:42:02 +08:00
Yuwen Hu	d45e577d8c	[LLM] Test `load_low_bit` in iGPU perf test on Windows (#10313 )	2024-03-04 18:03:57 +08:00
Shaojun Liu	bab2ee5f9e	update nightly spr perf test (#10178 ) * update nightly spr perf test * update * update runner lable * update * update * update folder * revert	2024-03-04 13:46:33 +08:00
Shaojun Liu	57e211dab4	topLevel 'contents' permission set to 'read' (#10295 )	2024-03-04 10:33:19 +08:00
hxsz1997	925aff730e	Integrate the result of ppl and harness (#10265 ) * modify NIGHTLY_MATRIX_PRECISION * change ACC_FOLDER of harness * change ACC_FOLDER of ppl	2024-02-28 17:53:02 +08:00
Yuwen Hu	d85f7c78df	Small fix for better trail (#10256 )	2024-02-27 20:00:40 +08:00
hxsz1997	cba61a2909	Add html report of ppl (#10218 ) * remove include and language option, select the corresponding dataset based on the model name in Run * change the nightly test time * change the nightly test time of harness and ppl * save the ppl result to json file * generate csv file and print table result * generate html * modify the way to get parent folder * update html in parent folder * add llm-ppl-summary and llm-ppl-summary-html * modify echo single result * remove download fp16.csv * change model name of PR * move ppl nightly related files to llm/test folder * reformat * seperate make_table from make_table_and_csv.py * separate make_csv from make_table_and_csv.py * update llm-ppl-html * remove comment * add Download fp16.results	2024-02-27 17:37:08 +08:00
hxsz1997	15ad2fd72e	Merge pull request #10226 from zhentaocc/fix_harness Fix harness	2024-02-26 16:49:27 +08:00
Chen, Zhentao	5ad752bae8	Separate llmcpp build of linux and windows (#10136 ) * separate linux window llmcpp build * harness run on linux only * fix platform * skip error * change to linux only build * add judgement of platform * add download args * remove \|\|true	2024-02-26 15:04:29 +08:00
Chen, Zhentao	62350a36f0	fix if in update html	2024-02-26 13:39:59 +08:00
Yuxuan Xia	0c6aef0f47	Add einops dependency for C-Eval (#10234 ) * Add c-eval workflow and modify running files * Modify the chatglm evaluator file * Modify the ceval workflow for triggering test * Modify the ceval workflow file * Modify the ceval workflow file * Modify ceval workflow * Adjust the ceval dataset download * Add ceval workflow dependencies * Modify ceval workflow dataset download * Add ceval test dependencies * Add ceval test dependencies * Correct the result print * Fix the nightly test trigger time * Fix ChatGLM loading issue * Add einops dependency	2024-02-26 10:13:10 +08:00
Chen, Zhentao	85d13c65de	run one job only if triggered by pr	2024-02-24 00:33:33 +08:00
Chen, Zhentao	a55cc91e1f	fix make_csv.py	2024-02-23 20:25:46 +08:00
Chen, Zhentao	a204337cad	Rename results	2024-02-23 17:12:37 +08:00
Chen, Zhentao	4fdf96dc8b	fix ACC_FOLDER	2024-02-23 17:11:03 +08:00
Chen, Zhentao	e838ec9e14	remove dependency	2024-02-23 16:33:40 +08:00
Chen, Zhentao	88f7f56980	rewrite html visualization	2024-02-23 16:33:39 +08:00
Chen, Zhentao	bfa98666a6	fall back to make_table.py	2024-02-23 16:33:38 +08:00
Chen, Zhentao	02cb96e7f6	fix Run Harness job	2024-02-23 16:33:37 +08:00
Chen, Zhentao	e1fcf54a0c	reformat	2024-02-23 16:33:36 +08:00
Chen, Zhentao	5399343adc	fix harness installation	2024-02-23 16:33:35 +08:00
Chen, Zhentao	9c8e349196	remove harness job output	2024-02-23 16:33:34 +08:00
Chen, Zhentao	8472de90e8	use stable lm to test pr	2024-02-23 16:33:34 +08:00
Chen, Zhentao	f315c7f93a	Move harness nightly related files to llm/test folder (#10209 ) * move harness nightly files to test folder * change workflow file path accordingly * use arc01 when pr * fix path * fix fp16 csv path	2024-02-23 11:12:36 +08:00
Ovo233	60e11b6739	LLM: Add mlp layer unit tests (#10200 ) * add mlp layer unit tests * add download baichuan-13b * exclude llama for now * install additional packages * rename bash file * switch to Baichuan2 * delete attention related code * fix name errors in yml file	2024-02-22 13:44:45 +08:00
hxsz1997	5b387bb71a	Change the nightly test time of ppl and harness (#10198 ) * remove include and language option, select the corresponding dataset based on the model name in Run * change the nightly test time * change the nightly test time of harness and ppl	2024-02-21 17:39:33 +08:00
yb-peng	b1a97b71a9	Harness eval: Add is_last parameter and fix logical operator in highlight_vals (#10192 ) * Add is_last parameter and fix logical operator in highlight_vals * Add script to update HTML files in parent folder * Add running update_html_in_parent_folder.py in summarize step * Add licence info * Remove update_html_in_parent_folder.py in Summarize the results for pull request	2024-02-21 14:45:32 +08:00
Chen, Zhentao	39d37bd042	upgrade harness package version in workflow (#10188 ) * upgrade harness * update readme	2024-02-21 11:21:30 +08:00
yb-peng	de3dc609ee	Modify harness evaluation workflow (#10174 ) * Modify table head in harness * Specify the file path of fp16.csv * change run to run nightly and run pr to debug * Modify the way to get fp16.csv to downloading from github * Change the method to calculate diff in html table * Change the method to calculate diff in html table * Re-arrange job order * Re-arrange job order * Change limit * Change fp16.csv path * Change highlight rules * Change limit	2024-02-20 18:55:43 +08:00
hxsz1997	b55fd00fb1	remove include and language option, select the corresponding dataset based on the model name in Run (#10181 )	2024-02-20 17:34:52 +08:00
hxsz1997	6e10d98a8d	Fix some typos (#10175 ) * add llm-ppl workflow * update the DATASET_DIR * test multiple precisions * modify nightly test * match the updated ppl code * add matrix.include * fix the include error * update the include * add more model * update the precision of include * update nightly time and add more models * fix the workflow_dispatch description, change default model of pr and modify the env * modify workflow_dispatch language options * modify options * modify language options * modeify workflow_dispatch type * modify type * modify the type of language * change seq_len type * fix some typos * revert changes to stress_test.txt	2024-02-20 14:14:53 +08:00
Yuxuan Xia	71875ebc24	Fix the C-Eval nightly test trigger time (#10172 ) * Add c-eval workflow and modify running files * Modify the chatglm evaluator file * Modify the ceval workflow for triggering test * Modify the ceval workflow file * Modify the ceval workflow file * Modify ceval workflow * Adjust the ceval dataset download * Add ceval workflow dependencies * Modify ceval workflow dataset download * Add ceval test dependencies * Add ceval test dependencies * Correct the result print * Fix the nightly test trigger time	2024-02-20 09:53:59 +08:00
WeiguangHan	6c09aed90d	LLM: add qwen_1.5_7b model for arc perf test (#10166 ) * LLM: add qwen_1.5_7b model for arc perf test * small fix * revert some codes	2024-02-19 17:21:00 +08:00
Yuxuan Xia	209122559a	Add Ceval workflow and modify the result printing (#10140 ) * Add c-eval workflow and modify running files * Modify the chatglm evaluator file * Modify the ceval workflow for triggering test * Modify the ceval workflow file * Modify the ceval workflow file * Modify ceval workflow * Adjust the ceval dataset download * Add ceval workflow dependencies * Modify ceval workflow dataset download * Add ceval test dependencies * Add ceval test dependencies * Correct the result print	2024-02-19 17:06:53 +08:00
yb-peng	50fa004ba5	Specify the version of pandas in harness evaluation workflow (#10159 ) * Specify the version of pandas in harness evaluation workflow * Specify the version of pandas in harness evaluation workflow	2024-02-19 16:27:08 +08:00
Shaojun Liu	7a3a20cf5b	Fix: GitHub-owned GitHubAction not pinned by hash (#10152 )	2024-02-18 16:49:28 +08:00
Shaojun Liu	c3daacec6d	Fix Token Permission issues (#10151 ) Co-authored-by: Your Name <Your Email>	2024-02-18 13:23:54 +08:00
yb-peng	b7c5104d98	remove limit in harness run (#10139 )	2024-02-09 11:20:53 +08:00
yb-peng	b4dc33def6	In harness-evaluation workflow, add statistical tables (#10118 ) * chnage storage * fix typo * change label * change label to arc03 * change needs in the last step * add generate csv in harness/make_table_results.py * modify needs in the last job * add csv to html * mfix path issue in llm-harness-summary-nightly * modify output_path * modify args in make_table_results.py * modify make table command in summary * change pr env label * remove irrelevant code in summary; add set output path step; add limit in harness run * re-organize code structure * modify limit in run harness * modify csv_to_html input path * modify needs in summary-nightly	2024-02-08 19:01:05 +08:00
Shaojun Liu	c2378a9546	Fix code scanning issues (#10129 ) * Fix code scanning issues * update oneccl_bind_pt link * update * update --------- Co-authored-by: Your Name <Your Email>	2024-02-08 17:19:44 +08:00
Yuwen Hu	81ed65fbe7	[LLM] Add qwen1.5-7B in iGPU perf (#10127 ) * Add qwen1.5 test config yaml with transformers 4.37.0 * Update for yaml file	2024-02-07 22:31:20 +08:00
hxsz1997	b4c327ea78	Llm ppl workflow bug fix (#10128 ) * add llm-ppl workflow * update the DATASET_DIR * test multiple precisions * modify nightly test * match the updated ppl code * add matrix.include * fix the include error * update the include * add more model * update the precision of include * update nightly time and add more models * fix the workflow_dispatch description, change default model of pr and modify the env * modify workflow_dispatch language options * modify options * modify language options * modeify workflow_dispatch type * modify type * modify the type of language * change seq_len type	2024-02-07 18:48:14 +08:00
hxsz1997	76bd792ff1	Fix llm ppl workflow workflow_dispatch bugs (#10125 ) * add llm-ppl workflow * update the DATASET_DIR * test multiple precisions * modify nightly test * match the updated ppl code * add matrix.include * fix the include error * update the include * add more model * update the precision of include * update nightly time and add more models * fix the workflow_dispatch description, change default model of pr and modify the env * modify workflow_dispatch language options * modify options * modify language options	2024-02-07 17:41:44 +08:00
Shaojun Liu	9f5a86f9db	fix OpenSSF Token-Permissions issues (#10121 ) Co-authored-by: Your Name <Your Email>	2024-02-07 16:51:10 +08:00
hxsz1997	1710ecb990	Add llm-ppl workflow (#10074 ) * add llm-ppl workflow * update the DATASET_DIR * test multiple precisions * modify nightly test * match the updated ppl code * add matrix.include * fix the include error * update the include * add more model * update the precision of include * update nightly time and add more models * fix the workflow_dispatch description, change default model of pr and modify the env	2024-02-07 16:29:57 +08:00
Shaojun Liu	5e9710cec4	Update threshold for cpu stable version tests (#10108 ) * update threshold * update * test * update * update * revert * revert --------- Co-authored-by: Your Name <Your Email>	2024-02-07 11:21:23 +08:00
pengyb2001	f63eba6c5a	change pr test machine	2024-02-06 23:35:18 +08:00
pengyb2001	e627727b4b	change download path	2024-02-06 21:12:51 +08:00
pengyb2001	2c4e610743	remove irrelevant code	2024-02-06 20:12:10 +08:00
pengyb2001	d11ef0d117	remove retry in llm install part	2024-02-06 14:25:26 +08:00
pengyb2001	94723bb0b1	add retry in run llm install part;test arc05 with llama2	2024-02-06 14:09:14 +08:00
pengyb2001	2c75b5b981	remove mistral in pr job	2024-02-06 13:51:57 +08:00
pengyb2001	5edefe7d8e	remove nightly summary job	2024-02-06 13:50:38 +08:00
pengyb2001	bc92dbf7be	remove stableml;change schedule;change storage method	2024-02-06 11:20:37 +08:00
dingbaorong	36c9442c6d	Arc Stable version test (#10087 ) * add batch_size in stable version test * add batch_size in excludes * add excludes for batch_size * fix ci * triger regression test * fix xpu version * disable ci * address kai's comment --------- Co-authored-by: Ariadne <wyn2000330@126.com>	2024-02-06 10:23:50 +08:00
Yuwen Hu	136f042f84	[LLM] Make sure python 310-311 tests only happen for nightly tests (#10081 ) * Make sure python 310-311 tests only happen for nightly tests * Use default runner for setup-python-version * Small fixes	2024-02-04 16:14:48 +08:00
Jin Qiao	f9a468a2c7	LLM: conditionally choose python version for unit test (#10062 ) * conditional python version * retry * temporary skip llm-cpp-build * apply on llm-unit-test-on-arc * fix * add llm-cpp-build dependency * use GITHUB_OUTPUT instead of set-output * check nightly build * fix quote * fix quote * add llm-cpp-build dependency * test nightly build * test pull request	2024-02-04 13:37:34 +08:00
yb-peng	738275761d	In llm-harness-evaluation, add new models and change schedule to nightly (#10072 ) * add new models and change schedule to nightly * correct syntax error * modify env set up and job * change label and schedule time * change schedule time * change label	2024-02-04 13:12:09 +08:00
Shaojun Liu	698f84648c	split stable version tests (#10076 ) Co-authored-by: Your Name <Your Email>	2024-02-04 11:08:12 +08:00
WeiguangHan	d2d3f6b091	LLM: ensure the result of daily arc perf test (#10016 ) * ensure the result of daily arc perf test * small fix * small fix * small fix * small fix * small fix * small fix * small fix * small fix * small fix * small fix * concat more csvs * small fix * revert some files	2024-01-31 18:26:21 +08:00
Yuwen Hu	863c3f94d0	[LLM] Change nightly perf to install from pypi (#10027 ) * Change to install from pypi and have a check to make sure the installed bigdl-llm version is as expected * Make sure result date is the same as tested bigdl-llm version * Small fixes * Small fix * Small fixes * Small fix * Small fixes * Small updates	2024-01-30 18:15:44 +08:00
Yuwen Hu	68d78fb57e	[LLM] Small improvement to iGPU perf test #9915 ) - Avoid delete csv if there is something wrong with concating csv	2024-01-17 11:21:58 +08:00
Yuwen Hu	8643b62521	[LLM] Support longer context in iGPU perf tests (2048-256) (#9910 )	2024-01-16 17:48:37 +08:00
Yuwen Hu	023679459e	[LLM] Small fixes for finetune related examples and UTs (#9870 )	2024-01-09 18:05:03 +08:00
Yuwen Hu	aebed4b7bc	Enable llm gpu tests for PyTorch 2.1 (#9863 )	2024-01-09 16:29:02 +08:00
Yuwen Hu	5ba1dc38d4	[LLM] Change default Linux GPU install option to PyTorch 2.1 (#9858 ) * Update default xpu to ipex 2.1 * Update related install ut support correspondingly * Add arc ut tests for both ipex 2.0 and 2.1 * Small fix * Diable ipex 2.1 test for now as oneapi 2024.0 has not beed installed on the test machine * Update document for default PyTorch 2.1 * Small fix * Small fix * Small doc fixes * Small fixes	2024-01-08 17:16:17 +08:00
Chen, Zhentao	cad5c2f516	fixed harness deps version (#9854 ) * fixed harness deps version * fix typo	2024-01-08 15:22:42 +08:00
WeiguangHan	4269a585b2	LLM: arc perf test using ipex2.1 (#9837 ) * LLM: upgrade to ipex_2.1 for arc perf test * revert llm_performance_tests.yml	2024-01-05 18:12:19 +08:00
Yuwen Hu	86f86a64a2	Small fixes to ipex 2.1 UT support (#9848 )	2024-01-05 17:36:21 +08:00
Yuwen Hu	f25d23dfbf	[LLM] Add support for PyTorch 2.1 install in UT for GPU (#9845 ) * Add support for ipex 2.1 install in UT and fix perf test * Small fix	2024-01-05 16:13:18 +08:00
Yuwen Hu	ad4a6b5096	Fix langchain UT by not downgrading typing-extension (#9842 )	2024-01-05 13:38:04 +08:00
Wang, Jian4	4ceefc9b18	LLM: Support bitsandbytes config on qlora finetune (#9715 ) * test support bitsandbytesconfig * update style * update cpu example * update example * update readme * update unit test * use bfloat16 * update logic * use int4 * set defalut bnb_4bit_use_double_quant * update * update example * update model.py * update * support lora example	2024-01-04 11:23:16 +08:00
dingbaorong	f6bb4ab313	Arc stress test (#9795 ) * add arc stress test * triger ci * triger CI * triger ci * disable ci	2023-12-27 21:02:41 +08:00
Yuwen Hu	dfe28c58bb	Small upload fix for igpu-perf test (#9792 )	2023-12-27 15:50:58 +08:00
Shaojun Liu	6c75c689ea	bigdl-llm stress test for stable version (#9781 ) * 1k-512 2k-512 baseline * add cpu stress test * update yaml name * update * update * clean up * test * update * update * update * test * update	2023-12-27 15:40:53 +08:00
dingbaorong	5cfb4c4f5b	Arc stable version performance regression test (#9785 ) * add arc stable version regression test * empty gpu mem between different models * triger ci * comment spr test * triger ci * address kai's comments and disable ci * merge fp8 and int4 * disable ci	2023-12-27 11:01:56 +08:00
Yuwen Hu	c38e18f2ff	[LLM] Migrate iGPU perf tests to new machine (#9784 ) * Move 1024 test just after 32-32 test; and enable all model for 1024-128 * Make sure python output encoding in utf-8 so that redirect to txt can always be success * Upload results to ftp * Small fix	2023-12-26 19:15:57 +08:00
Shaojun Liu	b6222404b8	bigdl-llm stable version: let the perf test fail if the difference between perf and baseline is greater than 5% (#9750 ) * test * test * test * update * revert	2023-12-25 13:47:11 +08:00
Chen, Zhentao	4a98bfa5ae	fix harness manual run env typo (#9763 )	2023-12-22 18:42:35 +08:00
Yuwen Hu	02436c6cce	[LLM] Enable more long context in-out pairs for iGPU perf tests (#9765 ) * Add test for 1024-128 and enable more tests for 512-64 * Fix date in results csv name to the time when the performance is triggered * Small fix * Small fix * further fixes	2023-12-22 18:18:23 +08:00
Yuwen Hu	1c8c104bb8	[LLM] Small fixes for igpu win perf tests (#9756 )	2023-12-22 15:51:03 +08:00
Chen, Zhentao	86a69e289c	fix harness runner label of manual trigger (#9754 ) * fix runner * update golden	2023-12-22 15:09:22 +08:00
WeiguangHan	2d1bf20309	LLM: small fix llm_performance_tests.html (#9753 ) * LLM: small fix llm_performance_tests.html * reslove some comments * revert the llm_performance_test.yaml	2023-12-22 13:55:01 +08:00
Shaojun Liu	bb52239e0a	bigdl-llm stable version release & test (#9732 ) * stable version test * trigger spr test * update * trigger * test * test * test * test * test * refine * release linux first	2023-12-21 22:55:33 +08:00
WeiguangHan	34bb804189	LLM: check csv and its corresponding yaml file (#9702 ) * LLM: check csv and its corresponding yaml file * run PR arc perf test * modify the name of some variables * execute the check results script in right place * use cp to replace mv command * resolve some comments * resolve more comments * revert the llm_performance_test.yaml file	2023-12-21 09:54:33 +08:00
WeiguangHan	3aa8b66bc3	LLM: remove starcoder-15.5b model temporarily (#9720 )	2023-12-19 20:14:46 +08:00
Chen, Zhentao	b3647507c0	Fix harness workflow (#9704 ) * error when larger than 0.001 * fix env setup * fix typo * fix typo	2023-12-18 15:42:10 +08:00
WeiguangHan	1f0245039d	LLM: check the final csv results for arc perf test (#9684 ) * LLM: check the final csv results for arc perf test * delete useless python script * change threshold * revert the llm_performance_tests.yml	2023-12-14 19:46:08 +08:00
Yuwen Hu	82ac2dbf55	[LLM] Small fixes for win igpu test for ipex 2.1 (#9686 ) * Fixes to install for igpu performance tests * Small update for core performance tests model lists	2023-12-14 15:39:51 +08:00
Yuwen Hu	cbdd49f229	[LLM] win igpu performance for ipex 2.1 and oneapi 2024.0 (#9679 ) * Change igpu win tests for ipex 2.1 and oneapi 2024.0 * Qwen model repo id updates; updates model list for 512-64 * Add .eval for win igpu all-in-one benchmark for best performance	2023-12-13 18:52:29 +08:00
Yuwen Hu	017932a7fb	Small fix for html generation (#9656 )	2023-12-12 14:06:18 +08:00
WeiguangHan	1e25499de0	LLM: test new oneapi (#9654 ) * test new oneapi * revert llm_performance_tests.yml	2023-12-12 11:12:14 +08:00
Yuwen Hu	d272b6dc47	[LLM] Enable generation of html again for win igpu tests (#9652 ) * Enable generation of html again and comment out rwkv for 32-512 as it is not very stable * Small fix	2023-12-11 19:15:17 +08:00
Yuwen Hu	894d0aaf5e	[LLM] iGPU win perf test reorg based on in-out pairs (#9645 ) * trigger pr temparorily * Saparate benchmark run for win igpu based in in-out pairs * Rename fix * Test workflow * Small fix * Skip generation of html for now * Change back to nightly triggered	2023-12-08 20:46:40 +08:00
Chen, Zhentao	972cdb9992	gsm8k OOM workaround (#9597 ) * update bigdl_llm.py * update the installation of harness * fix partial function * import ipex * force seq len in decrease order * put func outside class * move comments * default 'trust_remote_code' as True * Update llm-harness-evaluation.yml	2023-12-08 18:47:25 +08:00
WeiguangHan	1ff4bc43a6	degrade pandas version (#9643 )	2023-12-08 17:44:51 +08:00
Yuwen Hu	c998f5f2ba	[LLM] iGPU long context tests (#9598 ) * Temp enable PR * Enable tests for 256-64 * Try again 128-64 * Empty cache after each iteration for igpu benchmark scripts * Try tests for 512 * change order for 512 * Skip chatglm3 and llama2 for now * Separate tests for 512-64 * Small fix * Further fixes * Change back to nightly again	2023-12-06 10:19:20 +08:00
Chen, Zhentao	8c8a27ded7	Add harness summary job (#9457 ) * format yml * add make_table_results * add summary job * add a job to print single result * upload full directory	2023-12-05 10:04:10 +08:00
Yuwen Hu	3f4ad97929	[LLM] Add performance tests for windows iGPU (#9584 ) * Add support for win gpu benchmark with peak gpu memory monitoring * Add win igpu tests * Small fix * Forward outputs * Small fix * Test and small fixes * Small fix * Small fix and test * Small fixes * Add tests for 512-64 and change back to nightly tests * Small fix	2023-12-04 20:50:02 +08:00
Chen, Zhentao	29d5bb8df4	Harness workflow dispatch (#9591 ) * add set-matrix job * add workflow_dispatch * fix context * fix manual run * rename step * add quotes * add runner option * not required labels * add runner label to output * use double quote	2023-12-04 15:53:29 +08:00
Chen, Zhentao	9557aa9c21	Fix harness nightly (#9586 ) * update golden * loose the restriction of diff * only compare results when scheduled	2023-12-04 11:45:00 +08:00
Chen, Zhentao	5de92090b3	try to fix deps installation of bigdl (#9578 )	2023-12-01 15:25:47 +08:00
Chen, Zhentao	cb228c70ea	Add harness nightly (#9552 ) * modify output_path as a directory * schedule nightly at 21 on Friday * add tasks and models for nightly * add accuracy regression * comment out if to test * mixed fp4 * for test * add missing delimiter * remove comma * fixed golden results * add mixed 4 golden result * add more options * add mistral results * get golden result of stable lm * move nightly scripts and results to test folder * add license * add fp8 stable lm golden * run on all available devices * trigger only when ready for review * fix new line * update golden * add mistral	2023-12-01 14:16:35 +08:00
Chen, Zhentao	4d7d5d4c59	Add 3 leaderboard tasks (#9566 ) * update leaderboard map * download model and dataset without overwritten * fix task drop * run on all available devices	2023-12-01 14:01:14 +08:00
Chen, Zhentao	c8e0c2ed48	Fixed dumped logs in harness (#9549 ) * install transformers==4.34.0 * modify output_path as a directory * add device and task to output dir parents	2023-11-30 12:47:56 +08:00
WeiguangHan	5098bc3544	LLM: enable previous models (#9505 ) * enable previous models * test mistral model * for test * run models separately * test all models * for test * revert the llm_performance_test.yaml	2023-11-28 10:21:07 +08:00
Wang, Jian4	40ec9f7ead	Add qlora cpu docker manually build (#9501 )	2023-11-21 14:39:16 +08:00
WeiguangHan	c487b53f21	LLM: only run arc perf test nightly (#9448 ) * LLM: only run arc perf test nightly * deleted unused python scripts * rebase main	2023-11-15 19:38:14 +08:00
Chen, Zhentao	d19ca21957	patch bigdl-llm model to harness by binding instead of patch file (#9420 ) * add run_llb.py * fix args interpret * modify outputs * update workflow * add license * test mixed 4 bit * update readme * use autotokenizer * add timeout * refactor workflow file * fix working directory * fix env * throw exception if some jobs failed * improve terminal outputs * Disable var which cause the run stuck * fix unknown precision * fix key error * directly output config instead * rm harness submodule	2023-11-14 12:51:39 +08:00
Chen, Zhentao	f36d7b2d59	Fix harness stuck (#9435 ) * remove env to avoid being stuck * use small model for test	2023-11-13 15:29:53 +08:00
Yuwen Hu	4faf5af8f1	[LLM] Add perf test for core on Windows (#9397 ) * temporary stop other perf test * Add framework for core performance test with one test model * Small fix and add platform control * Comment out lp for now * Add missing ymal file * Small fix * Fix sed contents * Small fix * Small path fixes * Small fix * Add update to ftp * Small upload fix * add chatglm3-6b * LLM: add model names * Keep repo id same as ftp and temporary make baichuan2 first priority * change order * Remove temp if false and separate pr and nightly results * Small fix --------- Co-authored-by: jinbridge <2635480475@qq.com>	2023-11-13 13:58:40 +08:00
WeiguangHan	2cfef5ef1e	LLM: store the nightly test and pr results separately (#9404 ) * LLM: store the csv results separately * modify the trigger files of LLM Performance Test	2023-11-11 06:35:27 +08:00
Yuwen Hu	3d107f6d25	[LLM] Separate windows build UT and build runner (#9403 ) * Separate windows build UT and build runner * Small fix	2023-11-09 18:47:38 +08:00
WeiguangHan	34449cb4bb	LLM: add remaining models to the arc perf test (#9384 ) * add remaining models * modify the filepath which stores the test result on ftp server * resolve some comments	2023-11-09 14:28:42 +08:00
Yuwen Hu	d4b248fcd4	Add windows binary build label AVX_VNNI (#9387 )	2023-11-08 18:13:35 +08:00
Chen, Zhentao	298b64217e	add auto triggered acc test (#9364 ) * add auto triggered acc test * use llama 7b instead * fix env * debug download * fix download prefix * add cut dirs * fix env of model path * fix dataset download * full job * source xpu env vars * use matrix to trigger model run * reset batch=1 * remove redirect * remove some trigger * add task matrix * add precision list * test llama-7b-chat * use /mnt/disk1 to store model and datasets * remove installation test * correct downloading path * fix HF vars * add bigdl-llm env vars * rename file * fix hf_home * fix script path * rename as harness evalution * rerun	2023-11-08 10:22:27 +08:00
WeiguangHan	84ab614aab	LLM: add more models and skip runtime error (#9349 ) * add more models and skip runtime error * upgrade transformers * temporarily removed Mistral-7B-v0.1 * temporarily disable the upload of arc perf result	2023-11-08 09:45:53 +08:00
Shaojun Liu	833e4dbc8d	fix llm-performance-test-on-arc bug (#9357 )	2023-11-06 10:00:25 +08:00

1 2 3 4 5 ...

470 commits