Commit graph

116 commits

Author SHA1 Message Date
Xu, Shuo
f9a199900d
add model RWKV/v5-Eagle-7B-HF to igpu benchmark (#11528)
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-08 15:50:16 +08:00
Jun Wang
5a57e54400
[ADD] add 5 new models for igpu-perf (#11524) 2024-07-08 11:12:15 +08:00
Xu, Shuo
64cfed602d
Add new models to benchmark (#11505)
* Add new models to benchmark

* remove Qwen/Qwen-VL-Chat to pass the validation

---------

Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-08 10:35:55 +08:00
Yuwen Hu
8f376e5192
Change igpu perf to mainly test int4+fp16 (#11513) 2024-07-05 17:12:33 +08:00
Jun Wang
f07937945f
[REMOVE] remove all useless repo-id in benchmark/igpu-perf (#11508) 2024-07-04 16:38:34 +08:00
Xu, Shuo
52519e07df
remove models we no longer need in benchmark. (#11492)
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-02 17:20:48 +08:00
Wenjing Margaret Mao
c0e86c523a
Add qwen-moe batch1 to nightly perf (#11369)
* add moe

* reduce 437 models

* rename

* fix syntax

* add moe check result

* add 430 + 437

* all modes

* 4-37-4 exclude

* revert & comment

---------

Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>
2024-06-20 14:17:41 +08:00
Wenjing Margaret Mao
b2f62a8561
Add batch 4 perf test (#11355)
* copy files to this branch

* add tasks

* comment one model

* change the model to test the 4.36

* only test batch-4

* typo

* typo

* typo

* typo

* typo

* typo

* add 4.37-batch4

* change the file name

* revert yaml file

* no print

* add batch4 task

* revert

---------

Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>
2024-06-20 09:48:52 +08:00
hxsz1997
44f22cba70
add config and default value (#11344)
* add config and default value

* add config in yaml

* remove lookahead and max_matching_ngram_size in config

* remove streaming and use_fp16_torch_dtype in test yaml

* update task in readme

* update commit of task
2024-06-18 15:28:57 +08:00
Wenjing Margaret Mao
bca5cbd96c
Modify arc nightly perf to fp16 (#11275)
* change api

* move to pr mode and remove the build

* add batch4 yaml and remove the bigcode

* remove batch4

* revert the starcoder

* remove the exclude

* revert

---------

Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>
2024-06-17 13:47:22 +08:00
Shaojun Liu
f5ef94046e
exclude dolly-v2-12b for arc perf test (#11315)
* test arc perf

* test

* test

* exclude dolly-v2-12b:2048

* revert changes
2024-06-14 15:35:56 +08:00
Jin Qiao
3682c6a979
add glm4 and qwen2 to igpu perf (#11304) 2024-06-13 16:16:35 +08:00
Wenjing Margaret Mao
b61f6e3ab1
Add update_parent_folder for nightly_perf_test (#11287)
* add update_parent_folder and change the workflow file

* add update_parent_folder and change the workflow file

* move to pr mode and comment the test

* use one model per config

* revert

---------

Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>
2024-06-12 17:58:13 +08:00
Wenjing Margaret Mao
70b17c87be
Merge multiple batches (#11264)
* add merge steps

* move to pr mode

* remove build + add merge.py

* add tohtml and change cp

* change test_batch folder path

* change merge_temp path

* change to html folder

* revert

* change place

* revert 437

* revert space

---------

Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>
2024-06-07 18:38:45 +08:00
Wenjing Margaret Mao
231b968aba
Modify the check_results.py to support batch 2&4 (#11133)
* add batch 2&4 and exclude to perf_test

* modify the perf-test&437 yaml

* modify llm_performance_test.yml

* remove batch 4

* modify check_results.py to support batch 2&4

* change the batch_size format

* remove genxir

* add str(batch_size)

* change actual_test_cases in check_results file to support batch_size

* change html highlight

* less models to test html and html_path

* delete the moe model

* split batch html

* split

* use installing from pypi

* use installing from pypi - batch2

* revert cpp

* revert cpp

* merge two jobs into one, test batch_size in one job

* merge two jobs into one, test batch_size in one job

* change file directory in workflow

* try catch deal with odd file without batch_size

* modify pandas version

* change the dir

* organize the code

* organize the code

* remove Qwen-MOE

* modify based on feedback

* modify based on feedback

* modify based on second round of feedback

* modify based on second round of feedback + change run-arc.sh mode

* modify based on second round of feedback + revert config

* modify based on second round of feedback + revert config

* modify based on second round of feedback + remove comments

* modify based on second round of feedback + remove comments

* modify based on second round of feedback + revert arc-perf-test

* modify based on third round of feedback

* change error type

* change error type

* modify check_results.html

* split batch into two folders

* add all models

* move csv_name

* revert pr test

* revert pr test

---------

Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>
2024-06-05 15:04:55 +08:00
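The batch-size support described in #11133 boils down to keying benchmark results on both model and batch size before comparing them against a reference run. The following is a minimal illustrative sketch only, not the repository's actual check_results.py; the column names ("model", "batch_size", "1st_token_latency_ms") and the 5% regression threshold are assumptions.

```python
# Illustrative sketch only -- not the repository's check_results.py.
# Column names and the regression threshold are assumptions.
import pandas as pd

THRESHOLD = 1.05  # flag results more than 5% slower than the reference (assumed)

def check_results(current_csv: str, reference_csv: str) -> pd.DataFrame:
    current = pd.read_csv(current_csv)
    reference = pd.read_csv(reference_csv)
    # Key on both model and batch size so batch-2 and batch-4 runs of the
    # same model are checked independently.
    merged = current.merge(
        reference,
        on=["model", "batch_size"],
        suffixes=("_cur", "_ref"),
    )
    merged["regressed"] = (
        merged["1st_token_latency_ms_cur"]
        > merged["1st_token_latency_ms_ref"] * THRESHOLD
    )
    return merged[["model", "batch_size", "regressed"]]

if __name__ == "__main__":
    print(check_results("batch4_results.csv", "batch4_reference.csv"))
```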
Jiao Wang
0a06a6e1d4
Update tests for transformers 4.36 (#10858)
* update unit test

* update

* update

* update

* update

* update

* fix gpu attention test

* update

* update

* update

* update

* update

* update

* update example test

* replace replit code

* update

* update

* update

* update

* set safe_serialization false

* perf test

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* delete

* update

* update

* update

* update

* update

* update

* revert

* update
2024-05-24 10:26:38 +08:00
Jin Qiao
15ee3fd542
Update igpu perf internlm (#10958) 2024-05-08 14:16:43 +08:00
Yuwen Hu
0efe26c3b6
Change order of chatglm2-6b and chatglm3-6b in iGPU perf test for more stable performance (#10948) 2024-05-07 13:48:39 +08:00
Jin Qiao
fb3c268d13
Add phi-3 to perf (#10883) 2024-04-25 20:21:56 +08:00
Yuxuan Xia
0213c1c1da
Add phi3 to the nightly test (#10885)
* Add llama3 and phi2 nightly test

* Change llama3-8b to llama3-8b-instruct

* Add phi3 to nightly test

* Add phi3 to nightly test

---------

Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>
2024-04-25 17:39:12 +08:00
Yuxuan Xia
844e18b1db
Add llama3 and phi2 nightly test (#10874)
* Add llama3 and phi2 nightly test

* Change llama3-8b to llama3-8b-instruct

---------

Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>
2024-04-24 16:58:56 +08:00
Yuwen Hu
fb2a160af3
Add phi-2 to 2048-256 test for fixes (#10867) 2024-04-24 10:00:25 +08:00
Yuwen Hu
21bb8bd164
Add phi-2 to igpu performance test (#10865) 2024-04-23 18:13:14 +08:00
Yuwen Hu
07e8b045a9
Add Meta-llama-3-8B-Instruct and Yi-6B-Chat to igpu nightly perf (#10810) 2024-04-19 15:09:58 +08:00
Wenjing Margaret Mao
c41730e024
edit 'ppl_result does not exist' issue, delete useless code (#10767)
* edit 'ppl_result does not exist' issue, delete useless code

* delete nonzero_min function

---------

Co-authored-by: jenniew <jenniewang123@gmail.com>
2024-04-16 18:11:56 +08:00
hxsz1997
0d518aab8d
Merge pull request #10697 from MargarettMao/ceval
combine English and Chinese, remove NaN
2024-04-12 14:37:47 +08:00
jenniew
dd0d2df5af Change fp16.csv mistral-7b-v0.1 into Mistral-7B-v0.1 2024-04-12 14:28:46 +08:00
jenniew
7309f1ddf9 Modify Typos 2024-04-12 14:23:13 +08:00
jenniew
cb594e1fc5 Modify Typos 2024-04-12 14:22:09 +08:00
jenniew
382c18e600 Modify Typos 2024-04-12 14:15:48 +08:00
jenniew
1a360823ce Modify Typos 2024-04-12 14:13:21 +08:00
jenniew
cdbb1de972 Mark Color Modification 2024-04-12 14:00:50 +08:00
jenniew
9bbfcaf736 Mark Color Modification 2024-04-12 13:30:16 +08:00
jenniew
bb34c6e325 Mark Color Modification 2024-04-12 13:26:36 +08:00
jenniew
b151a9b672 edit csv_to_html to combine en & zh 2024-04-11 17:35:36 +08:00
Wenjing Margaret Mao
9bec233e4d
Delete python/llm/test/benchmark/perplexity/update_html_in_parent_folder.py
Delete due to repetition
2024-04-11 07:21:12 +08:00
Shaojun Liu
d18dbfb097
update spr perf test (#10644) 2024-04-03 15:53:55 +08:00
Yuwen Hu
1579ee4421 [LLM] Add nightly igpu perf test for INT4+FP16 1024-128 (#10496) 2024-03-21 16:07:06 +08:00
Kai Huang
1315150e64 Add baichuan2-13b 1k to arc nightly perf (#10406) 2024-03-15 10:29:11 +08:00
Yuxuan Xia
a90e9b6ec2 Fix C-Eval Workflow (#10359)
* Fix Baichuan2 prompt format

* Fix ceval workflow errors

* Fix ceval workflow error

* Fix ceval error

* Fix ceval error

* Test ceval

* Fix ceval

* Fix ceval

* Fix ceval

* Fix ceval

* Fix ceval

* Fix ceval

* Fix ceval

* Fix ceval

* Fix ceval

* Fix ceval

* Fix ceval

* Fix ceval

* Add ceval dependency test

* Fix ceval

* Fix ceval

* Test full ceval

* Test full ceval

* Fix ceval

* Fix ceval
2024-03-13 17:23:17 +08:00
Yuxuan Xia
0c8d3c9830 Add C-Eval HTML report (#10294)
* Add C-Eval HTML report

* Fix C-Eval workflow pr trigger path

* Fix C-Eval workflow typos

* Add permissions to C-Eval workflow

* Fix C-Eval workflow typo

* Add pandas dependency

* Fix C-Eval workflow typo
2024-03-07 16:44:49 +08:00
Yuwen Hu
d45e577d8c [LLM] Test load_low_bit in iGPU perf test on Windows (#10313) 2024-03-04 18:03:57 +08:00
WeiguangHan
fd81d66047 LLM: Compress some models to save space (#10315)
* LLM: compress some models to save space

* add deleted comments
2024-03-04 17:53:03 +08:00
Shaojun Liu
bab2ee5f9e update nightly spr perf test (#10178)
* update nightly spr perf test

* update

* update runner label

* update

* update

* update folder

* revert
2024-03-04 13:46:33 +08:00
Jin Qiao
5d7243067c LLM: add Baichuan2-13B-Chat 2048-256 to MTL perf (#10273) 2024-02-29 13:48:55 +08:00
hxsz1997
cba61a2909 Add html report of ppl (#10218)
* remove include and language option, select the corresponding dataset based on the model name in Run

* change the nightly test time

* change the nightly test time of harness and ppl

* save the ppl result to json file

* generate csv file and print table result

* generate html

* modify the way to get parent folder

* update html in parent folder

* add llm-ppl-summary and llm-ppl-summary-html

* modify echo single result

* remove download fp16.csv

* change model name of PR

* move ppl nightly related files to llm/test folder

* reformat

* separate make_table from make_table_and_csv.py

* separate make_csv from make_table_and_csv.py

* update llm-ppl-html

* remove comment

* add Download fp16.results
2024-02-27 17:37:08 +08:00
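The csv-and-HTML summary step mentioned in #10218 can be approximated with pandas: gather the per-model perplexity results, write a CSV, and render the same table as HTML. This is a rough sketch under assumed file layout and JSON keys ("model", "ppl"); the real make_csv/make_table scripts under llm/test may differ.

```python
# Rough sketch of a ppl summary step: JSON results -> CSV -> HTML table.
# The file layout and the "model"/"ppl" keys are assumptions for illustration.
import glob
import json
import pandas as pd

def summarize_ppl(result_dir: str, csv_path: str, html_path: str) -> None:
    rows = []
    for path in glob.glob(f"{result_dir}/*.json"):
        with open(path) as f:
            result = json.load(f)
        rows.append({"model": result["model"], "ppl": result["ppl"]})
    table = pd.DataFrame(rows).sort_values("model")
    table.to_csv(csv_path, index=False)    # csv summary
    table.to_html(html_path, index=False)  # same table as an HTML report

if __name__ == "__main__":
    summarize_ppl("ppl_results", "ppl_summary.csv", "ppl_summary.html")
```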
Yuwen Hu
38ae4b372f Add yuan2-2b to win igpu perf test (#10250) 2024-02-27 11:08:33 +08:00
Jin Qiao
3e6d188553 LLM: add baichuan2-13b to mtl perf (#10238) 2024-02-26 15:55:56 +08:00
Chen, Zhentao
f315c7f93a Move harness nightly related files to llm/test folder (#10209)
* move harness nightly files to test folder

* change workflow file path accordingly

* use arc01 when pr

* fix path

* fix fp16 csv path
2024-02-23 11:12:36 +08:00
Yuwen Hu
21de2613ce [LLM] Add model loading time record for all-in-one benchmark (#10201)
* Add model loading time record in csv for all-in-one benchmark

* Small fix

* Small fix to number after .
2024-02-22 13:57:18 +08:00
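Recording model loading time, as in #10201, only needs a timer around the load call plus one extra CSV column. A hedged sketch follows; load_model and the CSV columns are placeholders, not the all-in-one benchmark's actual API.

```python
# Minimal sketch of recording model load time in a benchmark CSV.
# "load_model" and the column layout are placeholders, not the benchmark's API.
import csv
import time

def load_model(model_path: str):
    ...  # stand-in for the real low-bit model loading call

def benchmark_load(model_path: str, csv_path: str) -> float:
    start = time.perf_counter()
    load_model(model_path)
    load_time_s = time.perf_counter() - start
    with open(csv_path, "a", newline="") as f:
        csv.writer(f).writerow([model_path, f"{load_time_s:.2f}"])
    return load_time_s
```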