Xu, Shuo
13a72dc51d
Test MiniCPM performance on iGPU in a more stable way ( #11573 )
* Test MiniCPM performance on iGPU in a more stable way
* small fix
---------
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-12 17:07:41 +08:00
Xu, Shuo
1355b2ce06
Add model Qwen-VL-Chat to iGPU-perf ( #11558 )
* Add model Qwen-VL-Chat to iGPU-perf
* small fix
---------
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-11 15:39:02 +08:00
Xu, Shuo
028ad4f63c
Add model phi-3-vision-128k-instruct to iGPU-perf benchmark ( #11554 )
* try to improve MiniCPM performance
* Add model phi-3-vision-128k-instruct to iGPU-perf benchmark
---------
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-10 17:26:30 +08:00
Xu, Shuo
61613b210c
try to improve MiniCPM performance ( #11552 )
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-10 16:58:23 +08:00
Yuwen Hu
8982ab73d5
Add Yi-6B and StableLM to iGPU perf test ( #11546 )
* Add transformers 4.38.2 test to igpu benchmark (#11529 )
* add transformers 4.38.1 test to igpu benchmark
* use transformers 4.38.2 & fix csv name error in 4.38 workflow
* add model Yi-6B-Chat & temporarily remove most models
---------
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
* filter some errorlevel (#11541 )
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
* Restore the temporarily removed models in iGPU-perf (#11544 )
* filter some errorlevel
* restore the temporarily removed models in iGPU-perf
---------
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
---------
Co-authored-by: Xu, Shuo <100334393+ATMxsp01@users.noreply.github.com>
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-09 18:51:23 +08:00
Xu, Shuo
f9a199900d
add model RWKV/v5-Eagle-7B-HF to igpu benchmark ( #11528 )
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-07-08 15:50:16 +08:00
Jun Wang
5a57e54400
[ADD] add 5 new models for igpu-perf ( #11524 )
2024-07-08 11:12:15 +08:00
Yuwen Hu
8f376e5192
Change igpu perf to mainly test int4+fp16 ( #11513 )
2024-07-05 17:12:33 +08:00
Jun Wang
f07937945f
[REMOVE] remove all useless repo-id in benchmark/igpu-perf ( #11508 )
2024-07-04 16:38:34 +08:00
Jin Qiao
3682c6a979
add glm4 and qwen2 to igpu perf ( #11304 )
2024-06-13 16:16:35 +08:00
Jiao Wang
0a06a6e1d4
Update tests for transformers 4.36 ( #10858 )
* update unit test
* fix gpu attention test
* update example test
* replace replit code
* set safe_serialization false
* perf test
* delete
* revert
2024-05-24 10:26:38 +08:00
Jin Qiao
15ee3fd542
Update igpu perf internlm ( #10958 )
2024-05-08 14:16:43 +08:00
Yuwen Hu
0efe26c3b6
Change order of chatglm2-6b and chatglm3-6b in iGPU perf test for more stable performance ( #10948 )
2024-05-07 13:48:39 +08:00
Jin Qiao
fb3c268d13
Add phi-3 to perf ( #10883 )
2024-04-25 20:21:56 +08:00
Yuwen Hu
fb2a160af3
Add phi-2 to 2048-256 test for fixes ( #10867 )
2024-04-24 10:00:25 +08:00
Yuwen Hu
21bb8bd164
Add phi-2 to igpu performance test ( #10865 )
2024-04-23 18:13:14 +08:00
Yuwen Hu
07e8b045a9
Add Meta-llama-3-8B-Instruct and Yi-6B-Chat to igpu nightly perf ( #10810 )
2024-04-19 15:09:58 +08:00
Yuwen Hu
1579ee4421
[LLM] Add nightly igpu perf test for INT4+FP16 1024-128 ( #10496 )
2024-03-21 16:07:06 +08:00
Yuwen Hu
d45e577d8c
[LLM] Test load_low_bit in iGPU perf test on Windows ( #10313 )
2024-03-04 18:03:57 +08:00
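The load_low_bit test above exercises reloading an already-quantized model instead of re-quantizing on every run. Below is a minimal sketch of that save/reload pattern, assuming the ipex-llm (formerly bigdl-llm) transformers-style API; the repo id, save directory, and low-bit format are placeholders and the flags used by the actual benchmark may differ.

```python
# Hedged sketch: quantize and save a low-bit model once, then reload it with
# load_low_bit in later perf runs. Module path and flags are assumptions;
# older releases exposed the same API under `bigdl.llm.transformers`.
from ipex_llm.transformers import AutoModelForCausalLM
from transformers import AutoTokenizer

model_path = "meta-llama/Llama-2-7b-chat-hf"   # placeholder repo id
low_bit_dir = "./llama2-7b-sym-int4"           # placeholder save directory

# One-time conversion: quantize on load, then persist the low-bit weights.
model = AutoModelForCausalLM.from_pretrained(
    model_path, load_in_low_bit="sym_int4", trust_remote_code=True)
model.save_low_bit(low_bit_dir)
AutoTokenizer.from_pretrained(model_path, trust_remote_code=True).save_pretrained(low_bit_dir)

# Subsequent benchmark runs skip quantization and just reload the saved weights.
model = AutoModelForCausalLM.load_low_bit(low_bit_dir, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(low_bit_dir, trust_remote_code=True)
```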
Jin Qiao
5d7243067c
LLM: add Baichuan2-13B-Chat 2048-256 to MTL perf ( #10273 )
2024-02-29 13:48:55 +08:00
Yuwen Hu
38ae4b372f
Add yuan2-2b to win igpu perf test ( #10250 )
2024-02-27 11:08:33 +08:00
Jin Qiao
3e6d188553
LLM: add baichuan2-13b to mtl perf ( #10238 )
2024-02-26 15:55:56 +08:00
Yuwen Hu
81ed65fbe7
[LLM] Add qwen1.5-7B in iGPU perf ( #10127 )
* Add qwen1.5 test config yaml with transformers 4.37.0
* Update for yaml file
2024-02-07 22:31:20 +08:00
Jin Qiao
8c8fc148c9
LLM: add rwkv 5 ( #10048 )
2024-01-31 15:54:55 +08:00
Yuwen Hu
1eaaace2dc
Update perf test all-in-one config for batch_size arg ( #10012 )
2024-01-26 16:46:36 +08:00
Yuwen Hu
9e2ac5291b
Add rwkv v4 back for igpu perf test 32-512 ( #9938 )
2024-01-18 17:15:28 +08:00
Yuwen Hu
0c498a7b64
Add llama2-13b to igpu perf test ( #9920 )
2024-01-17 14:58:45 +08:00
Yuwen Hu
8643b62521
[LLM] Support longer context in iGPU perf tests (2048-256) ( #9910 )
2024-01-16 17:48:37 +08:00
Yuwen Hu
c38e18f2ff
[LLM] Migrate iGPU perf tests to new machine ( #9784 )
* Move the 1024 test just after the 32-32 test, and enable all models for 1024-128
* Make sure Python output encoding is UTF-8 so that redirecting to txt always succeeds
* Upload results to FTP
* Small fix
2023-12-26 19:15:57 +08:00
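The UTF-8 bullet in the entry above concerns redirecting benchmark output to a txt file on Windows without hitting console-codepage encoding errors. A minimal, stdlib-only sketch of forcing UTF-8 on stdout; the printed line is illustrative only.

```python
# Minimal sketch: force UTF-8 on stdout so `python run.py > result.txt`
# cannot fail with an encoding error under a non-UTF-8 Windows codepage.
# Alternatively, set the PYTHONIOENCODING=utf-8 environment variable
# before launching Python.
import sys

if sys.stdout.encoding and sys.stdout.encoding.lower() != "utf-8":
    sys.stdout.reconfigure(encoding="utf-8")  # available since Python 3.7

print("sample output with non-ASCII text: 示例输出")
```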
Yuwen Hu
02436c6cce
[LLM] Enable more long context in-out pairs for iGPU perf tests ( #9765 )
* Add test for 1024-128 and enable more tests for 512-64
* Fix the date in the results csv name to the time when the performance test is triggered
* Small fix
* Small fix
* further fixes
2023-12-22 18:18:23 +08:00
Yuwen Hu
cbdd49f229
[LLM] win igpu performance for ipex 2.1 and oneapi 2024.0 ( #9679 )
* Change igpu win tests for ipex 2.1 and oneapi 2024.0
* Qwen model repo id updates; update model list for 512-64
* Add .eval for win igpu all-in-one benchmark for best performance
2023-12-13 18:52:29 +08:00
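The ".eval" bullet above refers to switching the loaded model into inference mode before latencies are measured. A minimal sketch using the plain transformers API; the model id is a placeholder, and the actual benchmark loads low-bit models on the iGPU rather than a stock checkpoint.

```python
# Hedged sketch: call .eval() after loading so dropout and other training-only
# behavior is disabled before generation is timed. Model id is a placeholder.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen-7B-Chat"  # placeholder; the benchmark covers several models
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
model.eval()  # the change referenced in the commit above

inputs = tokenizer("What is AI?", return_tensors="pt")
with torch.inference_mode():
    output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```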
Yuwen Hu
d272b6dc47
[LLM] Enable generation of html again for win igpu tests ( #9652 )
* Enable generation of html again and comment out rwkv for 32-512 as it is not very stable
* Small fix
2023-12-11 19:15:17 +08:00
Yuwen Hu
894d0aaf5e
[LLM] iGPU win perf test reorg based on in-out pairs ( #9645 )
* trigger pr temporarily
* Separate benchmark runs for win igpu based on in-out pairs
* Rename fix
* Test workflow
* Small fix
* Skip generation of html for now
* Change back to nightly triggered
2023-12-08 20:46:40 +08:00
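The "in-out pairs" naming used throughout these entries (32-32, 512-64, 1024-128, 2048-256) means prompt length and generated length in tokens. Below is a rough sketch of timing one such pair with plain transformers; the repo id and prompt are placeholders, and the actual all-in-one benchmark records more detailed timings than this simple average.

```python
# Hedged sketch of benchmarking one in-out pair, e.g. "1024-128":
# truncate the prompt to 1024 input tokens, then time generation of 128 new tokens.
import time
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-chat-hf"  # placeholder repo id
in_len, out_len = 1024, 128                 # the "1024-128" pair

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id).eval()

prompt = "Once upon a time " * 400  # long enough raw text to truncate from
ids = tokenizer(prompt, return_tensors="pt").input_ids[:, :in_len]

with torch.inference_mode():
    start = time.perf_counter()
    out = model.generate(ids, max_new_tokens=out_len, min_new_tokens=out_len)
    total = time.perf_counter() - start

print(f"{in_len}-{out_len}: {total:.2f} s total, "
      f"{total / out_len * 1000:.1f} ms per generated token (rough average)")
```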