ipex-llm

History

WeiguangHan fd81d66047 LLM: Compress some models to save space (#10315 ) * LLM: compress some models to save space * add deleted comments		2024-03-04 17:53:03 +08:00
..
harness	Move harness nightly related files to llm/test folder (#10209 )	2024-02-23 11:12:36 +08:00
igpu-perf	LLM: add Baichuan2-13B-Chat 2048-256 to MTL perf (#10273 )	2024-02-29 13:48:55 +08:00
perplexity	Add html report of ppl (#10218 )	2024-02-27 17:37:08 +08:00
arc-perf-test.yaml	LLM: Compress some models to save space (#10315 )	2024-03-04 17:53:03 +08:00
arc-perf-transformers-434.yaml	Update perf test all-in-one config for batch_size arg (#10012 )	2024-01-26 16:46:36 +08:00
arc-perf-transformers-437.yaml	LLM: add qwen_1.5_7b model for arc perf test (#10166 )	2024-02-19 17:21:00 +08:00
check_results.py	LLM: check csv and its corresponding yaml file (#9702 )	2023-12-21 09:54:33 +08:00
concat_csv.py	[LLM] Add qwen1.5-7B in iGPU perf (#10127 )	2024-02-07 22:31:20 +08:00
core-perf-test.yaml	Update perf test all-in-one config for batch_size arg (#10012 )	2024-01-26 16:46:36 +08:00
cpu-perf-test.yaml	update nightly spr perf test (#10178 )	2024-03-04 13:46:33 +08:00
csv_to_html.py	[LLM] Add model loading time record for all-in-one benchmark (#10201 )	2024-02-22 13:57:18 +08:00
stable-version-arc-perf-test-fp8.yaml	Arc Stable version test (#10087 )	2024-02-06 10:23:50 +08:00
stable-version-arc-perf-test-sym_int4.yaml	Arc Stable version test (#10087 )	2024-02-06 10:23:50 +08:00
stable-version-arc-stress-test-fp8.yaml	Update perf test all-in-one config for batch_size arg (#10012 )	2024-01-26 16:46:36 +08:00
stable-version-arc-stress-test-sym_int4.yaml	Update perf test all-in-one config for batch_size arg (#10012 )	2024-01-26 16:46:36 +08:00
stable-version-cpu-perf-test.yaml	Update perf test all-in-one config for batch_size arg (#10012 )	2024-01-26 16:46:36 +08:00
stable-version-cpu-stress-test.yaml	Update perf test all-in-one config for batch_size arg (#10012 )	2024-01-26 16:46:36 +08:00