ipex-llm/python/llm/test/benchmark
WeiguangHan fd81d66047 LLM: Compress some models to save space (#10315)
* LLM: compress some models to save space

* add deleted comments
2024-03-04 17:53:03 +08:00
Name | Last commit | Date
harness | Move harness nightly related files to llm/test folder (#10209) | 2024-02-23 11:12:36 +08:00
igpu-perf | LLM: add Baichuan2-13B-Chat 2048-256 to MTL perf (#10273) | 2024-02-29 13:48:55 +08:00
perplexity | Add html report of ppl (#10218) | 2024-02-27 17:37:08 +08:00
arc-perf-test.yaml | LLM: Compress some models to save space (#10315) | 2024-03-04 17:53:03 +08:00
arc-perf-transformers-434.yaml | Update perf test all-in-one config for batch_size arg (#10012) | 2024-01-26 16:46:36 +08:00
arc-perf-transformers-437.yaml | LLM: add qwen_1.5_7b model for arc perf test (#10166) | 2024-02-19 17:21:00 +08:00
check_results.py | LLM: check csv and its corresponding yaml file (#9702) | 2023-12-21 09:54:33 +08:00
concat_csv.py | [LLM] Add qwen1.5-7B in iGPU perf (#10127) | 2024-02-07 22:31:20 +08:00
core-perf-test.yaml | Update perf test all-in-one config for batch_size arg (#10012) | 2024-01-26 16:46:36 +08:00
cpu-perf-test.yaml | update nightly spr perf test (#10178) | 2024-03-04 13:46:33 +08:00
csv_to_html.py | [LLM] Add model loading time record for all-in-one benchmark (#10201) | 2024-02-22 13:57:18 +08:00
stable-version-arc-perf-test-fp8.yaml | Arc Stable version test (#10087) | 2024-02-06 10:23:50 +08:00
stable-version-arc-perf-test-sym_int4.yaml | Arc Stable version test (#10087) | 2024-02-06 10:23:50 +08:00
stable-version-arc-stress-test-fp8.yaml | Update perf test all-in-one config for batch_size arg (#10012) | 2024-01-26 16:46:36 +08:00
stable-version-arc-stress-test-sym_int4.yaml | Update perf test all-in-one config for batch_size arg (#10012) | 2024-01-26 16:46:36 +08:00
stable-version-cpu-perf-test.yaml | Update perf test all-in-one config for batch_size arg (#10012) | 2024-01-26 16:46:36 +08:00
stable-version-cpu-stress-test.yaml | Update perf test all-in-one config for batch_size arg (#10012) | 2024-01-26 16:46:36 +08:00