ipex-llm/python/llm/dev/benchmark/ceval/run.sh at 843fe546b0d653d3f7ee8a8362ced8250fbb593d - ayo/ipex-llm - Ayo's Forge: Just keep building.

ayo/ipex-llm

Cengguang Zhang 511cbcf773 LLM: add Ceval benchmark test. (#9872 )

* init ceval benchmark test.

* upload dataset.

* add other tests.

* add qwen evaluator.

* fix qwen evaluator style.

* fix qwen evaluator style.

* update qwen evaluator.

* add llama evaluator.

* update eval

* fix typo.

* fix

* fix typo.

* fix llama evaluator.

* fix bug.

* fix style.

* delete dataset.

* fix style.

* fix style.

* add README.md and fix typo.

* fix comments.

* remove run scripts

2024-01-16 19:14:26 +08:00

7 lines

No EOL

175 B

Bash

Raw Blame History

 python eval.py \
     --model_family llama \
     --model_path "path to model" \
     --eval_type validation \
     --device xpu \
     --eval_data_path data \
     --qtype sym_int4