Shaojun Liu
|
40a7d2b4f0
|
Consolidated C-Eval Benchmark Guide for Single-GPU and Multi-GPU Environments (#12618)
* run c-eval on multi-GPUs
* Update README.md
|
2024-12-26 15:23:32 +08:00 |
|
Yina Chen
|
0236de3ac2
|
set IPEX_LLM_LAST_LM_HEAD=1 as default (#11885)
|
2024-08-21 15:06:12 +08:00 |
|
Yuxuan Xia
|
209122559a
|
Add Ceval workflow and modify the result printing (#10140)
* Add c-eval workflow and modify running files
* Modify the chatglm evaluator file
* Modify the ceval workflow for triggering test
* Modify the ceval workflow file
* Modify the ceval workflow file
* Modify ceval workflow
* Adjust the ceval dataset download
* Add ceval workflow dependencies
* Modify ceval workflow dataset download
* Add ceval test dependencies
* Add ceval test dependencies
* Correct the result print
|
2024-02-19 17:06:53 +08:00 |
|
Cengguang Zhang
|
511cbcf773
|
LLM: add Ceval benchmark test. (#9872)
* init ceval benchmark test.
* upload dataset.
* add other tests.
* add qwen evaluator.
* fix qwen evaluator style.
* fix qwen evaluator style.
* update qwen evaluator.
* add llama evaluator.
* update eval
* fix typo.
* fix
* fix typo.
* fix llama evaluator.
* fix bug.
* fix style.
* delete dataset.
* fix style.
* fix style.
* add README.md and fix typo.
* fix comments.
* remove run scripts
|
2024-01-16 19:14:26 +08:00 |
|