* init ceval benchmark test. * upload dataset. * add other tests. * add qwen evaluator. * fix qwen evaluator style. * fix qwen evaluator style. * update qwen evaluator. * add llama evaluator. * update eval * fix typo. * fix * fix typo. * fix llama evaluator. * fix bug. * fix style. * delete dataset. * fix style. * fix style. * add README.md and fix typo. * fix comments. * remove run scripts
		
			
				
	
	
		
			7 lines
		
	
	
		
			No EOL
		
	
	
		
			175 B
		
	
	
	
		
			Bash
		
	
	
	
	
	
			
		
		
	
	
			7 lines
		
	
	
		
			No EOL
		
	
	
		
			175 B
		
	
	
	
		
			Bash
		
	
	
	
	
	
python eval.py \
 | 
						|
    --model_family llama \
 | 
						|
    --model_path "path to model" \
 | 
						|
    --eval_type validation \
 | 
						|
    --device xpu \
 | 
						|
    --eval_data_path data \
 | 
						|
    --qtype sym_int4 |