Cengguang Zhang | 511cbcf773 | LLM: add Ceval benchmark test. (#9872)

* init ceval benchmark test.
* upload dataset.
* add other tests.
* add qwen evaluator.
* fix qwen evaluator style.
* fix qwen evaluator style.
* update qwen evaluator.
* add llama evaluator.
* update eval
* fix typo.
* fix
* fix typo.
* fix llama evaluator.
* fix bug.
* fix style.
* delete dataset.
* fix style.
* fix style.
* add README.md and fix typo.
* fix comments.
* remove run scripts | 2024-01-16 19:14:26 +08:00 |