Yuxuan Xia
								
							 
						 | 
						
							
							
							
							
								
							
							
								7cbc2429a6
								
							
						 | 
						
							
							
								
								Fix C-Eval ChatGLM loading issue (#10206)
							
							
							
							
							
							
							
							* Add c-eval workflow and modify running files
* Modify the chatglm evaluator file
* Modify the ceval workflow for triggering test
* Modify the ceval workflow file
* Modify the ceval workflow file
* Modify ceval workflow
* Adjust the ceval dataset download
* Add ceval workflow dependencies
* Modify ceval workflow dataset download
* Add ceval test dependencies
* Add ceval test dependencies
* Correct the result print
* Fix the nightly test trigger time
* Fix ChatGLM loading issue 
							
						 | 
						
							2024-02-22 10:00:43 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Yuxuan Xia
								
							 
						 | 
						
							
							
							
							
								
							
							
								209122559a
								
							
						 | 
						
							
							
								
								Add Ceval workflow and modify the result printing (#10140)
							
							
							
							
							
							
							
							* Add c-eval workflow and modify running files
* Modify the chatglm evaluator file
* Modify the ceval workflow for triggering test
* Modify the ceval workflow file
* Modify the ceval workflow file
* Modify ceval workflow
* Adjust the ceval dataset download
* Add ceval workflow dependencies
* Modify ceval workflow dataset download
* Add ceval test dependencies
* Add ceval test dependencies
* Correct the result print 
							
						 | 
						
							2024-02-19 17:06:53 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Yuxuan Xia
								
							 
						 | 
						
							
							
							
							
								
							
							
								3832eb0ce0
								
							
						 | 
						
							
							
								
								Add ChatGLM C-Eval Evaluator (#10095)
							
							
							
							
							
							
							
							* Add ChatGLM ceval evaluator
* Modify ChatGLM Evaluator Reference 
							
						 | 
						
							2024-02-07 11:27:06 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Cengguang Zhang
								
							 
						 | 
						
							
							
							
							
								
							
							
								511cbcf773
								
							
						 | 
						
							
							
								
								LLM: add Ceval benchmark test. (#9872)
							
							
							
							
							
							
							
							* init ceval benchmark test.
* upload dataset.
* add other tests.
* add qwen evaluator.
* fix qwen evaluator style.
* fix qwen evaluator style.
* update qwen evaluator.
* add llama evaluator.
* update eval
* fix typo.
* fix
* fix typo.
* fix llama evaluator.
* fix bug.
* fix style.
* delete dataset.
* fix style.
* fix style.
* add README.md and fix typo.
* fix comments.
* remove run scripts 
							
						 | 
						
							2024-01-16 19:14:26 +08:00 | 
						
						
							
							
							
								
							
							
						 |