xingyuan li | 610084e3c0 | [LLM] Complete windows unittest (#8611) | 2023-08-03 14:48:42 +09:00
* add windows nightly test workflow
* use github runner to run pr test
* model load should use lowbit
* remove tmp dir after testing

Xin Qiu | fccae91461 | Add load_low_bit save_load_bit to AutoModelForCausalLM (#8531) | 2023-07-17 15:29:55 +08:00
* transformers save_low_bit load_low_bit
* update example and add readme
* update
* update
* update
* add ut
* update

Xin Qiu | 90e3d86bce | rename low bit type name (#8512) | 2023-07-13 15:53:31 +08:00
* change qx_0 to sym_intx
* update
* fix typo
* update
* fix type
* fix style
* add python doc
* meet code review
* fix style

Xin Qiu | cd7a980ec4 | Transformer int4 add qtype, support q4_1 q5_0 q5_1 q8_0 (#8481) | 2023-07-12 08:23:08 +08:00
* quant in Q4 5 8
* meet code review
* update readme
* style
* update
* fix error
* fix error
* update
* fix style
* update
* Update README.md
* Add load_in_low_bit

Zhao Changmin | 81d655cda9 | LLM: transformer int4 save and load (#8462) | 2023-07-10 16:34:41 +08:00
* LLM: transformer int4 save and load

Ruonan Wang | 4be784a49d | LLM: add UT for starcoder (convert, inference) update examples and readme (#8379) | 2023-06-27 12:12:11 +08:00
* first commit to add path
* update example and readme
* update path
* fix
* update based on comment

Zhao Changmin | 4d177ca0a1 | LLM: Merge convert pth/gptq model script into one shell script (#8348) | 2023-06-19 11:50:05 +08:00
* convert model in one
* model type
* license
* readme and pep8
* ut path
* rename
* readme
* fix docs
* without lines

Yuwen Hu | 1aa33d35d5 | [LLM] Refactor LLM Linux tests (#8349) | 2023-06-16 15:22:48 +08:00
* Small name fix
* Add convert nightly tests, and for other llm tests, use stable ckpt
* Small fix and ftp fix
* Small fix
* Small fix