Shaojun Liu
								
							 
						 | 
						
							
							
								
								
							
							
							
								
							
							
								ab9f7f3ac5
								
							
						 | 
						
							
							
								
								FIX: Qwen1.5-GPTQ-Int4 inference error (#11432)
							
							
							
							
							
							
							
							* merge_qkv if quant_method is 'gptq'
* fix python style checks
* refactor
* update GPU example 
							
						 | 
						
							2024-06-26 15:36:22 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Jin Qiao
								
							 
						 | 
						
							
							
								
								
							
							
							
								
							
							
								10ee786920
								
							
						 | 
						
							
							
								
								Replace with IPEX-LLM in example comments (#10671)
							
							
							
							
							
							
							
							* Replace with IPEX-LLM in example comments
* More replacement
* revert some changes 
							
						 | 
						
							2024-04-07 13:29:51 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Wang, Jian4
								
							 
						 | 
						
							
							
								
								
							
							
							
								
							
							
								9df70d95eb
								
							
						 | 
						
							
							
								
								Refactor bigdl.llm to  ipex_llm (#24)
							
							
							
							
							
							
							
							* Rename bigdl/llm to ipex_llm
* rm python/llm/src/bigdl
* from bigdl.llm to from ipex_llm 
							
						 | 
						
							2024-03-22 15:41:21 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Yang Wang
								
							 
						 | 
						
							
							
							
							
								
							
							
								51d07a9fd8
								
							
						 | 
						
							
							
								
								Support directly loading gptq models from huggingface (#9391)
							
							
							
							
							
							
							
							* Support directly loading GPTQ models from huggingface
* fix style
* fix tests
* change example structure
* address comments
* fix style
* address comments 
							
						 | 
						
							2023-11-13 20:48:12 -08:00 | 
						
						
							
							
							
								
							
							
						 |