| .. | 
			
		
		
			
			
			
			
				| 
					
						
							
								
								
								
									
									
									
										awq
									
								
							
						
					
				 | 
				
					
						
							
							Refactor bigdl.llm to  ipex_llm (#24)
						
					
				 | 
				2024-03-22 15:41:21 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								
								
									
									
									
										gguf
									
								
							
						
					
				 | 
				
					
						
							
							IPEX Duplicate importer V2 (#11310)
						
					
				 | 
				2024-06-19 16:29:19 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								
								
									
									
									
										layers
									
								
							
						
					
				 | 
				
					
						
							
							Divide core-xe packages (#11131)
						
					
				 | 
				2024-05-28 12:00:18 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								
								
									
									
									
										models
									
								
							
						
					
				 | 
				
					
						
							
							Support compress kv with lookahead (#11752)
						
					
				 | 
				2024-08-09 17:39:57 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								
								
									
									
									
										npu_models
									
								
							
						
					
				 | 
				
					
						
							
							Fix qwen2 & int4 on NPU (#11646)
						
					
				 | 
				2024-07-24 13:14:39 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								__init__.py
							
						
					
				 | 
				
					
						
							
							Refactor fastapi-serving and add one card serving(#11581)
						
					
				 | 
				2024-07-17 11:12:43 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								bmm.py
							
						
					
				 | 
				
					
						
							
							Divide core-xe packages (#11131)
						
					
				 | 
				2024-05-28 12:00:18 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								convert.py
							
						
					
				 | 
				
					
						
							
							support and optimize minicpm-v-2_6 (#11738)
						
					
				 | 
				2024-08-07 18:21:16 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								convert_ipex.py
							
						
					
				 | 
				
					
						
							
							LLM: Fix bigdl_ipex_int8 warning (#10890)
						
					
				 | 
				2024-04-26 11:18:44 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								embedding.py
							
						
					
				 | 
				
					
						
							
							add save_low_bit support for DiskEmbedding (#11621)
						
					
				 | 
				2024-07-19 10:34:53 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								kv.py
							
						
					
				 | 
				
					
						
							
							Phi3 support compresskv (#11733)
						
					
				 | 
				2024-08-09 15:43:43 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								lisa.py
							
						
					
				 | 
				
					
						
							
							LISA Finetuning Example (#10743)
						
					
				 | 
				2024-04-18 13:48:10 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								load_config.yaml
							
						
					
				 | 
				
					
						
							
							Adding load_low_bit interface for ipex_llm_worker (#11000)
						
					
				 | 
				2024-05-13 15:30:19 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								loader.py
							
						
					
				 | 
				
					
						
							
							Add half precision for fastchat models (#11130)
						
					
				 | 
				2024-05-24 15:41:14 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								lookup.py
							
						
					
				 | 
				
					
						
							
							[WIP] Add look up table in 1st token stage (#11193)
						
					
				 | 
				2024-06-07 10:51:05 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								low_bit_linear.py
							
						
					
				 | 
				
					
						
							
							Add disk_embedding parameter to support put Embedding layer on CPU (#11617)
						
					
				 | 
				2024-07-18 17:06:06 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								model.py
							
						
					
				 | 
				
					
						
							
							Fix Pipeline Parallel dtype (#11623)
						
					
				 | 
				2024-07-19 13:07:40 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								modelling_bigdl.py
							
						
					
				 | 
				
					
						
							
							Remove chatglm_C Module to Eliminate LGPL Dependency (#11178)
						
					
				 | 
				2024-05-31 17:03:11 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								npu_model.py
							
						
					
				 | 
				
					
						
							
							Clean npu dtype branch (#11515)
						
					
				 | 
				2024-07-05 15:45:26 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								pipeline_parallel.py
							
						
					
				 | 
				
					
						
							
							Optimizations for Pipeline Parallel Serving (#11702)
						
					
				 | 
				2024-08-02 12:06:59 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								qlora.py
							
						
					
				 | 
				
					
						
							
							Upgrade Peft version to 0.10.0 for LLM finetune (#10886)
						
					
				 | 
				2024-05-07 15:09:14 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								relora.py
							
						
					
				 | 
				
					
						
							
							Refactor bigdl.llm to  ipex_llm (#24)
						
					
				 | 
				2024-03-22 15:41:21 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								speculative.py
							
						
					
				 | 
				
					
						
							
							Support compress kv with lookahead (#11752)
						
					
				 | 
				2024-08-09 17:39:57 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								streamer.py
							
						
					
				 | 
				
					
						
							
							[LLM]Reopen autotp generate_stream (#11120)
						
					
				 | 
				2024-05-24 17:16:14 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								training_patch.py
							
						
					
				 | 
				
					
						
							
							Fix error during merging adapter (#11145)
						
					
				 | 
				2024-05-27 19:41:42 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								utils.py
							
						
					
				 | 
				
					
						
							
							add fallback for unsupported k-quants (#11691)
						
					
				 | 
				2024-07-31 11:39:58 +08:00 | 
			
		
			
			
			
			
				| 
					
						
							
								
								xpu_customize_fwd.py
							
						
					
				 | 
				
					
						
							
							Refactor bigdl.llm to  ipex_llm (#24)
						
					
				 | 
				2024-03-22 15:41:21 +08:00 |