Wang, Jian4
								
							 
						 | 
						
							
							
								
								
							
							
							
								
							
							
								23c6a52fb0
								
							
						 | 
						
							
							
								
								LLM: Fix ipex torchscript=True error (#10832)
							
							
							
							
							
							
							
							* remove
* update
* remove torchscript 
							
						 | 
						
							2024-04-22 15:53:09 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									ZehuaCao
								
							 
						 | 
						
							
							
								
								
							
							
							
								
							
							
								0646e2c062
								
							
						 | 
						
							
							
								
								Fix short prompt for IPEX_CPU speculative decoding cause no_attr error (#10783)
							
							
							
							
							
						 | 
						
							2024-04-17 16:19:57 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Wang, Jian4
								
							 
						 | 
						
							
							
								
								
							
							
							
								
							
							
								9df70d95eb
								
							
						 | 
						
							
							
								
								Refactor bigdl.llm to  ipex_llm (#24)
							
							
							
							
							
							
							
							* Rename bigdl/llm to ipex_llm
* rm python/llm/src/bigdl
* from bigdl.llm to from ipex_llm 
							
						 | 
						
							2024-03-22 15:41:21 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Wang, Jian4
								
							 
						 | 
						
							
							
							
							
								
							
							
								fb53b994f8
								
							
						 | 
						
							
							
								
								LLM : Add llama ipex optimized (#10046)
							
							
							
							
							
							
							
							* init ipex
* remove padding 
							
						 | 
						
							2024-01-31 10:38:46 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Heyang Sun
								
							 
						 | 
						
							
							
							
							
								
							
							
								b1ff28ceb6
								
							
						 | 
						
							
							
								
								LLama2 CPU example of speculative decoding (#9962)
							
							
							
							
							
							
							
							* LLama2 example of speculative decoding
* add docs
* Update speculative.py
* Update README.md
* Update README.md
* Update speculative.py
* remove autocast 
							
						 | 
						
							2024-01-31 09:45:20 +08:00 | 
						
						
							
							
							
								
							
							
						 |