Wang, Jian4
								
							 
						 | 
						
							
							
								
								
							
							
							
								
							
							
								16b2ef49c6
								
							
						 | 
						
							
							
								
								Update_document by heyang (#30)
							
							
							
							
							
						 | 
						
							2024-03-25 10:06:02 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Heyang Sun
								
							 
						 | 
						
							
							
							
							
								
							
							
								36a9e88104
								
							
						 | 
						
							
							
								
								Speculative Starcoder on CPU (#10138)
							
							
							
							
							
							
							
							* Speculative Starcoder on CPU
* enable kv-cache pre-allocation
* refine codes
* refine
* fix style
* fix style
* fix style
* refine
* refine
* Update speculative.py
* Update gptbigcode.py
* fix style
* Update speculative.py
* enable mixed-datatype layernorm on top of torch API
* adaptive dtype
* Update README.md 
							
						 | 
						
							2024-02-27 09:57:29 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Ziteng Zhang
								
							 
						 | 
						
							
							
							
							
								
							
							
								ea23afc8ec
								
							
						 | 
						
							
							
								
								[LLM]update ipex part in mistral example readme (#10239)
							
							
							
							
							
							
							
							* update ipex part in mistral example readme 
							
						 | 
						
							2024-02-26 14:35:20 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Xiangyu Tian
								
							 
						 | 
						
							
							
							
							
								
							
							
								f445217d02
								
							
						 | 
						
							
							
								
								LLM: Update IPEX to 2.2.0+cpu and Refactor for _ipex_optimize (#10189)
							
							
							
							
							
							
							
							Update IPEX to 2.2.0+cpu and refactor for _ipex_optimize. 
							
						 | 
						
							2024-02-22 16:01:11 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Heyang Sun
								
							 
						 | 
						
							
							
							
							
								
							
							
								601024f418
								
							
						 | 
						
							
							
								
								Mistral CPU example of speculative decoding (#10024)
							
							
							
							
							
							
							
							* Mistral CPU example of speculative decoding
* update transformres version
* update example
* Update README.md 
							
						 | 
						
							2024-02-01 10:52:32 +08:00 | 
						
						
							
							
							
								
							
							
						 |