Heyang Sun
								
							 
						 | 
						
							
							
							
							
								
							
							
								177273c1a4
								
							
						 | 
						
							
							
								
								IPEX Speculative Support for Baichuan2 7B (#10112)
							
							
							
							
							
							
							
							* IPEX Speculative Support for Baichuan2 7B
* fix license problems
* refine 
							
						 | 
						
							2024-02-19 09:12:57 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Heyang Sun
								
							 
						 | 
						
							
							
							
							
								
							
							
								601024f418
								
							
						 | 
						
							
							
								
								Mistral CPU example of speculative decoding (#10024)
							
							
							
							
							
							
							
							* Mistral CPU example of speculative decoding
* update transformres version
* update example
* Update README.md 
							
						 | 
						
							2024-02-01 10:52:32 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Heyang Sun
								
							 
						 | 
						
							
							
							
							
								
							
							
								7284edd9b7
								
							
						 | 
						
							
							
								
								Vicuna CPU example of speculative decoding (#10018)
							
							
							
							
							
							
							
							* Vicuna CPU example of speculative decoding
* Update speculative.py
* Update README.md
* add requirements for ipex
* Update README.md
* Update speculative.py
* Update speculative.py 
							
						 | 
						
							2024-01-31 11:23:50 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Wang, Jian4
								
							 
						 | 
						
							
							
							
							
								
							
							
								fb53b994f8
								
							
						 | 
						
							
							
								
								LLM : Add llama ipex optimized (#10046)
							
							
							
							
							
							
							
							* init ipex
* remove padding 
							
						 | 
						
							2024-01-31 10:38:46 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Heyang Sun
								
							 
						 | 
						
							
							
							
							
								
							
							
								b1ff28ceb6
								
							
						 | 
						
							
							
								
								LLama2 CPU example of speculative decoding (#9962)
							
							
							
							
							
							
							
							* LLama2 example of speculative decoding
* add docs
* Update speculative.py
* Update README.md
* Update README.md
* Update speculative.py
* remove autocast 
							
						 | 
						
							2024-01-31 09:45:20 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Xiangyu Tian
								
							 
						 | 
						
							
							
							
							
								
							
							
								9978089796
								
							
						 | 
						
							
							
								
								[LLM] Enable BIGDL_OPT_IPEX in speculative baichuan2 13b example  (#10028)
							
							
							
							
							
							
							
							Enable BIGDL_OPT_IPEX in speculative baichuan2 13b example 
							
						 | 
						
							2024-01-30 17:11:37 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Heyang Sun
								
							 
						 | 
						
							
							
							
							
								
							
							
								cc3f122f6a
								
							
						 | 
						
							
							
								
								Baichuan2 CPU example of speculative decoding (#10003)
							
							
							
							
							
							
							
							* Baichuan2 CPU example of speculative decoding
* Update generate.py
* Update README.md
* Update generate.py
* Update generate.py
* Update generate.py
* fix default model
* fix wrong chinese coding
* Update generate.py
* update prompt
* update sample outputs
* baichuan 7b needs transformers==4.31.0
* rename example file's name 
							
						 | 
						
							2024-01-29 14:21:09 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Wang, Jian4
								
							 
						 | 
						
							
							
							
							
								
							
							
								093e6f8f73
								
							
						 | 
						
							
							
								
								LLM: Add qwen CPU speculative example (#9985)
							
							
							
							
							
							
							
							* init from gpu
* update for cpu
* update
* update
* fix xpu readme
* update
* update example prompt
* update prompt and add 72b
* update
* update 
							
						 | 
						
							2024-01-25 17:01:34 +08:00 | 
						
						
							
							
							
								
							
							
						 |