Heyang Sun
								
							 
						 | 
						
							
							
							
							
								
							
							
								b1ff28ceb6
								
							
						 | 
						
							
							
								
								LLama2 CPU example of speculative decoding (#9962)
							
							
							
							
							
							
							
							* LLama2 example of speculative decoding
* add docs
* Update speculative.py
* Update README.md
* Update README.md
* Update speculative.py
* remove autocast 
							
						 | 
						
							2024-01-31 09:45:20 +08:00 | 
						
						
							
							
							
								
							
							
						 |