Heyang Sun
								
							 
						 | 
						
							
							
							
							
								
							
							
								36a9e88104
								
							
						 | 
						
							
							
								
								Speculative Starcoder on CPU (#10138)
							
							
							
							
							
							
							
							* Speculative Starcoder on CPU
* enable kv-cache pre-allocation
* refine codes
* refine
* fix style
* fix style
* fix style
* refine
* refine
* Update speculative.py
* Update gptbigcode.py
* fix style
* Update speculative.py
* enable mixed-datatype layernorm on top of torch API
* adaptive dtype
* Update README.md 
							
						 | 
						
							2024-02-27 09:57:29 +08:00 | 
						
						
							
							
							
								
							
							
						 |