6 lines
465 B
Markdown
6 lines
465 B
Markdown
# Speculative-Decoding Examples on Intel CPU
|
|
|
|
This folder contains examples of running Speculative-Decoding Examples with IPEX-LLM on Intel CPU:
|
|
|
|
- [Self-Speculation](Self-Speculation): running BF16 inference for Huggingface Transformer model with ***self-speculative decoding*** with IPEX-LLM on Intel CPUs
|
|
- [EAGLE](EAGLE): running speculative sampling using ***EAGLE*** (Extrapolation Algorithm for Greater Language-model Efficiency) with IPEX-LLM on Intel CPUs
|