ipex-llm/python/llm/example/GPU/Speculative-Decoding
Jiao Wang 667f0db466
Update Eagle example to Eagle2+ipex-llm integration (#11717)
2024-10-16 23:16:14 -07:00
EAGLE Update Eagle example to Eagle2+ipex-llm integration (#11717) 2024-10-16 23:16:14 -07:00
Self-Speculation Reconstruct Speculative Decoding example directory (#11136) 2024-05-29 13:15:27 -07:00
README.md Reconstruct Speculative Decoding example directory (#11136) 2024-05-29 13:15:27 -07:00

Speculative-Decoding Examples on Intel GPU

This folder contains examples of running speculative decoding with IPEX-LLM on Intel GPUs:

  • Self-Speculation: running BF16 inference for Hugging Face Transformers models using self-speculative decoding with IPEX-LLM on Intel GPUs
  • EAGLE: running speculative sampling using EAGLE (Extrapolation Algorithm for Greater Language-model Efficiency) with IPEX-LLM on Intel GPUs
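
For readers new to the technique, the draft-then-verify loop that both examples build on can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the IPEX-LLM or EAGLE implementation: the names `speculative_decode`, `greedy_decode`, `toy_target`, and `toy_draft` are hypothetical, the "models" are toy functions mapping a token prefix to the next token, and only greedy decoding is covered. A real system verifies all drafted tokens in a single batched forward pass of the target model, and often emits one extra token when every draft is accepted; both are omitted here for brevity.

```python
from typing import Callable, List

# A "model" here is just a function from a token prefix to the next token (greedy).
NextToken = Callable[[List[int]], int]


def greedy_decode(target: NextToken, prompt: List[int], max_new_tokens: int) -> List[int]:
    """Baseline: one target-model call per generated token."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tokens.append(target(tokens))
    return tokens


def speculative_decode(target: NextToken, draft: NextToken, prompt: List[int],
                       max_new_tokens: int, k: int = 4) -> List[int]:
    """Draft up to k tokens with the cheap model, then verify them with the target."""
    tokens = list(prompt)
    limit = len(prompt) + max_new_tokens
    while len(tokens) < limit:
        # 1. Draft phase: the cheap model proposes up to k tokens autoregressively.
        proposal: List[int] = []
        for _ in range(min(k, limit - len(tokens))):
            proposal.append(draft(tokens + proposal))

        # 2. Verify phase: keep drafted tokens while they match the target's choice.
        #    (Done position by position here; a real system batches this step.)
        accepted: List[int] = []
        for tok in proposal:
            expected = target(tokens + accepted)
            if tok == expected:
                accepted.append(tok)
            else:
                # First mismatch: take the target's token and discard the rest.
                accepted.append(expected)
                break
        tokens.extend(accepted)
    return tokens


# Toy models (hypothetical): the draft agrees with the target except at some positions.
def toy_target(prefix: List[int]) -> int:
    return (prefix[-1] * 3 + 1) % 11


def toy_draft(prefix: List[int]) -> int:
    t = toy_target(prefix)
    return t if len(prefix) % 3 else (t + 1) % 11


if __name__ == "__main__":
    print(speculative_decode(toy_target, toy_draft, [1, 2], max_new_tokens=10))
    print(greedy_decode(toy_target, [1, 2], max_new_tokens=10))
```

Because a drafted token is kept only when it equals the target's own choice, the output matches plain greedy decoding with the target model; the speedup in practice comes from checking several drafted tokens per target pass. In self-speculation the draft is derived from the target model itself, whereas EAGLE uses a separate lightweight draft head.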