ipex-llm/python/llm/example
Heyang Sun b1ff28ceb6 LLama2 CPU example of speculative decoding (#9962)
* LLama2 example of speculative decoding

* add docs

* Update speculative.py

* Update README.md

* Update README.md

* Update speculative.py

* remove autocast
2024-01-31 09:45:20 +08:00
..
CPU LLama2 CPU example of speculative decoding (#9962) 2024-01-31 09:45:20 +08:00
GPU LLM: add gpu example for redpajama models (#10040) 2024-01-30 19:39:28 +08:00
Text-Generation-WebUI [LLM] Add Text_Generation_WebUI Support (#9884) 2024-01-26 15:12:49 +08:00