ipex-llm/python
Heyang Sun b1ff28ceb6 LLama2 CPU example of speculative decoding (#9962)
* LLama2 example of speculative decoding

* add docs

* Update speculative.py

* Update README.md

* Update README.md

* Update speculative.py

* remove autocast
2024-01-31 09:45:20 +08:00
llm    LLama2 CPU example of speculative decoding (#9962)    2024-01-31 09:45:20 +08:00