* LLama2 example of speculative decoding * add docs * Update speculative.py * Update README.md * Update README.md * Update speculative.py * remove autocast |
||
|---|---|---|
| .. | ||
| CPU | ||
| GPU | ||
| Text-Generation-WebUI | ||
* LLama2 example of speculative decoding * add docs * Update speculative.py * Update README.md * Update README.md * Update speculative.py * remove autocast |
||
|---|---|---|
| .. | ||
| CPU | ||
| GPU | ||
| Text-Generation-WebUI | ||