| Author | Commit | Message | Date |
|---|---|---|---|
| Jason Dai | 3bc3d0bbcd | Update self-speculative readme (#9986) | 2024-01-24 22:37:32 +08:00 |
| Ruonan Wang | d4f65a6033 | LLM: add mistral speculative example (#9976) * add mistral example * update | 2024-01-24 17:35:15 +08:00 |
| Ruonan Wang | 60b35db1f1 | LLM: add chatglm3 speculative decoding example (#9966) * add chatglm3 example * update * fix | 2024-01-23 15:54:12 +08:00 |
| Ruonan Wang | 27b19106f3 | LLM: add readme for speculative decoding gpu examples (#9961) * add readme * add readme * meet code review | 2024-01-23 12:54:19 +08:00 |