Wang, Jian4
|
a2e1578fd9
|
Merge tgi_api_server to main (#11036)
* init
* fix style
* speculative can not use benchmark
* add tgi server readme
|
2024-05-20 09:15:03 +08:00 |
|
Wang, Jian4
|
0e0bd309e2
|
LLM: Enable Speculative on Fastchat (#10909)
* init
* enable streamer
* update
* update
* remove deprecated
* update
* update
* add gpu example
|
2024-05-06 10:06:20 +08:00 |
|
ZehuaCao
|
a7c12020b4
|
Add fastchat quickstart (#10688)
* add fastchat quickstart
* update
* update
* update
|
2024-04-16 14:02:38 +08:00 |
|