Guancheng Fu
|
f654f7e08c
|
Add serving docker quickstart (#11072)
* add temp file
* add initial docker readme
* temp
* done
* add fastchat service
* fix
* fix
* fix
* fix
* remove stale file
|
2024-05-21 17:00:58 +08:00 |
|
Wang, Jian4
|
a2e1578fd9
|
Merge tgi_api_server to main (#11036)
* init
* fix style
* speculative can not use benchmark
* add tgi server readme
|
2024-05-20 09:15:03 +08:00 |
|
Wang, Jian4
|
0e0bd309e2
|
LLM: Enable Speculative on Fastchat (#10909)
* init
* enable streamer
* update
* update
* remove deprecated
* update
* update
* add gpu example
|
2024-05-06 10:06:20 +08:00 |
|
ZehuaCao
|
a7c12020b4
|
Add fastchat quickstart (#10688)
* add fastchat quickstart
* update
* update
* update
|
2024-04-16 14:02:38 +08:00 |
|