Xiangyu Tian
|
b3f6faa038
|
LLM: Add CPU vLLM entrypoint (#11083)
Add CPU vLLM entrypoint and update CPU vLLM serving example.
|
2024-05-24 09:16:59 +08:00 |
|
Guancheng Fu
|
f654f7e08c
|
Add serving docker quickstart (#11072)
* add temp file
* add initial docker readme
* temp
* done
* add fastchat service
* fix
* fix
* fix
* fix
* remove stale file
|
2024-05-21 17:00:58 +08:00 |
|
Guancheng Fu
|
67db925112
|
Add vllm quickstart (#10978)
* temp
* add doc
* finish
* done
* fix
* add initial docker readme
* temp
* done fixing vllm_quickstart
* done
* remove not used file
* add
* fix
|
2024-05-17 16:16:42 +08:00 |
|