Xiangyu Tian
|
044e486480
|
Fix vLLM CPU /chat endpoint (#11748)
|
2024-08-09 10:33:52 +08:00 |
|
Xiangyu Tian
|
b30bf7648e
|
Fix vLLM CPU api_server params (#11384)
|
2024-06-21 13:00:06 +08:00 |
|
Xiangyu Tian
|
4b07712fd8
|
LLM: Fix vLLM CPU model convert mismatch (#11254)
Fix vLLM CPU model convert mismatch.
|
2024-06-07 15:54:34 +08:00 |
|
Xiangyu Tian
|
ac3d53ff5d
|
LLM: Fix vLLM CPU version error (#11206)
Fix vLLM CPU version error
|
2024-06-04 19:10:23 +08:00 |
|
Xiangyu Tian
|
b3f6faa038
|
LLM: Add CPU vLLM entrypoint (#11083)
Add CPU vLLM entrypoint and update CPU vLLM serving example.
|
2024-05-24 09:16:59 +08:00 |
|