Commit graph

1 commit

Author SHA1 Message Date
Wang, Jian4
1eed0635f2
Add lightweight serving and support tgi parameter (#11600)
* init tgi request

* update openai api

* update for pp

* update and add readme

* add to docker

* add start bash

* update

* update

* update
2024-07-19 13:15:56 +08:00
Renamed from python/llm/example/GPU/Pipeline-Parallel-Serving/serving.py