ipex-llm/python/llm/example/GPU/Pipeline-Parallel-Serving/prompt
Wang, Jian4 9c15abf825
Refactor fastapi-serving and add one card serving(#11581)
* init fastapi-serving one card

* mv api code to source

* update worker

* update for style-check

* add worker

* update bash

* update

* update worker name and add readme

* rename update

* rename to fastapi
2024-07-17 11:12:43 +08:00
..
32.txt Refactor fastapi-serving and add one card serving(#11581) 2024-07-17 11:12:43 +08:00
128.txt Refactor fastapi-serving and add one card serving(#11581) 2024-07-17 11:12:43 +08:00
1024.txt Refactor fastapi-serving and add one card serving(#11581) 2024-07-17 11:12:43 +08:00
2048.txt Refactor fastapi-serving and add one card serving(#11581) 2024-07-17 11:12:43 +08:00