ipex-llm/docker/llm/inference
Wang, Jian4 9c15abf825
Refactor fastapi-serving and add one card serving(#11581)
* init fastapi-serving one card

* mv api code to source

* update worker

* update for style-check

* add worker

* update bash

* update

* update worker name and add readme

* rename update

* rename to fastapi
2024-07-17 11:12:43 +08:00
..
cpu/docker Fix docker images (#11362) 2024-06-20 15:44:55 +08:00
xpu/docker Refactor fastapi-serving and add one card serving(#11581) 2024-07-17 11:12:43 +08:00