ipex-llm/python/llm/example
Wang, Jian4 9c15abf825
Refactor fastapi-serving and add one card serving(#11581)
* init fastapi-serving one card

* mv api code to source

* update worker

* update for style-check

* add worker

* update bash

* update

* update worker name and add readme

* rename update

* rename to fastapi
2024-07-17 11:12:43 +08:00
Name                             Last commit                                                  Date
CPU                              Fix codegeex2 transformers version (#11487)                  2024-07-02 15:09:28 +08:00
GPU                              Refactor fastapi-serving and add one card serving(#11581)    2024-07-17 11:12:43 +08:00
NPU/HF-Transformers-AutoModels   Add npu benchmark all-in-one script (#11571)                 2024-07-15 10:42:37 +08:00