ipex-llm/python/llm
Wang, Jian4 9c15abf825
Refactor fastapi-serving and add one card serving(#11581)
* init fastapi-serving one card
* mv api code to source
* update worker
* update for style-check
* add worker
* update bash
* update
* update worker name and add readme
* rename update
* rename to fastapi
2024-07-17 11:12:43 +08:00
dev Add npu benchmark all-in-one script (#11571) 2024-07-15 10:42:37 +08:00
example Refactor fastapi-serving and add one card serving(#11581) 2024-07-17 11:12:43 +08:00
portable-zip Fix null pointer dereferences error. (#11125) 2024-05-30 16:16:10 +08:00
scripts fix typo in python/llm/scripts/README.md (#11536) 2024-07-09 09:53:14 +08:00
src/ipex_llm Refactor fastapi-serving and add one card serving(#11581) 2024-07-17 11:12:43 +08:00
test Test MiniCPM performance on iGPU in a more stable way (#11573) 2024-07-12 17:07:41 +08:00
tpp OSPDT: add tpp licenses (#11165) 2024-06-06 10:59:06 +08:00
.gitignore [LLM] add chatglm pybinding binary file release (#8677) 2023-08-04 11:45:27 +08:00
setup.py Upgrade accelerate to 0.23.0 (#11331) 2024-06-17 15:03:11 +08:00
version.txt Update setup.py and add new actions and add compatible mode (#25) 2024-03-22 15:44:59 +08:00