ipex-llm/python/llm/example
Wang, Jian4 1eed0635f2
Add lightweight serving and support tgi parameter (#11600)
* init tgi request

* update openai api

* update for pp

* update and add readme

* add to docker

* add start bash

* update

* update

* update
2024-07-19 13:15:56 +08:00
..
CPU fix gemma for 4.41 (#11531) 2024-07-18 15:02:50 -07:00
GPU Add lightweight serving and support tgi parameter (#11600) 2024-07-19 13:15:56 +08:00
NPU/HF-Transformers-AutoModels Add npu benchmark all-in-one script (#11571) 2024-07-15 10:42:37 +08:00