ipex-llm/docker/llm/serving
Guancheng Fu af693425f1
Upgrade to vLLM 0.6.6 (#12796)
* init

* update engine init

* fix serving load_in_low_bit problem

* temp

* temp

* temp

* temp

* temp

* fix

* fixed

* done

* fix

* fix all arguments

* fix

* fix throughput script

* fix

* fix

* use official ipex-llm

* Fix readme

* fix

---------

Co-authored-by: hzjane <a1015616934@qq.com>
2025-02-12 16:47:51 +08:00
cpu         Fix cpu serving docker image (#12783)   2025-02-07 11:12:42 +08:00
xpu/docker  Upgrade to vLLM 0.6.6 (#12796)          2025-02-12 16:47:51 +08:00