ipex-llm/python
Guancheng Fu cbe7b5753f
Add vLLM[xpu] related code (#10779)
* Add ipex-llm side change

* add runable offline_inference

* refactor to call vllm2

* Verified async server

* add new v2 example

* add README

* fix

* change dir

* refactor readme.md

* add experimental

* fix
2024-04-18 15:29:20 +08:00
..
llm Add vLLM[xpu] related code (#10779) 2024-04-18 15:29:20 +08:00