Guancheng Fu
|
cbe7b5753f
|
Add vLLM[xpu] related code (#10779)
* Add ipex-llm side change
* add runable offline_inference
* refactor to call vllm2
* Verified async server
* add new v2 example
* add README
* fix
* change dir
* refactor readme.md
* add experimental
* fix
|
2024-04-18 15:29:20 +08:00 |
|