* Add ipex-llm side change * add runable offline_inference * refactor to call vllm2 * Verified async server * add new v2 example * add README * fix * change dir * refactor readme.md * add experimental * fix |
||
|---|---|---|
| .. | ||
| CPU | ||
| GPU | ||
* Add ipex-llm side change * add runable offline_inference * refactor to call vllm2 * Verified async server * add new v2 example * add README * fix * change dir * refactor readme.md * add experimental * fix |
||
|---|---|---|
| .. | ||
| CPU | ||
| GPU | ||