ipex-llm/python
Guancheng Fu 963a5c8d79 Add vLLM-XPU version's README/examples (#9536)
* test

* test

* fix last kv cache

* add xpu readme

* remove numactl for xpu example

* fix link error

* update max_num_batched_tokens logic

* add explaination

* add xpu environement version requirement

* refine gpu memory

* fix

* fix style
2023-11-28 09:44:03 +08:00
..
llm Add vLLM-XPU version's README/examples (#9536) 2023-11-28 09:44:03 +08:00