IPEX-LLM Docker Container User Guides
In this section, you will find guides on using IPEX-LLM with Docker, covering:
- Inference in Python/C++
- Serving