* [ADD] rewrite new vllm docker quick start * [ADD] lora adapter doc finished * [ADD] mulit lora adapter test successfully * [ADD] add ipex-llm quantization doc * [Merge] rebase main * [REMOVE] rm tmp file * [Merge] rebase main * [ADD] add prefix caching experiment and result * [REMOVE] rm cpu offloading chapter |
||
|---|---|---|
| .. | ||
| docker_cpp_xpu_quickstart.md | ||
| docker_pytorch_inference_gpu.md | ||
| docker_run_pytorch_inference_in_vscode.md | ||
| docker_windows_gpu.md | ||
| fastchat_docker_quickstart.md | ||
| README.md | ||
| vllm_cpu_docker_quickstart.md | ||
| vllm_docker_quickstart.md | ||
IPEX-LLM Docker Container User Guides
In this section, you will find guides related to using IPEX-LLM with Docker, covering how to:
-
Inference in Python/C++
-
Serving