ipex-llm/docs/mddocs/DockerGuides
Jun Wang 6ffaec66a2
[UPDATE] add prefix caching document into vllm_docker_quickstart.md (#12173)
* [ADD] rewrite new vllm docker quick start

* [ADD] lora adapter doc finished

* [ADD] multi lora adapter tested successfully

* [ADD] add ipex-llm quantization doc

* [Merge] rebase main

* [REMOVE] rm tmp file

* [Merge] rebase main

* [ADD] add prefix caching experiment and result

* [REMOVE] rm cpu offloading chapter
2024-10-11 19:12:22 +08:00
docker_cpp_xpu_quickstart.md Small mddoc fixed based on review (#11391) 2024-06-21 17:09:30 +08:00
docker_pytorch_inference_gpu.md Revert to use out-of-tree GPU driver (#11761) 2024-08-12 13:41:47 +08:00
docker_run_pytorch_inference_in_vscode.md Update GPU HF-Transformers example structure (#11526) 2024-07-08 17:58:06 +08:00
docker_windows_gpu.md Further mddocs fixes (#11386) 2024-06-21 13:27:43 +08:00
fastchat_docker_quickstart.md Update mddocs for DockerGuides (#11380) 2024-06-21 12:10:35 +08:00
README.md Add index page for API doc & links update in mddocs (#11393) 2024-06-21 17:34:34 +08:00
vllm_cpu_docker_quickstart.md Add missing ragflow quickstart in mddocs and update legacy contents (#11385) 2024-06-21 12:28:26 +08:00
vllm_docker_quickstart.md [UPDATE] add prefix caching document into vllm_docker_quickstart.md (#12173) 2024-10-11 19:12:22 +08:00