ipex-llm/docs/mddocs
Jun Wang 6ffaec66a2
[UPDATE] add prefix caching document into vllm_docker_quickstart.md (#12173)
* [ADD] rewrite new vllm docker quick start

* [ADD] lora adapter doc finished

* [ADD] mulit lora adapter test successfully

* [ADD] add ipex-llm quantization doc

* [Merge] rebase main

* [REMOVE] rm tmp file

* [Merge] rebase main

* [ADD] add prefix caching experiment and result

* [REMOVE] rm cpu offloading chapter
2024-10-11 19:12:22 +08:00
..
DockerGuides [UPDATE] add prefix caching document into vllm_docker_quickstart.md (#12173) 2024-10-11 19:12:22 +08:00
Inference Update mddocs for part of Overview (2/2) and Inference (#11377) 2024-06-21 12:07:50 +08:00
Overview Support Windows ARL release (#12183) 2024-10-11 18:30:52 +08:00
PythonAPI Small fix (#11395) 2024-06-21 17:45:10 +08:00
Quickstart Support Windows ARL release (#12183) 2024-10-11 18:30:52 +08:00
README.md Add GraphRAG QuickStart (#11582) 2024-07-16 09:27:54 +08:00