ipex-llm/docs/mddocs/Quickstart
Yuwen Hu 381d448ee2
[NPU] Example & Quickstart updates (#12650)
* Remove model with optimize_model=False in NPU verified models tables, and remove related example

* Remove experimental in run optimized model section title

* Unify model table order & example cmd

* Move embedding example to separate folder & update quickstart example link

* Add Quickstart reference in main NPU readme

* Small fix

* Small fix

* Move save/load examples under NPU/HF-Transformers-AutoModels

* Add low-bit and polish arguments for LLM Python examples

* Small fix

* Add low-bit and polish arguments for Multi-Model  examples

* Polish argument for Embedding models

* Polish argument for LLM CPP examples

* Add low-bit and polish argument for Save-Load examples

* Add accuracy tuning tips for examples

* Update NPU qucikstart accuracy tuning with low-bit optimizations

* Add save/load section to qucikstart

* Update CPP example sample output to EN

* Add installation regarding cmake for CPP examples

* Small fix

* Small fix

* Small fix

* Small fix

* Small fix

* Small fix

* Unify max prompt length to 512

* Change recommended low-bit for Qwen2.5-3B-Instruct to asym_int4

* Update based on comments

* Small fix
2025-01-07 13:52:41 +08:00
..
axolotl_quickstart.md Fix application quickstart (#12305) 2024-10-31 16:57:35 +08:00
benchmark_quickstart.md Remove env variable BIGDL_LLM_XMX_DISABLED in documentation (#12445) 2024-11-27 11:16:36 +08:00
bigdl_llm_migration.md Table of Contents in Quickstart Files (#11437) 2024-06-28 10:41:00 +08:00
chatchat_quickstart.md Table of Contents in Quickstart Files (#11437) 2024-06-28 10:41:00 +08:00
continue_quickstart.md add notes for SYCL_PI_LEVEL_ZERO_USE_IMMEDIATE_COMMANDLISTS (#11936) 2024-08-30 09:26:47 +08:00
deepspeed_autotp_fastapi_quickstart.md Table of Contents in Quickstart Files (#11437) 2024-06-28 10:41:00 +08:00
dify_quickstart.md Table of Contents in Quickstart Files (#11437) 2024-06-28 10:41:00 +08:00
fastchat_quickstart.md add notes for SYCL_PI_LEVEL_ZERO_USE_IMMEDIATE_COMMANDLISTS (#11936) 2024-08-30 09:26:47 +08:00
graphrag_quickstart.md Fix application quickstart (#12305) 2024-10-31 16:57:35 +08:00
install_linux_gpu.md Small typo fixes (#12558) 2024-12-17 13:54:13 +08:00
install_linux_gpu.zh-CN.md Small typo fixes (#12558) 2024-12-17 13:54:13 +08:00
install_windows_gpu.md Small typo fixes (#12558) 2024-12-17 13:54:13 +08:00
install_windows_gpu.zh-CN.md Small typo fixes (#12558) 2024-12-17 13:54:13 +08:00
llama3_llamacpp_ollama_quickstart.md modification on llamacpp readme after Ipex-llm latest update (#11971) 2024-08-30 11:36:45 +08:00
llama_cpp_quickstart.md Update ollama and llama.cpp readme (#12574) 2024-12-18 17:33:20 +08:00
llama_cpp_quickstart.zh-CN.md Update ollama and llama.cpp readme (#12574) 2024-12-18 17:33:20 +08:00
npu_quickstart.md [NPU] Example & Quickstart updates (#12650) 2025-01-07 13:52:41 +08:00
ollama_quickstart.md [Doc] Update ipex-llm ollama troubleshooting for v0.4.6 (#12642) 2025-01-02 17:28:54 +08:00
ollama_quickstart.zh-CN.md [Doc] Update ipex-llm ollama troubleshooting for v0.4.6 (#12642) 2025-01-02 17:28:54 +08:00
open_webui_with_ollama_quickstart.md [docs] Update doc for latest open webui: 0.4.8 (#12591) 2024-12-26 09:18:20 +08:00
privateGPT_quickstart.md Table of Contents in Quickstart Files (#11437) 2024-06-28 10:41:00 +08:00
ragflow_quickstart.md Fix application quickstart (#12305) 2024-10-31 16:57:35 +08:00
README.md Add NPU QuickStart & update example links (#12470) 2024-12-02 17:03:10 +08:00
vLLM_quickstart.md fix vllm docs (#12176) 2024-10-10 15:44:36 +08:00
webui_quickstart.md Remove env variable BIGDL_LLM_XMX_DISABLED in documentation (#12445) 2024-11-27 11:16:36 +08:00