Commit graph

4025 commits

Author SHA1 Message Date
Wang, Jian4
7809ca9864
Reuse --privileged (#13015)
* fix

* add
2025-03-27 10:00:50 +08:00
Guancheng Fu
f437b36678
Fix vllm glm edge model (#13007)
* fix done

* fix
2025-03-26 09:25:32 +08:00
Yuwen Hu
374747b492
Update bert optimization to fit higher transformers/torch version (#13006) 2025-03-25 16:12:03 +08:00
Ruonan Wang
27d669210f
remove fschat in EAGLE example (#13005)
* update fschat version

* fix
2025-03-25 15:48:48 +08:00
Shaojun Liu
08f96a5139
Rename LICENSE-Intel®-OpenMP*-Runtime-Library.txt to LICENSE-Intel®-OpenMP-Runtime-Library.txt (#13002) 2025-03-25 10:07:55 +08:00
Ruonan Wang
0e0786a63c
update llama.cpp related quickstart with rebased llama.cpp (#12996)
* update doc with reabsed llama.cpp

* revert table of contents

* update demo output log
2025-03-25 09:49:39 +08:00
Shaojun Liu
7a86dd0569
Remove unused Gradio (#12995) 2025-03-24 10:51:06 +08:00
Shaojun Liu
46a4f53967
OSPDT: add tpp licenses for release 2.2.0 (#12840)
* Create LICENSE-zstd.txt

* Create LICENSE-libcxx.txt

* Create LICENSE-libcxxabi.txt

* Create LICENSE-safestring.txt

* Create LICENSE-stb-image.txt

* Create LICENSE-cluster-agent.txt

* Create LICENSE-hd-agent.txt

* Create LICENSE-platform-telemetry-agent.txt

* Create LICENSE-platform-update-agent.txt

* Create LICENSE-OpenCL-ICD-Loader.txt

* Create LICENSE-xptifw.txt

* Create LICENSE-intel-openmp.txt

* Create LICENSE-Intel®-OpenMP*-Runtime-Library.txt

* Create LICENSE-Intel®-C-C++-Fortran-Compiler-Mainline.txt

* add TPP files

* Add TPP files

* add tpp

* add tpp

* update

* update
2025-03-21 15:52:22 +08:00
Yuwen Hu
5bdf57327d
Remove ipex import in fastchat loader (#12984) 2025-03-20 18:29:00 +08:00
Yuwen Hu
6f634b41da
Update model support list regarding Gemma3 for Ollama portable zip QuickStart (#12979)
* Update model support list regarding Gemma3 for Ollama portable zip QuickStart

* Small fix

* Small fix

* Small fix
2025-03-19 11:16:45 +08:00
Qiyuan Gong
dd026db50b
Add SNC to llama.cpp portable zip quick start (#12972)
* Add SNC to quick start
2025-03-17 10:58:06 +08:00
Shaojun Liu
b0d56273a8
Fix Docker build failure due to outdated ipex-llm pip index URL (#12977) 2025-03-17 10:46:01 +08:00
Shaojun Liu
760abc47aa
Fix Docker build failure due to outdated ipex-llm pip index URL (#12976) 2025-03-17 09:50:09 +08:00
Jason Dai
03c9024209
Update README (#12973) 2025-03-14 19:04:10 +08:00
Yuwen Hu
6a7819f1ac
Update portable zip related quickstart regarding recommanded driver (#12970) 2025-03-14 16:34:24 +08:00
Wang, Jian4
c9ecb7a113
Fix qwen nan value issue on vllm (#12971)
* add to fix qwen nan value issue

* update
2025-03-14 14:43:54 +08:00
Heyang Sun
cd109bb061
Gemma QLoRA example (#12969)
* Gemma QLoRA example

* Update README.md

* Update README.md

---------

Co-authored-by: sgwhat <ge.song@intel.com>
2025-03-14 14:27:51 +08:00
Yuwen Hu
8bc41c13ab
Support PyTorch 2.6 with Arrow Lake-H AOT on Windows (#12967) 2025-03-13 15:29:47 +08:00
Wang, Jian4
c8a0462507
Add vllm api_server input output log (#12962) 2025-03-12 20:58:04 +08:00
Jason Dai
3941f322c5
Update issue templates 2025-03-11 08:54:15 +08:00
Jason Dai
d0e443e893
Update issue templates 2025-03-11 08:53:01 +08:00
Shaojun Liu
6a2d87e40f
add --entrypoint /bin/bash (#12957)
Co-authored-by: gc-fu <guancheng.fu@intel.com>
2025-03-10 10:10:27 +08:00
Jason Dai
2a8f624f4b
Update README (#12956) 2025-03-09 09:04:13 +08:00
binbin Deng
5ee09b4b28
[NPU] Small update about zip doc (#12951) 2025-03-07 15:22:14 +08:00
Shaojun Liu
015a4c8c43
Add CPU and GPU Frequency Locking Instructions to Documentation (#12947) 2025-03-07 09:20:40 +08:00
Jason Dai
cb3c4b26ad
Update llamacpp_portable_zip_gpu_quickstart.md (#12945) 2025-03-06 11:58:11 +08:00
Jason Dai
1432c5d9a0
Update llamacpp_portable_zip_gpu_quickstart (#12941) 2025-03-06 10:01:56 +08:00
Jason Dai
32480cc8ed
Update llamacpp_portable_zip_gpu_quickstart (#12940) 2025-03-06 08:42:18 +08:00
Jason Dai
975cf5f21f
Update README.md (#12939) 2025-03-06 08:04:27 +08:00
joan726
eccb5b817e
Add llamacpp_portable_zip_gpu_quickstart.zh-CN.md (#12930)
* Add llamacpp_portable_zip_gpu_quickstart.zh-CN.md

Add llamacpp_portable_zip_gpu_quickstart.zh-CN.md

* Update README.zh-CN.md

 Changed and Linked to llamacpp portable zip.zh-CN.md.

* Update llamacpp_portable_zip_gpu_quickstart.md

Added CN version link

* Update README.zh-CN.md

Update all links to "llamacpp_portable_zip_gpu_quickstart.zh-CN.md

* Update llama_cpp_quickstart.zh-CN.md

* Update llamacpp_portable_zip_gpu_quickstart.zh-CN.md

Modify based on comments.

* Update llamacpp_portable_zip_gpu_quickstart.zh-CN.md

Modify based on comments.

* Update llamacpp_portable_zip_gpu_quickstart.zh-CN.md

Update the doc based on #12928

* Update llamacpp_portable_zip_gpu_quickstart.zh-CN.md

Add “More Details” on Table of Contents

* Update README.zh-CN.md

Update llamacpp_portable_zip_gpu_quickstart CN link

* Update README.zh-CN.md

Change llama.cpp link

* Update README.zh-CN.md

* Update README.md
2025-03-05 14:55:44 +08:00
Yuwen Hu
7c0c77cce3
Tiny fixes (#12936) 2025-03-05 14:55:26 +08:00
Yuwen Hu
68a770745b
Add moonlight GPU example (#12929)
* Add moonlight GPU example and update table

* Small fix

* Fix based on comments

* Small fix
2025-03-05 11:31:14 +08:00
Xin Qiu
33da3a3cb7
Update llama cpp portable zip quickstart (#12928)
* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md
2025-03-05 09:22:10 +08:00
Jason Dai
de09590ca3
Update llamacpp_portable_zip_gpu_quickstart.md (#12932) 2025-03-05 07:59:32 +08:00
Jason Dai
69edc8b6f6
Update quickstart (#12927) 2025-03-04 15:34:52 +08:00
Qiyuan Gong
0b5079833c
llama.cpp portable Zip for Linux quickstart (#12923)
* llamacpp Linux portable doc & flashmoe
2025-03-04 14:50:21 +08:00
binbin Deng
091ab2bd59
[NPU] Add troubleshooting in portable zip doc (#12924) 2025-03-04 10:41:39 +08:00
Yuwen Hu
b2d676f1c6
Further update Ollama portable zip quickstart (#12921)
* Update Chinese doc for ollama quickstart tips and troubleshooting

* Update for recommanded Windows OS

* Small fix

* Small fix
2025-03-03 18:07:57 +08:00
Shaojun Liu
f81d89d908
Remove Unnecessary --privileged Flag While Keeping It for WSL Users (#12920) 2025-03-03 11:11:42 +08:00
Shaojun Liu
7810b8fb49
OSPDT: update dockerfile header (#12908)
* Update Dockerfile

* Update Dockerfile

* Update Dockerfile

* Update Dockerfile
2025-03-03 09:59:11 +08:00
Yishuo Wang
b6f33d5c4d
optimize moonlight again (#12909) 2025-03-03 09:21:15 +08:00
Jason Dai
35e5fa851c
Update README.md (#12911) 2025-02-28 17:55:45 +08:00
binbin Deng
8351f6c455
[NPU] Add QuickStart for llama.cpp NPU portable zip (#12899) 2025-02-28 17:19:18 +08:00
Xin Qiu
029480f4a8
llama cpp portable zip Quickstart (#12894)
* llamacpp_quickstart

* update

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md

* Update llamacpp_portable_zip_gpu_quickstart.md
2025-02-28 15:45:11 +08:00
Yuwen Hu
443cb5d4e0
Update Janus-Pro GPU example (#12906) 2025-02-28 15:39:03 +08:00
Yuwen Hu
8d94752c4b
Ollama portable zip QuickStart updates regarding more tips (#12905)
* Update for select multiple GPUs

* Update Ollama portable zip quickstarts regarding more tips

* Small fix
2025-02-28 15:10:56 +08:00
Yishuo Wang
39e360fe9d
add grouped topk optimization for moonlight (#12903) 2025-02-28 13:25:56 +08:00
Xin Qiu
e946127613
glm 4v 1st sdp for vision (#12904)
* glm4v 1st sdp

* update glm4v example

* meet code review

* fix style
2025-02-28 13:23:27 +08:00
Shaojun Liu
5c100ac105
Add ENTRYPOINT to Dockerfile to auto-start vllm service on container launch (for CVTE customer) (#12901)
* Add ENTRYPOINT to Dockerfile to auto-start service on container launch (for CVTE client)

* Update start-vllm-service.sh

* Update README.md

* Update README.md

* Update start-vllm-service.sh

* Update README.md
2025-02-27 17:33:58 +08:00
Yishuo Wang
be1f073866
add fuse moe optimization for moonlight (#12898) 2025-02-27 09:15:24 +08:00