ipex-llm

Author	SHA1	Message	Date
Shaojun Liu	7a86dd0569	Remove unused Gradio (#12995 )	2025-03-24 10:51:06 +08:00
Shaojun Liu	46a4f53967	OSPDT: add tpp licenses for release 2.2.0 (#12840 ) * Create LICENSE-zstd.txt * Create LICENSE-libcxx.txt * Create LICENSE-libcxxabi.txt * Create LICENSE-safestring.txt * Create LICENSE-stb-image.txt * Create LICENSE-cluster-agent.txt * Create LICENSE-hd-agent.txt * Create LICENSE-platform-telemetry-agent.txt * Create LICENSE-platform-update-agent.txt * Create LICENSE-OpenCL-ICD-Loader.txt * Create LICENSE-xptifw.txt * Create LICENSE-intel-openmp.txt * Create LICENSE-Intel®-OpenMP-Runtime-Library.txt Create LICENSE-Intel®-C-C++-Fortran-Compiler-Mainline.txt * add TPP files * Add TPP files * add tpp * add tpp * update * update	2025-03-21 15:52:22 +08:00
Yuwen Hu	5bdf57327d	Remove ipex import in fastchat loader (#12984 )	2025-03-20 18:29:00 +08:00
Yuwen Hu	6f634b41da	Update model support list regarding Gemma3 for Ollama portable zip QuickStart (#12979 ) * Update model support list regarding Gemma3 for Ollama portable zip QuickStart * Small fix * Small fix * Small fix	2025-03-19 11:16:45 +08:00
Qiyuan Gong	dd026db50b	Add SNC to llama.cpp portable zip quick start (#12972 ) * Add SNC to quick start	2025-03-17 10:58:06 +08:00
Shaojun Liu	b0d56273a8	Fix Docker build failure due to outdated ipex-llm pip index URL (#12977 )	2025-03-17 10:46:01 +08:00
Shaojun Liu	760abc47aa	Fix Docker build failure due to outdated ipex-llm pip index URL (#12976 )	2025-03-17 09:50:09 +08:00
Jason Dai	03c9024209	Update README (#12973 )	2025-03-14 19:04:10 +08:00
Yuwen Hu	6a7819f1ac	Update portable zip related quickstart regarding recommanded driver (#12970 )	2025-03-14 16:34:24 +08:00
Wang, Jian4	c9ecb7a113	Fix qwen nan value issue on vllm (#12971 ) * add to fix qwen nan value issue * update	2025-03-14 14:43:54 +08:00
Heyang Sun	cd109bb061	Gemma QLoRA example (#12969 ) * Gemma QLoRA example * Update README.md * Update README.md --------- Co-authored-by: sgwhat <ge.song@intel.com>	2025-03-14 14:27:51 +08:00
Yuwen Hu	8bc41c13ab	Support PyTorch 2.6 with Arrow Lake-H AOT on Windows (#12967 )	2025-03-13 15:29:47 +08:00
Wang, Jian4	c8a0462507	Add vllm api_server input output log (#12962 )	2025-03-12 20:58:04 +08:00
Jason Dai	3941f322c5	Update issue templates	2025-03-11 08:54:15 +08:00
Jason Dai	d0e443e893	Update issue templates	2025-03-11 08:53:01 +08:00
Shaojun Liu	6a2d87e40f	add `--entrypoint /bin/bash` (#12957 ) Co-authored-by: gc-fu <guancheng.fu@intel.com>	2025-03-10 10:10:27 +08:00
Jason Dai	2a8f624f4b	Update README (#12956 )	2025-03-09 09:04:13 +08:00
binbin Deng	5ee09b4b28	[NPU] Small update about zip doc (#12951 )	2025-03-07 15:22:14 +08:00
Shaojun Liu	015a4c8c43	Add CPU and GPU Frequency Locking Instructions to Documentation (#12947 )	2025-03-07 09:20:40 +08:00
Jason Dai	cb3c4b26ad	Update llamacpp_portable_zip_gpu_quickstart.md (#12945 )	2025-03-06 11:58:11 +08:00
Jason Dai	1432c5d9a0	Update llamacpp_portable_zip_gpu_quickstart (#12941 )	2025-03-06 10:01:56 +08:00
Jason Dai	32480cc8ed	Update llamacpp_portable_zip_gpu_quickstart (#12940 )	2025-03-06 08:42:18 +08:00
Jason Dai	975cf5f21f	Update README.md (#12939 )	2025-03-06 08:04:27 +08:00
joan726	eccb5b817e	Add llamacpp_portable_zip_gpu_quickstart.zh-CN.md (#12930 ) * Add llamacpp_portable_zip_gpu_quickstart.zh-CN.md Add llamacpp_portable_zip_gpu_quickstart.zh-CN.md * Update README.zh-CN.md Changed and Linked to llamacpp portable zip.zh-CN.md. * Update llamacpp_portable_zip_gpu_quickstart.md Added CN version link * Update README.zh-CN.md Update all links to "llamacpp_portable_zip_gpu_quickstart.zh-CN.md * Update llama_cpp_quickstart.zh-CN.md * Update llamacpp_portable_zip_gpu_quickstart.zh-CN.md Modify based on comments. * Update llamacpp_portable_zip_gpu_quickstart.zh-CN.md Modify based on comments. * Update llamacpp_portable_zip_gpu_quickstart.zh-CN.md Update the doc based on #12928 * Update llamacpp_portable_zip_gpu_quickstart.zh-CN.md Add “More Details” on Table of Contents * Update README.zh-CN.md Update llamacpp_portable_zip_gpu_quickstart CN link * Update README.zh-CN.md Change llama.cpp link * Update README.zh-CN.md * Update README.md	2025-03-05 14:55:44 +08:00
Yuwen Hu	7c0c77cce3	Tiny fixes (#12936 )	2025-03-05 14:55:26 +08:00
Yuwen Hu	68a770745b	Add moonlight GPU example (#12929 ) * Add moonlight GPU example and update table * Small fix * Fix based on comments * Small fix	2025-03-05 11:31:14 +08:00
Xin Qiu	33da3a3cb7	Update llama cpp portable zip quickstart (#12928 ) * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md	2025-03-05 09:22:10 +08:00
Jason Dai	de09590ca3	Update llamacpp_portable_zip_gpu_quickstart.md (#12932 )	2025-03-05 07:59:32 +08:00
Jason Dai	69edc8b6f6	Update quickstart (#12927 )	2025-03-04 15:34:52 +08:00
Qiyuan Gong	0b5079833c	llama.cpp portable Zip for Linux quickstart (#12923 ) * llamacpp Linux portable doc & flashmoe	2025-03-04 14:50:21 +08:00
binbin Deng	091ab2bd59	[NPU] Add troubleshooting in portable zip doc (#12924 )	2025-03-04 10:41:39 +08:00
Yuwen Hu	b2d676f1c6	Further update Ollama portable zip quickstart (#12921 ) * Update Chinese doc for ollama quickstart tips and troubleshooting * Update for recommanded Windows OS * Small fix * Small fix	2025-03-03 18:07:57 +08:00
Shaojun Liu	f81d89d908	Remove Unnecessary --privileged Flag While Keeping It for WSL Users (#12920 )	2025-03-03 11:11:42 +08:00
Shaojun Liu	7810b8fb49	OSPDT: update dockerfile header (#12908 ) * Update Dockerfile * Update Dockerfile * Update Dockerfile * Update Dockerfile	2025-03-03 09:59:11 +08:00
Yishuo Wang	b6f33d5c4d	optimize moonlight again (#12909 )	2025-03-03 09:21:15 +08:00
Jason Dai	35e5fa851c	Update README.md (#12911 )	2025-02-28 17:55:45 +08:00
binbin Deng	8351f6c455	[NPU] Add QuickStart for llama.cpp NPU portable zip (#12899 )	2025-02-28 17:19:18 +08:00
Xin Qiu	029480f4a8	llama cpp portable zip Quickstart (#12894 ) * llamacpp_quickstart * update * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md * Update llamacpp_portable_zip_gpu_quickstart.md	2025-02-28 15:45:11 +08:00
Yuwen Hu	443cb5d4e0	Update Janus-Pro GPU example (#12906 )	2025-02-28 15:39:03 +08:00
Yuwen Hu	8d94752c4b	Ollama portable zip QuickStart updates regarding more tips (#12905 ) * Update for select multiple GPUs * Update Ollama portable zip quickstarts regarding more tips * Small fix	2025-02-28 15:10:56 +08:00
Yishuo Wang	39e360fe9d	add grouped topk optimization for moonlight (#12903 )	2025-02-28 13:25:56 +08:00
Xin Qiu	e946127613	glm 4v 1st sdp for vision (#12904 ) * glm4v 1st sdp * update glm4v example * meet code review * fix style	2025-02-28 13:23:27 +08:00
Shaojun Liu	5c100ac105	Add ENTRYPOINT to Dockerfile to auto-start vllm service on container launch (for CVTE customer) (#12901 ) * Add ENTRYPOINT to Dockerfile to auto-start service on container launch (for CVTE client) * Update start-vllm-service.sh * Update README.md * Update README.md * Update start-vllm-service.sh * Update README.md	2025-02-27 17:33:58 +08:00
Yishuo Wang	be1f073866	add fuse moe optimization for moonlight (#12898 )	2025-02-27 09:15:24 +08:00
Jason Dai	ad65e2b03a	Update README.md (#12900 )	2025-02-27 08:30:06 +08:00
Yishuo Wang	5faba06409	simple optimization for moonlight moe decoding forward (#12891 )	2025-02-25 16:18:27 +08:00
Xiangyu Tian	ae9f5320da	vLLM CPU: Fix Triton Version to Resolve Related Error(#12893 )	2025-02-25 15:00:41 +08:00
Yishuo Wang	ab3fc66eb7	optimize attention part of moonlight-14B-A3B (#12886 )	2025-02-25 09:38:13 +08:00
Shaojun Liu	dd30d12cb6	Fix serving-cpu image: setuptools-scm requires setuptools>=61 (#12876 ) * setuptools-scm requires setuptools>=61 * Update Dockerfile * Update Dockerfile * Update Dockerfile	2025-02-25 09:10:14 +08:00
Yuwen Hu	06694ba61a	Further fix portable zip file link (#12885 )	2025-02-24 18:06:57 +08:00

1 2 3 4 5 ...

4019 commits