Jun Wang
6ffaec66a2
[UPDATE] add prefix caching document into vllm_docker_quickstart.md ( #12173 )
...
* [ADD] rewrite new vllm docker quick start
* [ADD] lora adapter doc finished
* [ADD] mulit lora adapter test successfully
* [ADD] add ipex-llm quantization doc
* [Merge] rebase main
* [REMOVE] rm tmp file
* [Merge] rebase main
* [ADD] add prefix caching experiment and result
* [REMOVE] rm cpu offloading chapter
2024-10-11 19:12:22 +08:00
Yuwen Hu
ddcdf47539
Support Windows ARL release ( #12183 )
...
* Support release for ARL
* Small fix
* Small fix to doc
* Temp for test
* Remove temp commit for test
2024-10-11 18:30:52 +08:00
Yuwen Hu
ac44e98b7d
Update Windows guide regarding LNL support ( #12178 )
...
* Update windows guide regarding LNL support
* Update based on comments
2024-10-11 09:20:08 +08:00
Guancheng Fu
0ef7e1d101
fix vllm docs ( #12176 )
2024-10-10 15:44:36 +08:00
Jun Wang
412cf8e20c
[UPDATE] update mddocs/DockerGuides/vllm_docker_quickstart.md ( #12166 )
...
* [ADD] rewrite new vllm docker quick start
* [ADD] lora adapter doc finished
* [ADD] mulit lora adapter test successfully
* [ADD] add ipex-llm quantization doc
* [UPDATE] update mmdocs vllm_docker_quickstart content
* [REMOVE] rm tmp file
* [UPDATE] tp and pp explaination and readthedoc link change
* [FIX] fix the error description of tp+pp and quantization part
* [FIX] fix the table of verifed model
* [UPDATE] add full low bit para list
* [UPDATE] update the load_in_low_bit params to verifed dtype
2024-10-09 11:19:32 +08:00
Ch1y0q
9b75806d14
Update Windows GPU quickstart regarding demo ( #12124 )
...
* use Qwen2-1.5B-Instruct in demo
* update
* add reference link
* update
* update
2024-09-29 18:08:49 +08:00
Ruonan Wang
a767438546
fix typo ( #12076 )
...
* fix typo
* fix
2024-09-13 11:44:42 +08:00
Ruonan Wang
3f0b24ae2b
update cpp quickstart ( #12075 )
...
* update cpp quickstart
* fix style
2024-09-13 11:35:32 +08:00
Ruonan Wang
48d9092b5a
upgrade OneAPI version for cpp Windows ( #12063 )
...
* update version
* update quickstart
2024-09-12 11:12:12 +08:00
Shaojun Liu
e5581e6ded
Select the Appropriate APT Repository Based on CPU Type ( #12023 )
2024-09-05 17:06:07 +08:00
Yuwen Hu
643458d8f0
Update GraphRAG QuickStart ( #11995 )
...
* Update GraphRAG QuickStart
* Further updates
* Small fixes
* Small fix
2024-09-03 15:52:08 +08:00
Jinhe
e895e1b4c5
modification on llamacpp readme after Ipex-llm latest update ( #11971 )
...
* update on readme after ipex-llm update
* update on readme after ipex-llm update
* rebase & delete redundancy
* revise
* add numbers for troubleshooting
2024-08-30 11:36:45 +08:00
Ch1y0q
77b04efcc5
add notes for SYCL_PI_LEVEL_ZERO_USE_IMMEDIATE_COMMANDLISTS ( #11936 )
...
* add notes for `SYCL_PI_LEVEL_ZERO_USE_IMMEDIATE_COMMANDLISTS`
* also update other quickstart
2024-08-30 09:26:47 +08:00
Jinhe
6fc9340d53
restore ollama webui quickstart ( #11955 )
2024-08-29 17:53:19 +08:00
Jinhe
ec67ee7177
added accelerate version specification in open webui quickstart( #11948 )
2024-08-28 15:02:39 +08:00
Ruonan Wang
460bc96d32
update version of llama.cpp / ollama ( #11930 )
...
* update version
* fix version
2024-08-27 21:21:44 +08:00
Ch1y0q
5a8fc1baa2
update troubleshooting for llama.cpp and ollama ( #11890 )
...
* update troubleshooting for llama.cpp and ollama
* update
* update
2024-08-26 20:55:23 +08:00
Jinhe
dbd14251dd
Troubleshoot for sycl not found ( #11774 )
...
* added troubleshoot for sycl not found problem
* added troubleshoot for sycl not found problem
* revision on troubleshoot
* revision on troubleshoot
2024-08-14 10:26:01 +08:00
Shaojun Liu
fac4c01a6e
Revert to use out-of-tree GPU driver ( #11761 )
...
* Revert to use out-of-tree GPU driver since the performance with out-of-tree driver is better than upsteam's
* add spaces
* add troubleshooting case
* update Troubleshooting
2024-08-12 13:41:47 +08:00
Yuwen Hu
7e61fa1af7
Revise GPU driver related guide in for Windows users ( #11740 )
2024-08-08 11:26:26 +08:00
Jinhe
d0c89fb715
updated llama.cpp and ollama quickstart ( #11732 )
...
* updated llama.cpp and ollama quickstart.md
* added qwen2-1.5B sample output
* revision on quickstart updates
* revision on quickstart updates
* revision on qwen2 readme
* added 2 troubleshoots“
”
* troubleshoot revision
2024-08-08 11:04:01 +08:00
Qiyuan Gong
e32d13d78c
Remove Out of tree Driver from GPU driver installation document ( #11728 )
...
GPU drivers are already upstreamed to Kernel 6.2+. Remove the out-of-tree driver (intel-i915-dkms) for 6.2-6.5. https://dgpu-docs.intel.com/driver/kernel-driver-types.html#gpu-driver-support
* Remove intel-i915-dkms intel-fw-gpu (only for kernel 5.19)
2024-08-07 09:38:19 +08:00
Jason Dai
418640e466
Update install_gpu.md
2024-07-27 08:30:10 +08:00
Ruonan Wang
ac97b31664
update cpp quickstart about ONEAPI_DEVICE_SELECTOR ( #11630 )
...
* update
* update
* small fix
2024-07-22 13:40:28 +08:00
Yuwen Hu
af6d406178
Add section title for conduct graphrag indexing ( #11628 )
2024-07-22 10:23:26 +08:00
Ruonan Wang
4da93709b1
update doc/setup to use onednn gemm for cpp ( #11598 )
...
* update doc/setup to use onednn gemm
* small fix
* Change TOC of graphrag quickstart back
2024-07-18 13:04:38 +08:00
Yuwen Hu
f06d2f72fb
Add GraphRAG QuickStart ( #11582 )
...
* Add framework for graphrag quickstart
* Add quickstart contents for graphrag
* Small fixes and add toc
* Update for graph
* Small fixes
2024-07-16 09:27:54 +08:00
Xin Qiu
91409ffe8c
Add mtl AOT packages in faq.md ( #11577 )
...
* Update faq.md
* Update faq.md
* Update faq.md
* Update faq.md
* Update faq.md
2024-07-16 08:46:03 +08:00
binbin Deng
66f6ffe4b2
Update GPU HF-Transformers example structure ( #11526 )
2024-07-08 17:58:06 +08:00
Shaojun Liu
72b4efaad4
Enhanced XPU Dockerfiles: Optimized Environment Variables and Documentation ( #11506 )
...
* Added SYCL_CACHE_PERSISTENT=1 to xpu Dockerfile
* Update the document to add explanations for environment variables.
* update quickstart
2024-07-04 20:18:38 +08:00
Yuwen Hu
1638573f56
Update llama cpp quickstart regarding windows prerequisites to avoid misleading ( #11490 )
2024-07-02 16:15:47 +08:00
SichengStevenLi
86b81c09d9
Table of Contents in Quickstart Files ( #11437 )
...
* fixed a minor grammar mistake
* added table of contents
* added table of contents
* changed table of contents indexing
* added table of contents
* added table of contents, changed grammar
* added table of contents
* added table of contents
* added table of contents
* added table of contents
* added table of contents
* added table of contents, modified chapter numbering
* fixed troubleshooting section redirection path
* added table of contents
* added table of contents, modified section numbering
* added table of contents, modified section numbering
* added table of contents
* added table of contents, changed title size, modified numbering
* added table of contents, changed section title size and capitalization
* added table of contents, modified section numbering
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents syntax
* changed table of contents capitalization issue
* changed table of contents capitalization issue
* changed table of contents location
* changed table of contents
* changed table of contents
* changed section capitalization
* removed comments
* removed comments
* removed comments
2024-06-28 10:41:00 +08:00
Yuwen Hu
a45ceac4e4
Update main readme for missing quickstarts ( #11427 )
...
* Update main readme to add missing quckstart
* Update quickstart index page
* Small fixes
* Small fix
2024-06-26 13:51:42 +08:00
Yuwen Hu
ecb9efde65
Workaround if demo preview image load slow in mddocs ( #11412 )
...
* Small tests for demo video workaround
* Small fix
* Add workaround for langchain-chatchat demo video
* Small fix
* Small fix
* Update for other demo videos in quickstart
* Add missing for text-generation-webui quickstart
2024-06-24 16:17:50 +08:00
Yuwen Hu
ccb3fb357a
Add mddocs index ( #11411 )
2024-06-24 15:35:18 +08:00
Shengsheng Huang
475b0213d2
README update (API doc and FAQ and minor fixes) ( #11397 )
...
* add faq and API doc link in README.md
* add missing quickstart link
* update links in FAQ
* update links in FAQ
* update faq
* update faq text
2024-06-21 19:46:32 +08:00
Yuwen Hu
2004fe1a43
Small fix ( #11395 )
2024-06-21 17:45:10 +08:00
Yuwen Hu
4cb9a4728e
Add index page for API doc & links update in mddocs ( #11393 )
...
* Small fixes
* Add initial api doc index
* Change index.md -> README.md
* Fix on API links
2024-06-21 17:34:34 +08:00
Xu, Shuo
b200e11e21
Add initial python api doc in mddoc (2/2) ( #11388 )
...
* add PyTorch-API.md
* small change
* small change
---------
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-06-21 17:15:05 +08:00
Yuwen Hu
aafd6d55cd
Add initial python api doc in mddoc (1/2) ( #11389 )
...
* Add initial python api mddoc
* Fix based on comments
2024-06-21 17:14:42 +08:00
Yuwen Hu
a027121530
Small mddoc fixed based on review ( #11391 )
...
* Fix based on review
* Further fix
* Small fix
* Small fix
2024-06-21 17:09:30 +08:00
Yuwen Hu
54f9d07d8f
Further mddocs fixes ( #11386 )
...
* Update mddocs for ragflow quickstart
* Fixes for docker guides mddocs
* Further fixes
2024-06-21 13:27:43 +08:00
Yuwen Hu
9b475c07db
Add missing ragflow quickstart in mddocs and update legecy contents ( #11385 )
2024-06-21 12:28:26 +08:00
Xu, Shuo
fed79f106b
Update mddocs for DockerGuides ( #11380 )
...
* transfer files in DockerGuides from rst to md
* add some dividing lines
* adjust the title hierarchy in docker_cpp_xpu_quickstart.md
* restore
* switch to the correct branch
* small change
---------
Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-06-21 12:10:35 +08:00
SichengStevenLi
1a1a97c9e4
Update mddocs for part of Overview (2/2) and Inference ( #11377 )
...
* updated link
* converted to md format, need to be reviewed
* converted to md format, need to be reviewed
* converted to md format, need to be reviewed
* converted to md format, need to be reviewed
* converted to md format, need to be reviewed
* converted to md format, need to be reviewed
* converted to md format, need to be reviewed
* converted to md format, need to be reviewed
* converted to md format, need to be reviewed
* converted to md format, need to be reviewed, deleted some leftover texts
* converted to md file type, need to be reviewed
* converted to md file type, need to be reviewed
* testing Github Tags
* testing Github Tags
* added Github Tags
* added Github Tags
* added Github Tags
* Small fix
* Small fix
* Small fix
* Small fix
* Small fix
* Further fix
* Fix index
* Small fix
* Fix
---------
Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>
2024-06-21 12:07:50 +08:00
Zijie Li
33b9a9c4c9
Update part of Overview guide in mddocs (1/2) ( #11378 )
...
* Create install.md
* Update install_cpu.md
* Delete original docs/mddocs/Overview/install_cpu.md
* Update install_cpu.md
* Update install_gpu.md
* update llm.md and install.md
* Update docs in KeyFeatures
* Review and fix typos
* Fix on folded NOTE
* Small fix
* Small fix
* Remove empty known_issue.md
* Small fix
* Small fix
* Further fix
* Fixes
* Fix
---------
Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>
2024-06-21 10:45:17 +08:00
Jin Qiao
9a3a21e4fc
Update part of Quickstart guide in mddocs (2/2) ( #11376 )
...
* axolotl_quickstart.md
* benchmark_quickstart.md
* bigdl_llm_migration.md
* chatchat_quickstart.md
* continue_quickstart.md
* deepspeed_autotp_fastapi_quickstart.md
* dify_quickstart.md
* fastchat_quickstart.md
* adjust tab style
* fix link
* fix link
* add video preview
* Small fixes
* Small fix
---------
Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>
2024-06-20 19:03:06 +08:00
Yuwen Hu
8c9f877171
Update part of Quickstart guide in mddocs (1/2)
...
* Quickstart index.rst -> index.md
* Update for Linux Install Quickstart
* Update md docs for Windows Install QuickStart
* Small fix
* Add blank lines
* Update mddocs for llama cpp quickstart
* Update mddocs for llama3 llama-cpp and ollama quickstart
* Update mddocs for ollama quickstart
* Update mddocs for openwebui quickstart
* Update mddocs for privateGPT quickstart
* Update mddocs for vllm quickstart
* Small fix
* Update mddocs for text-generation-webui quickstart
* Update for video links
2024-06-20 18:43:23 +08:00
Yuwen Hu
d9dd1b70bd
Remove example page in mddocs ( #11373 )
2024-06-20 14:23:43 +08:00
Yuwen Hu
769728c1eb
Add initial md docs ( #11371 )
2024-06-20 13:47:49 +08:00