Commit graph

761 commits

Author SHA1 Message Date
Shaojun Liu
fac4c01a6e
Revert to use out-of-tree GPU driver (#11761)
* Revert to use out-of-tree GPU driver since the performance with out-of-tree driver is better than upsteam's

* add spaces

* add troubleshooting case

* update Troubleshooting
2024-08-12 13:41:47 +08:00
Yuwen Hu
7e61fa1af7
Revise GPU driver related guide in for Windows users (#11740) 2024-08-08 11:26:26 +08:00
Jinhe
d0c89fb715
updated llama.cpp and ollama quickstart (#11732)
* updated llama.cpp and ollama quickstart.md

* added qwen2-1.5B sample output

* revision on quickstart updates

* revision on quickstart updates

* revision on qwen2 readme

* added 2 troubleshoots“
”

* troubleshoot revision
2024-08-08 11:04:01 +08:00
Qiyuan Gong
e32d13d78c
Remove Out of tree Driver from GPU driver installation document (#11728)
GPU drivers are already upstreamed to Kernel 6.2+. Remove the out-of-tree driver (intel-i915-dkms) for 6.2-6.5. https://dgpu-docs.intel.com/driver/kernel-driver-types.html#gpu-driver-support
* Remove intel-i915-dkms intel-fw-gpu (only for kernel 5.19)
2024-08-07 09:38:19 +08:00
Jason Dai
418640e466
Update install_gpu.md 2024-07-27 08:30:10 +08:00
Ruonan Wang
ac97b31664
update cpp quickstart about ONEAPI_DEVICE_SELECTOR (#11630)
* update

* update

* small fix
2024-07-22 13:40:28 +08:00
Yuwen Hu
af6d406178
Add section title for conduct graphrag indexing (#11628) 2024-07-22 10:23:26 +08:00
Ruonan Wang
4da93709b1
update doc/setup to use onednn gemm for cpp (#11598)
* update doc/setup to use onednn gemm

* small fix

* Change TOC of graphrag quickstart back
2024-07-18 13:04:38 +08:00
Yuwen Hu
f06d2f72fb
Add GraphRAG QuickStart (#11582)
* Add framework for graphrag quickstart

* Add quickstart contents for graphrag

* Small fixes and add toc

* Update for graph

* Small fixes
2024-07-16 09:27:54 +08:00
Xin Qiu
91409ffe8c
Add mtl AOT packages in faq.md (#11577)
* Update faq.md

* Update faq.md

* Update faq.md

* Update faq.md

* Update faq.md
2024-07-16 08:46:03 +08:00
binbin Deng
66f6ffe4b2
Update GPU HF-Transformers example structure (#11526) 2024-07-08 17:58:06 +08:00
Shaojun Liu
72b4efaad4
Enhanced XPU Dockerfiles: Optimized Environment Variables and Documentation (#11506)
* Added SYCL_CACHE_PERSISTENT=1 to xpu Dockerfile

* Update the document to add explanations for environment variables.

* update quickstart
2024-07-04 20:18:38 +08:00
Yuwen Hu
1638573f56
Update llama cpp quickstart regarding windows prerequisites to avoid misleading (#11490) 2024-07-02 16:15:47 +08:00
SichengStevenLi
86b81c09d9
Table of Contents in Quickstart Files (#11437)
* fixed a minor grammar mistake

* added table of contents

* added table of contents

* changed table of contents indexing

* added table of contents

* added table of contents, changed grammar

* added table of contents

* added table of contents

* added table of contents

* added table of contents

* added table of contents

* added table of contents, modified chapter numbering

* fixed troubleshooting section redirection path

* added table of contents

* added table of contents, modified section numbering

* added table of contents, modified section numbering

* added table of contents

* added table of contents, changed title size, modified numbering

* added table of contents, changed section title size and capitalization

* added table of contents, modified section numbering

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents syntax

* changed table of contents capitalization issue

* changed table of contents capitalization issue

* changed table of contents location

* changed table of contents

* changed table of contents

* changed section capitalization

* removed comments

* removed comments

* removed comments
2024-06-28 10:41:00 +08:00
Yuwen Hu
a45ceac4e4
Update main readme for missing quickstarts (#11427)
* Update main readme to add missing quckstart

* Update quickstart index page

* Small fixes

* Small fix
2024-06-26 13:51:42 +08:00
Yuwen Hu
ecb9efde65
Workaround if demo preview image load slow in mddocs (#11412)
* Small tests for demo video workaround

* Small fix

* Add workaround for langchain-chatchat demo video

* Small fix

* Small fix

* Update for other demo videos in quickstart

* Add missing for text-generation-webui quickstart
2024-06-24 16:17:50 +08:00
Yuwen Hu
ccb3fb357a
Add mddocs index (#11411) 2024-06-24 15:35:18 +08:00
Shengsheng Huang
475b0213d2
README update (API doc and FAQ and minor fixes) (#11397)
* add faq and API doc link in README.md

* add missing quickstart link

* update links in FAQ

* update links in FAQ

* update faq

* update faq text
2024-06-21 19:46:32 +08:00
Yuwen Hu
2004fe1a43
Small fix (#11395) 2024-06-21 17:45:10 +08:00
Yuwen Hu
4cb9a4728e
Add index page for API doc & links update in mddocs (#11393)
* Small fixes

* Add initial api doc index

* Change index.md -> README.md

* Fix on API links
2024-06-21 17:34:34 +08:00
Xu, Shuo
b200e11e21
Add initial python api doc in mddoc (2/2) (#11388)
* add PyTorch-API.md

* small change

* small change

---------

Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-06-21 17:15:05 +08:00
Yuwen Hu
aafd6d55cd
Add initial python api doc in mddoc (1/2) (#11389)
* Add initial python api mddoc

* Fix based on comments
2024-06-21 17:14:42 +08:00
Yuwen Hu
a027121530
Small mddoc fixed based on review (#11391)
* Fix based on review

* Further fix

* Small fix

* Small fix
2024-06-21 17:09:30 +08:00
Yuwen Hu
54f9d07d8f
Further mddocs fixes (#11386)
* Update mddocs for ragflow quickstart

* Fixes for docker guides mddocs

* Further fixes
2024-06-21 13:27:43 +08:00
ivy-lv11
21fc781fce
Add GLM-4V example (#11343)
* add example

* modify

* modify

* add line

* add

* add link and replace with phi-3-vision template

* fix generate options

* fix

* fix

---------

Co-authored-by: jinbridge <2635480475@qq.com>
2024-06-21 12:54:31 +08:00
Yuwen Hu
9b475c07db
Add missing ragflow quickstart in mddocs and update legecy contents (#11385) 2024-06-21 12:28:26 +08:00
Xu, Shuo
fed79f106b
Update mddocs for DockerGuides (#11380)
* transfer files in DockerGuides from rst to md

* add some dividing lines

* adjust the title hierarchy in docker_cpp_xpu_quickstart.md

* restore

* switch to the correct branch

* small change

---------

Co-authored-by: ATMxsp01 <shou.xu@intel.com>
2024-06-21 12:10:35 +08:00
SichengStevenLi
1a1a97c9e4
Update mddocs for part of Overview (2/2) and Inference (#11377)
* updated link

* converted to md format, need to be reviewed

* converted to md format, need to be reviewed

* converted to md format, need to be reviewed

* converted to md format, need to be reviewed

* converted to md format, need to be reviewed

* converted to md format, need to be reviewed

* converted to md format, need to be reviewed

* converted to md format, need to be reviewed

* converted to md format, need to be reviewed

* converted to md format, need to be reviewed, deleted some leftover texts

* converted to md file type, need to be reviewed

* converted to md file type, need to be reviewed

* testing Github Tags

* testing Github Tags

* added Github Tags

* added Github Tags

* added Github Tags

* Small fix

* Small fix

* Small fix

* Small fix

* Small fix

* Further fix

* Fix index

* Small fix

* Fix

---------

Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>
2024-06-21 12:07:50 +08:00
Zijie Li
33b9a9c4c9
Update part of Overview guide in mddocs (1/2) (#11378)
* Create install.md

* Update install_cpu.md

* Delete original docs/mddocs/Overview/install_cpu.md

* Update install_cpu.md

* Update install_gpu.md

* update llm.md and install.md

* Update docs in KeyFeatures

* Review and fix typos

* Fix on folded NOTE

* Small fix

* Small fix

* Remove empty known_issue.md

* Small fix

* Small fix

* Further fix

* Fixes

* Fix

---------

Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>
2024-06-21 10:45:17 +08:00
Jin Qiao
9a3a21e4fc
Update part of Quickstart guide in mddocs (2/2) (#11376)
* axolotl_quickstart.md

* benchmark_quickstart.md

* bigdl_llm_migration.md

* chatchat_quickstart.md

* continue_quickstart.md

* deepspeed_autotp_fastapi_quickstart.md

* dify_quickstart.md

* fastchat_quickstart.md

* adjust tab style

* fix link

* fix link

* add video preview

* Small fixes

* Small fix

---------

Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>
2024-06-20 19:03:06 +08:00
Yuwen Hu
8c9f877171
Update part of Quickstart guide in mddocs (1/2)
* Quickstart index.rst -> index.md

* Update for Linux Install Quickstart

* Update md docs for Windows Install QuickStart

* Small fix

* Add blank lines

* Update mddocs for llama cpp quickstart

* Update mddocs for llama3 llama-cpp and ollama quickstart

* Update mddocs for ollama quickstart

* Update mddocs for openwebui quickstart

* Update mddocs for privateGPT quickstart

* Update mddocs for vllm quickstart

* Small fix

* Update mddocs for text-generation-webui quickstart

* Update for video links
2024-06-20 18:43:23 +08:00
Yuwen Hu
d9dd1b70bd
Remove example page in mddocs (#11373) 2024-06-20 14:23:43 +08:00
Yuwen Hu
769728c1eb
Add initial md docs (#11371) 2024-06-20 13:47:49 +08:00
Shengsheng Huang
9601fae5d5
fix system note (#11368) 2024-06-20 11:09:53 +08:00
Shengsheng Huang
ed4c439497
small fix (#11366) 2024-06-20 10:38:20 +08:00
Shengsheng Huang
a721c1ae43
minor fix of ragflow_quickstart.md (#11364) 2024-06-19 22:30:33 +08:00
Shengsheng Huang
13727635e8
revise ragflow quickstart (#11363)
* revise ragflow quickstart

* update titles and split the quickstart into sections

* update
2024-06-19 22:24:31 +08:00
Zijie Li
5283df0078
LLM: Add RAGFlow with Ollama Example QuickStart (#11338)
* Create ragflow.md

* Update ragflow.md

* Update ragflow_quickstart

* Update ragflow_quickstart.md

* Upload RAGFlow quickstart without images

* Update ragflow_quickstart.md

* Update ragflow_quickstart.md

* Update ragflow_quickstart.md

* Update ragflow_quickstart.md

* fix typos in readme

* Fix typos in quickstart readme
2024-06-19 20:00:50 +08:00
Jason Dai
271d82a4fc
Update readme (#11357) 2024-06-19 10:05:42 +08:00
Xiangyu Tian
f6cd628cd8
Fix script usage in vLLM CPU Quickstart (#11353) 2024-06-18 16:50:48 +08:00
Guancheng Fu
c9b4cadd81
fix vLLM/docker issues (#11348)
* fix

* fix

* ffix
2024-06-18 16:23:53 +08:00
hxsz1997
44f22cba70
add config and default value (#11344)
* add config and default value

* add config in taml

* remove lookahead and max_matching_ngram_size in config

* remove streaming and use_fp16_torch_dtype in test yaml

* update task in readme

* update commit of task
2024-06-18 15:28:57 +08:00
Shengsheng Huang
1f39bb84c7
update readthedocs perf data (#11345) 2024-06-18 13:23:47 +08:00
Qiyuan Gong
de4bb97b4f
Remove accelerate 0.23.0 install command in readme and docker (#11333)
*ipex-llm's accelerate has been upgraded to 0.23.0. Remove accelerate 0.23.0 install command in README and docker。
2024-06-17 17:52:12 +08:00
Yuwen Hu
9e4d87a696
Langchain-chatchat QuickStart small link fix (#11317) 2024-06-14 14:02:17 +08:00
Yuwen Hu
bfab294f08
Update langchain-chatchat QuickStart to include Core Ultra iGPU Linux Guide (#11302) 2024-06-13 15:09:55 +08:00
Shengsheng Huang
ea372cc472
update demos section (#11298)
* update demos section

* update format
2024-06-13 11:58:19 +08:00
Jin Qiao
f224e98297
Add GLM-4 CPU example (#11223)
* Add GLM-4 example

* add tiktoken dependency

* fix

* fix
2024-06-12 15:30:51 +08:00
Yuwen Hu
8c36b5bdde
Add qwen2 example (#11252)
* Add GPU example for Qwen2

* Update comments in README

* Update README for Qwen2 GPU example

* Add CPU example for Qwen2

Sample Output under README pending

* Update generate.py and README for CPU Qwen2

* Update GPU example for Qwen2

* Small update

* Small fix

* Add Qwen2 table

* Update README for Qwen2 CPU and GPU

Update sample output under README

---------

Co-authored-by: Zijie Li <michael20001122@gmail.com>
2024-06-07 10:29:33 +08:00
Zijie Li
bfa1367149
Add CPU and GPU example for MiniCPM (#11202)
* Change installation address

Change former address: "https://docs.conda.io/en/latest/miniconda.html#" to new address: "https://conda-forge.org/download/" for 63 occurrences under python\llm\example

* Change Prompt

Change "Anaconda Prompt" to "Miniforge Prompt" for 1 occurrence

* Create and update model minicpm

* Update model minicpm

Update model minicpm under GPU/PyTorch-Models

* Update readme and generate.py

change "prompt = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=False)" and delete "pip install transformers==4.37.0
"

* Update comments for minicpm GPU

Update comments for generate.py at minicpm GPU

* Add CPU example for MiniCPM

* Update minicpm README for CPU

* Update README for MiniCPM and Llama3

* Update Readme for Llama3 CPU Pytorch

* Update and fix comments for MiniCPM
2024-06-05 18:09:53 +08:00