Commit graph

790 commits

Author SHA1 Message Date
Wang, Jian4
a2e1578fd9
Merge tgi_api_server to main (#11036)
* init

* fix style

* speculative can not use benchmark

* add tgi server readme
2024-05-20 09:15:03 +08:00
Yuwen Hu
f60565adc7
Fix toc for vllm serving quickstart (#11068) 2024-05-17 17:12:48 +08:00
Guancheng Fu
dfac168d5f
fix format/typo (#11067) 2024-05-17 16:52:17 +08:00
Guancheng Fu
67db925112
Add vllm quickstart (#10978)
* temp

* add doc

* finish

* done

* fix

* add initial docker readme

* temp

* done fixing vllm_quickstart

* done

* remove not used file

* add

* fix
2024-05-17 16:16:42 +08:00
ZehuaCao
56cb992497
LLM: Modify CPU Installation Command for most examples (#11049)
* init

* refine

* refine

* refine

* modify hf-agent example

* modify all CPU model example

* remove readthedoc modify

* replace powershell with cmd

* fix repo

* fix repo

* update

* remove comment on windows code block

* update

* update

* update

* update

---------

Co-authored-by: xiangyuT <xiangyu.tian@intel.com>
2024-05-17 15:52:20 +08:00
Shaojun Liu
84239d0bd3
Update docker image tags in Docker Quickstart (#11061)
* update docker image tag to latest

* add note

* simplify note

* add link in reStructuredText

* minor fix

* update tag
2024-05-17 11:06:11 +08:00
Xiangyu Tian
d963e95363
LLM: Modify CPU Installation Command for documentation (#11042)
* init

* refine

* refine

* refine

* refine comments
2024-05-17 10:14:00 +08:00
Wang, Jian4
00d4410746
Update cpp docker quickstart (#11040)
* add sample output

* update link

* update

* update header

* update
2024-05-16 14:55:13 +08:00
Ruonan Wang
1d73fc8106
update cpp quickstart (#11031) 2024-05-15 14:33:36 +08:00
Wang, Jian4
86cec80b51
LLM: Add llm inference_cpp_xpu_docker (#10933)
* test_cpp_docker

* update

* update

* update

* update

* add sudo

* update nodejs version

* no need npm

* remove blinker

* new cpp docker

* restore

* add line

* add manually_build

* update and add mtl

* update for workdir llm

* add benchmark part

* update readme

* update 1024-128

* update readme

* update

* fix

* update

* update

* update readme too

* update readme

* no change

* update dir_name

* update readme
2024-05-15 11:10:22 +08:00
Yuwen Hu
c34f85e7d0
[Doc] Simplify installation on Windows for Intel GPU (#11004)
* Simplify GPU installation guide regarding windows Prerequisites

* Update Windows install quickstart on Intel GPU

* Update for llama.cpp quickstart

* Update regarding minimum driver version

* Small fix

* Update based on comments

* Small fix
2024-05-15 09:55:41 +08:00
Shengsheng Huang
0b7e78b592
revise the benchmark part in python inference docker (#11020) 2024-05-14 18:43:41 +08:00
Shengsheng Huang
586a151f9c
update the README and reorganize the docker guides structure. (#11016)
* update the README and reorganize the docker guides structure.

* modified docker install guide into overview
2024-05-14 17:56:11 +08:00
Qiyuan Gong
c957ea3831
Add axolotl main support and axolotl Llama-3-8B QLoRA example (#10984)
* Support axolotl main (796a085).
* Add axolotl Llama-3-8B QLoRA example.
* Change `sequence_len` to 256 for alpaca, and revert `lora_r` value.
* Add example to quick_start.
2024-05-14 13:43:59 +08:00
Shaojun Liu
7f8c5b410b
Quickstart: Run PyTorch Inference on Intel GPU using Docker (on Linux or WSL) (#10970)
* add entrypoint.sh

* add quickstart

* remove entrypoint

* update

* Install related library of benchmarking

* update

* print out results

* update docs

* minor update

* update

* update quickstart

* update

* update

* update

* update

* update

* update

* add chat & example section

* add more details

* minor update

* rename quickstart

* update

* minor update

* update

* update config.yaml

* update readme

* use --gpu

* add tips

* minor update

* update
2024-05-14 12:58:31 +08:00
Ruonan Wang
04d5a900e1
update troubleshooting of llama.cpp (#10990)
* update troubleshooting

* small update
2024-05-13 11:18:38 +08:00
Yuwen Hu
9f6358e4c2
Deprecate support for pytorch 2.0 on Linux for ipex-llm >= 2.1.0b20240511 (#10986)
* Remove xpu_2.0 option in setup.py

* Disable xpu_2.0 test in UT and nightly

* Update docs for deprecated pytorch 2.0

* Small doc update
2024-05-11 12:33:35 +08:00
Ruonan Wang
5e0872073e
add version for llama.cpp and ollama (#10982)
* add version for cpp

* meet review
2024-05-11 09:20:31 +08:00
Ruonan Wang
b7f7d05a7e
update llama.cpp usage of llama3 (#10975)
* update llama.cpp usage of llama3

* fix
2024-05-09 16:44:12 +08:00
Shengsheng Huang
e3159c45e4
update private gpt quickstart and a small fix for dify (#10969) 2024-05-09 13:57:45 +08:00
Shengsheng Huang
11df5f9773
revise private GPT quickstart and a few fixes for other quickstart (#10967) 2024-05-08 21:18:20 +08:00
Keyan (Kyrie) Zhang
37820e1d86
Add privateGPT quickstart (#10932)
* Add privateGPT quickstart

* Update privateGPT_quickstart.md

* Update _toc.yml

* Update _toc.yml

---------

Co-authored-by: Shengsheng Huang <shengsheng.huang@intel.com>
2024-05-08 20:48:00 +08:00
Wang, Jian4
f4c615b1ee
Add cohere example (#10954)
* add link first

* add_cpu_example

* add GPU example
2024-05-08 17:19:59 +08:00
Xiangyu Tian
02870dc385
LLM: Refine README of AutoTP-FastAPI example (#10960) 2024-05-08 16:55:23 +08:00
Qiyuan Gong
164e6957af
Refine axolotl quickstart (#10957)
* Add default accelerate config for axolotl quickstart.
* Fix requirement link.
* Upgrade peft to 0.10.0 in requirement.
2024-05-08 09:34:02 +08:00
hxsz1997
245c7348bc
Add codegemma example (#10884)
* add codegemma example in GPU/HF-Transformers-AutoModels/

* add README of codegemma example in GPU/HF-Transformers-AutoModels/

* add codegemma example in GPU/PyTorch-Models/

* add readme of codegemma example in GPU/PyTorch-Models/

* add codegemma example in CPU/HF-Transformers-AutoModels/

* add readme of codegemma example in CPU/HF-Transformers-AutoModels/

* add codegemma example in CPU/PyTorch-Models/

* add readme of codegemma example in CPU/PyTorch-Models/

* fix typos

* fix filename typo

* add codegemma in tables

* add comments of lm_head

* remove comments of use_cache
2024-05-07 13:35:42 +08:00
Shengsheng Huang
d649236321
make images clickable (#10939) 2024-05-06 20:24:15 +08:00
Shengsheng Huang
64938c2ca7
Dify quickstart revision (#10938)
* revise dify quickstart guide

* update quick links and a small typo
2024-05-06 19:59:17 +08:00
Ruonan Wang
3f438495e4
update llama.cpp and ollama quickstart (#10929) 2024-05-06 15:01:06 +08:00
Wang, Jian4
0e0bd309e2
LLM: Enable Speculative on Fastchat (#10909)
* init

* enable streamer

* update

* update

* remove deprecated

* update

* update

* add gpu example
2024-05-06 10:06:20 +08:00
Zhicun
8379f02a74
Add Dify quickstart (#10903)
* add quick start

* modify

* modify

* add

* add

* resize

* add mp4

* add vedio

* add video

* video

* add

* modify

* add

* modify
2024-05-06 10:01:34 +08:00
Shengsheng Huang
c78a8e3677
update quickstart (#10923) 2024-04-30 18:19:31 +08:00
Shengsheng Huang
282d676561
update continue quickstart (#10922) 2024-04-30 17:51:21 +08:00
Yuwen Hu
71f51ce589
Initial Update for Continue Quickstart with Ollama backend (#10918)
* Initial continue quickstart with ollama backend updates

* Small fix

* Small fix
2024-04-30 15:10:30 +08:00
Jin Qiao
1f876fd837
Add example for phi-3 (#10881)
* Add example for phi-3

* add in readme and index

* fix

* fix

* fix

* fix indent

* fix
2024-04-29 16:43:55 +08:00
Shaojun Liu
d058f2b403
Fix apt install oneapi scripts (#10891)
* Fix apt install oneapi scripts

* add intel-oneapi-mkl-devel

* add apt pkgs
2024-04-26 16:39:37 +08:00
Qiyuan Gong
634726211a
Add video to axolotl quick start (#10870)
* Add video to axolotl quick start.
* Fix wget url.
2024-04-24 16:53:14 +08:00
Zhicun
a017bf2981
add quick start for dify (#10813)
* add quick start

* modify

* modify

* add

* add

* resize

* add mp4

* add vedio

* add video

* video

* add
2024-04-23 16:32:22 +08:00
Qiyuan Gong
bce99a5b00
Minior fix for quick start (#10857)
* Fix typo and space in quick start.
2024-04-23 15:22:01 +08:00
Qiyuan Gong
5eee1976ac
Add Axolotl v0.4.0 quickstart (#10840)
* Add Axolotl v0.4.0 quickstart
2024-04-23 14:57:34 +08:00
Ruonan Wang
2ec45c49d3
fix ollama quickstart(#10846) 2024-04-22 22:04:49 +08:00
Ruonan Wang
c6e868f7ad
update oneapi usage in cpp quickstart (#10836)
* update oneapi usage

* update

* small fix
2024-04-22 11:48:05 +08:00
Ruonan Wang
1edb19c1dd
small fix of cpp quickstart(#10829) 2024-04-22 09:44:08 +08:00
Jason Dai
3cd21d5105
Update readme (#10817) 2024-04-19 22:16:17 +08:00
SONG Ge
197f8dece9
Add open-webui windows document (#10775)
* add windows document

* update

* fix document

* build fix

* update some description

* reorg document structure

* update doc

* re-update to better view

* add reminder for running model on gpus

* update

* remove useless part
2024-04-19 18:06:40 +08:00
Ruonan Wang
a8df429985
QuickStart: Run Llama 3 on Intel GPU using llama.cpp and ollama with IPEX-LLM (#10809)
* initial commit

* update llama.cpp

* add demo video at first

* fix ollama link in readme

* meet review

* update

* small fix
2024-04-19 17:44:59 +08:00
Yuwen Hu
34ff07b689
Add CPU related info to langchain-chatchat quickstart (#10812) 2024-04-19 15:59:51 +08:00
SONG Ge
fbd1743b5e
Ollama quickstart update (#10806)
* add ollama doc for OLLAMA_NUM_GPU

* remove useless params

* revert unexpected changes back

* move env setting to server part

* update
2024-04-19 15:00:25 +08:00
Jason Dai
995c01367d
Update readme (#10802) 2024-04-19 06:52:57 +08:00
Yang Wang
8153c3008e
Initial llama3 example (#10799)
* Add initial hf huggingface GPU example

* Small fix

* Add llama3 gpu pytorch model example

* Add llama 3 hf transformers CPU example

* Add llama 3 pytorch model CPU example

* Fixes

* Small fix

* Small fixes

* Small fix

* Small fix

* Add links

* update repo id

* change prompt tuning url

* remove system header if there is no system prompt

---------

Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>
Co-authored-by: Yuwen Hu <54161268+Oscilloscope98@users.noreply.github.com>
2024-04-18 11:01:33 -07:00
ZehuaCao
a7c12020b4
Add fastchat quickstart (#10688)
* add fastchat quickstart

* update

* update

* update
2024-04-16 14:02:38 +08:00
Ruonan Wang
ea5e46c8cb
Small update of quickstart (#10772) 2024-04-16 10:46:58 +08:00
Yuwen Hu
1abd77507e
Small update for GPU configuration related doc (#10770)
* Small doc fix for dGPU type name

* Further fixes

* Further fix

* Small fix
2024-04-15 18:43:29 +08:00
Ruonan Wang
1bd431976d
Update ollama quickstart (#10756)
* update windows part

* update ollama quickstart

* update ollama

* update

* small fix

* update

* meet review
2024-04-15 16:37:55 +08:00
Kai Huang
47622c6a92
Fix missing export typo in linux quickstart (#10750) 2024-04-15 14:16:40 +08:00
Yuwen Hu
486df2764a
Update gpu configuration (#10760) 2024-04-15 13:27:15 +08:00
Shengsheng Huang
0ccd7bfca9
revise quickstart (#10721) 2024-04-10 14:24:53 +08:00
yb-peng
a81f9e61a6
Revise open_webui_with_ollama_quickstart.md (#10720) 2024-04-10 14:04:13 +08:00
Shengsheng Huang
6e7da0d92c
small fix in document 2024-04-09 23:04:26 +08:00
Shengsheng Huang
8924dbc3f9
revise open webui quickstart and some indexes (#10715)
* update readme

* update openwebui readme and update index
2024-04-09 22:44:03 +08:00
Yuwen Hu
a0244527aa
Small updates to langchain-chatchat quickstart readme (#10714) 2024-04-09 19:37:41 +08:00
Yuwen Hu
fde6ab50d0
Further fix to python 3.11 document (#10712) 2024-04-09 19:13:01 +08:00
yb-peng
447f48499a
Init commit of open-webui quickstart (#10682)
* init commit of open-webui quickstart

* add links into open-webui quickstart

* Update open_webui_with_ollama_quickstart.md
2024-04-09 18:21:42 +08:00
Shaojun Liu
f37a1f2a81
Upgrade to python 3.11 (#10711)
* create conda env with python 3.11

* recommend to use Python 3.11

* update
2024-04-09 17:41:17 +08:00
Jason Dai
3e4fbee87c
Update readme & quickstart (#10685) 2024-04-09 15:59:17 +08:00
yb-peng
8cf26d8d08
Update ollama_quickstart.md (#10708) 2024-04-09 15:47:41 +08:00
Keyan (Kyrie) Zhang
a11b708135
Modify the .md link in chatchat readthedoc (#10681) 2024-04-07 16:33:32 +08:00
Shengsheng Huang
33f90beda0
fix quickstart docs (#10676) 2024-04-07 14:26:59 +08:00
Jason Dai
ab87b6ab21
Update readme (#10669) 2024-04-07 09:13:45 +08:00
Jason Dai
29d97e4678
Update readme (#10665) 2024-04-05 18:01:57 +08:00
Yang Wang
ac65ab65c6
Update llama_cpp_quickstart.md (#10663) 2024-04-04 11:00:50 -07:00
Jason Dai
6699d86192
Update index.rst (#10660) 2024-04-04 20:37:33 +08:00
Shengsheng Huang
22f09f618a
update the video demo (#10655) 2024-04-03 20:51:01 +08:00
Jason Dai
7c08d83d9e
Update quickstart (#10654) 2024-04-03 20:43:22 +08:00
Shengsheng Huang
f84e72e7af
revise ollama quickstart (#10653) 2024-04-03 20:35:34 +08:00
yb-peng
f789c2eee4
add ollama quickstart (#10649)
Co-authored-by: arda <arda@arda-arc12.sh.intel.com>
2024-04-03 19:33:39 +08:00
Shengsheng Huang
1ae519ec69
add langchain-chatchat quickstart (#10652) 2024-04-03 19:23:09 +08:00
Shengsheng Huang
45437ddc9a
update indexes, move some sections in coding quickstart to webui (#10651) 2024-04-03 18:18:49 +08:00
Shengsheng Huang
c26e06d5cf
update coding quickstart and webui quickstart for warmup note (#10650) 2024-04-03 17:18:28 +08:00
Yuwen Hu
5b096c39a6
Change style for video rendering (#10646) 2024-04-03 16:31:02 +08:00
Jin Qiao
cc8b3be11c
Add GPU and CPU example for stablelm-zephyr-3b (#10643)
* Add example for StableLM

* fix

* add to readme
2024-04-03 16:28:31 +08:00
Ovo233
97c626d76f
add continue quickstart (#10610)
Co-authored-by: Shengsheng Huang <shengsheng.huang@intel.com>
2024-04-03 14:50:11 +08:00
Jason Dai
e184c480d2
Update WebUI Quickstart (#10630) 2024-04-02 21:49:19 +08:00
Yuwen Hu
89d780f2e9
Small fix to install guide (#10618) 2024-04-02 11:10:55 +08:00
Shaojun Liu
59058bb206
replace 2.5.0-SNAPSHOT with 2.1.0-SNAPSHOT for llm docker images (#10603) 2024-04-01 09:58:51 +08:00
Yuxuan Xia
856f1ace2b
Add linux 6.5 kernel installation (#10573)
* Add linux 6.5 kernel installation

* Fix linux quick start typo
2024-03-29 16:02:19 +08:00
Yuwen Hu
e6c5a6a5e6
Small style fix in Install Guide (#10581)
* Remove strange bold style

* Small fix
2024-03-28 18:36:17 +08:00
Yuwen Hu
15b8964403
Win install change oneapi to pip installer (#10577)
* Update windows related guide to use pip installer for oneAPI

* Small style fix

* Add oneAPI version

* Update based on comments

* Small fix
2024-03-28 18:22:46 +08:00
Keyan (Kyrie) Zhang
0a2e820c9f
Modify install_linux_gpu.md (#10576) 2024-03-28 13:20:42 +08:00
Cheen Hau, 俊豪
1c5eb14128
Update pip install to use --extra-index-url for ipex package (#10557)
* Change to 'pip install .. --extra-index-url' for readthedocs

* Change to 'pip install .. --extra-index-url' for examples

* Change to 'pip install .. --extra-index-url' for remaining files

* Fix URL for ipex

* Add links for ipex US and CN servers

* Update ipex cpu url

* remove readme

* Update for github actions

* Update for dockerfiles
2024-03-28 09:56:23 +08:00
Kai Huang
e619142a16
Add SYCL_CACHE_PERSISTENT in doc and explain warmup in benchmark quickstart (#10571)
* update doc

* update
2024-03-27 21:03:51 +08:00
Jason Dai
c450c85489
Delete llm/readme.md (#10569) 2024-03-27 20:06:40 +08:00
Jason Dai
08e9aeb31f
Update index.rst 2024-03-27 19:41:19 +08:00
Yuwen Hu
1bae5f40d2
Hide pip installer for windows install (#10568)
* Hide oneAPI install with pip installer for now

* Small fix
2024-03-27 18:41:41 +08:00
Cheen Hau, 俊豪
f239bc329b
Specify oneAPI minor version in documentation (#10561) 2024-03-27 17:58:57 +08:00
Jin Qiao
817ef2d1de
Add verified models in document index (#10546)
* Add verified models in document index

* try to adjust column width

* try to adjust column width

* try to adjust column width

* try to adjust column width

* try replace link

* change to ipex-llm-tutorial

* try use raw html

* adjust table header
2024-03-26 18:25:32 +08:00
Shaojun Liu
2ecd737474
change bigdl-llm-tutorial to ipex-llm-tutorial in README (#10547)
* update bigdl-llm-tutorial to ipex-llm-tutorial

* change to ipex-llm-tutorial
2024-03-26 15:19:53 +08:00
Yuwen Hu
9367db7f2b
Small typo fix (#10535) 2024-03-25 18:48:44 +08:00
Yuwen Hu
c182acef3f
[Doc] Update IPEX-LLM Index Page (#10534)
* Update readthedocs readme before Latest Update

* Update before quick start section in index page

* Update quickstart section

* Further updates for Code Example

* Small fix

* Small fix

* Fix migration guide style
2024-03-25 18:43:32 +08:00
Yuwen Hu
e0ea7b8244
[Doc] IPEX-LLM Doc Layout Update (#10532)
* Fix navigation bar to 1

* Remove unnecessary python api

* Fixed failed langchain native api doc

* Change index page layout

* Update quicklink for IPEX-LLM

* Simplify toc and add bigdl-llm migration guide

* Update readthedocs readme

* Add missing index link for bigdl-llm migration guide

* Update logo image and repo link

* Update copyright

* Small fix

* Update copyright

* Update top nav bar

* Small fix
2024-03-25 16:23:56 +08:00
Shengsheng Huang
de5bbf83de
update linux quickstart and formats of migration (#10530)
* update linux quickstart and formats of migration

* update quickstart

* update format
2024-03-25 15:38:02 +08:00
Jason Dai
5b76f88a8f
Update README.md (#10518) 2024-03-25 13:37:01 +08:00
Shengsheng Huang
d7d0e66b18
move migration guide to quickstart (#10521) 2024-03-25 11:50:49 +08:00
Dongjie Shi
c4dbd21cfc
update readthedocs project name (#10519)
* update readthedocs project name

* update readthedocs project name
2024-03-25 11:44:35 +08:00
Wang, Jian4
16b2ef49c6
Update_document by heyang (#30) 2024-03-25 10:06:02 +08:00
Wang, Jian4
5dc121ee5e
Add guide for running bigdl-example using ipex-llm libs (#28)
* add guide

* update
2024-03-22 17:17:21 +08:00
Wang, Jian4
9df70d95eb
Refactor bigdl.llm to ipex_llm (#24)
* Rename bigdl/llm to ipex_llm

* rm python/llm/src/bigdl

* from bigdl.llm to from ipex_llm
2024-03-22 15:41:21 +08:00
Ruonan Wang
a7da61925f LLM: add windows related info in llama-cpp quickstart (#10505)
* first commit

* update

* add image, update Prerequisites

* small fix
2024-03-22 13:51:14 +08:00
Cheen Hau, 俊豪
a7d38bee94 WebUI quickstart: add instruct chat mode and tested models (#10436)
* Add instruct chat mode and tested models

* Fix table

* Remove falcon from 'tested models'

* Fixes

* Open image in new window
2024-03-21 20:15:32 +08:00
Kai Huang
92ee2077b3 Update Linux Quickstart (#10499)
* fix quick start

* update toc

* expose docker
2024-03-21 20:13:21 +08:00
Ruonan Wang
8d0ea1b9b3 LLM: add initial QuickStart for linux cpp usage (#10418)
* add first version

* update content and add link

* --amend

* update based on new usage

* update usage based on new pr

* temp save

* basic stable version

* change to backend
2024-03-21 17:35:58 +08:00
Yuxuan Xia
3d59c74a0b Linux quick start (#10391)
* Fix Baichuan2 prompt format

* Add linux quick start guide

* Modify the linux installation quick start

* Adjust Linux quick start

* Adjust Linux quick start

* Add linux quick start screenshots

* Revert Baichuan2 changes

* Fix linux quick start typo

* Fix linux quick start typos

* Remove linux quick start downgrade kernel

* Change linux quick start bigdl install

* Modify linux quick start
2024-03-21 16:02:29 +08:00
hxsz1997
158a49986a Add quickstart for install bigdl-llm in docker on windows with Intel GPU (#10421)
* add quickstart for install bigdl in docker on window with Intel GPU

* modify the inference command

* add note of required disk space

* add the issue of iGPU
2024-03-21 15:57:27 +08:00
Shengsheng Huang
e25d7413de add prerequisite section in quickstart (#10460)
* add prerequisite section

* fix typo
2024-03-19 14:24:51 +08:00
Cheen Hau, 俊豪
9880ddfc17 Update WebUI quickstart (#10316)
* Enlarge images and make them clickable to open in new window

* Update text to match image

* Remove image for 'AttributeError' since it does not show the error

* Add note on slower first response

* 'gpu models' -> 'gpu types'
2024-03-13 17:59:55 +08:00
Lilac09
aec83a8be6 Fix user guide indent (#10393) 2024-03-13 09:49:07 +08:00
Jin Qiao
c2fb17bd43 LLM: update quickstart Windows gpu install guide & other quickstart doc style (#10365)
* init

* fix doc style, add modelscope and tutorial

* fix web ui doc style

* add exit way

* fix

* fix modelscope note

* fix according to comment

* fix according to comment

* fix

* fix according to comments

* fix

* fix

* fix

* fix style

* try fix

* fix

* fix

* Small updates

---------

Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>
2024-03-12 18:38:35 +08:00
Lilac09
5809a3f5fe Add run-hbm.sh & add user guide for spr and hbm (#10357)
* add run-hbm.sh

* add spr and hbm guide

* only support quad mode

* only support quad mode

* update special cases

* update special cases
2024-03-12 16:15:27 +08:00
WeiguangHan
cac96b00be LLM: Small fix for benchmark userguide (#10373)
* small fix for benchmark userguide

* resolve some comments
2024-03-12 12:26:26 +08:00
Jason Dai
490cbcc897 Update readme (#10378) 2024-03-12 11:53:03 +08:00
WeiguangHan
f4cef95690 LLM: some slight modification to benchmark user guide (#10347) 2024-03-08 19:43:12 +08:00
Cheen Hau, 俊豪
6829efd350 Change quickstart documentation to use oneapi offline installer (#10350)
* Change to oneapi offline installer

* Fixes

* Add "call"

* Fixes
2024-03-08 19:24:00 +08:00
WeiguangHan
db00e79cdf LLM: add user guide for benchmarking (#10284)
* add user guide for benchmarking

* change the name and place of the benchmark user guide

* resolve some comments

* resolve new comments

* modify some typo

* resolve some new comments

* modify some descriptions
2024-03-07 18:50:29 +08:00
Yuwen Hu
fa69fed58f Small fixes to oneAPI link (#10339) 2024-03-07 09:56:04 +08:00
Yuwen Hu
566e9bbb36 [LLM Doc] Restructure (#10322)
* Add quick link guide to sidebar

* Add QuickStart to TOC

* Update quick links in main page

* Hide some section in More for top nav bar

* Resturct FAQ sections

* Small fix
2024-03-05 14:35:55 +08:00
Xin Qiu
58208a5883 Update FAQ document. (#10300)
* Update install_gpu.md

* Update resolve_error.md

* Update README.md

* Update resolve_error.md

* Update README.md

* Update resolve_error.md
2024-03-04 08:35:11 +08:00
Jason Dai
4cb4db618d Update WebUI quickstart (#10305) 2024-03-03 22:18:26 +08:00
Jason Dai
367b1db4f7 Update readme (#10303) 2024-03-01 17:37:14 +08:00
Shengsheng Huang
1db20dd1d0 add warmup advice in quickstart (#10293) 2024-03-01 17:15:45 +08:00
Xin Qiu
509e206de0 update doc about gemma random and unreadable output. (#10297)
* Update install_gpu.md

* Update README.md

* Update README.md
2024-03-01 15:41:16 +08:00
Shengsheng Huang
90f2f82638 revise webui quickstart (#10287) 2024-03-01 10:04:21 +08:00
Jason Dai
14814abab8 Update README.md (#10286) 2024-02-29 20:00:53 +08:00
Cheen Hau, 俊豪
653cb500ed Add webUI quickstart (#10266)
* Add webUI quickstart

* Add GPU driver install

* Move images to readthedocs assets
2024-02-29 10:08:06 +08:00
Jason Dai
1572b6f7c3 Add quickstart (#10272) 2024-02-29 08:46:43 +08:00
Shengsheng Huang
b88f447974 fix typo and change wording (#10254) 2024-02-27 13:40:51 +08:00
Shengsheng Huang
04a6b0040c Windows GPU Install Quickstart update (#10240)
* Update install_windows_gpu.md

* Update install_windows_gpu.md

* Update install_windows_gpu.md

* fix numbering

* Update install_windows_gpu.md

* Update install_windows_gpu.md
2024-02-27 13:14:39 +08:00
Zhicun
7c236e4c6d quick start for windows with gpu (#10221)
* quick start for windows igpu

* Update install_windows_gpu.md

* Update install_windows_gpu.md

* Update install_windows_gpu.md

* Update install_windows_gpu.md

* Update install_windows_gpu.md

* Update install_windows_gpu.md

* update the demo.py

* Update install_windows_gpu.md

* Update install_windows_gpu.md

* fix image position typo

* Update install_windows_gpu.md

* update pip install command

---------

Co-authored-by: Shengsheng Huang <shannie.huang@gmail.com>
2024-02-26 12:19:36 +08:00
Jason Dai
40584dec6d Update readme (#10214) 2024-02-23 11:42:16 +08:00
Jason Dai
84d5f40936 Update README.md (#10213) 2024-02-22 17:22:59 +08:00
Yuwen Hu
94cb16fe40 [LLM] Small updates to Win GPU Install Doc (#10199)
* Make Offline installer as default for win gpu doc for oneAPI

* Small other fixes
2024-02-21 17:58:40 +08:00
Jason Dai
4655005f24 Update README (#10186) 2024-02-21 16:35:52 +08:00
hxsz1997
6e10d98a8d Fix some typos (#10175)
* add llm-ppl workflow

* update the DATASET_DIR

* test multiple precisions

* modify nightly test

* match the updated ppl code

* add matrix.include

* fix the include error

* update the include

* add more model

* update the precision of include

* update nightly time and add more models

* fix the workflow_dispatch description, change default model of pr and modify the env

* modify workflow_dispatch language options

* modify options

* modify language options

* modeify workflow_dispatch type

* modify type

* modify the type of language

* change seq_len type

* fix some typos

* revert changes to stress_test.txt
2024-02-20 14:14:53 +08:00
Cheen Hau, 俊豪
6952847f68 GPU install doc - add pip install oneAPI for windows (#10157)
* Add instructions for pip install oneAPI for windows

* Improve clarity

* Format fix

* Fix

* Fix in runtime configuration
2024-02-19 14:46:08 +08:00
Kai Huang
7400401706 Update gpu pip install oneapi doc (#10137)
* fix link

* fix

* fix

* minor
2024-02-09 11:27:40 +08:00
Cheen Hau, 俊豪
a7f9a13f6e Enhance gpu doc with PIP install oneAPI (#10109)
* Add pip install oneapi instructions

* Fixes

* Add instruction for oneapi2023

* Runtime config

* Fixes

* Remove "Currently, oneAPI installed with .. "

* Add pip package version for oneAPI 2024

* Reviewer comments

* Fix errors
2024-02-07 21:14:15 +08:00
binbin Deng
c1ec3d8921 LLM: update FAQ about too many open files (#10119) 2024-02-07 15:02:24 +08:00
Jason Dai
e2233dddef Update README (#10111) 2024-02-06 19:29:07 +08:00
Jason Dai
f440cb4fba Update Self-Speculative Decoding Readme (#10102) 2024-02-06 12:59:17 +08:00
binbin Deng
91cf9d41d0 LLM: add solutions of some frequently asked questions (#10068) 2024-02-04 09:28:20 +08:00
Jason Dai
2927c77d7f Update readme (#10071) 2024-02-01 20:40:20 -08:00