Shengsheng Huang
13727635e8
revise ragflow quickstart ( #11363 )
...
* revise ragflow quickstart
* update titles and split the quickstart into sections
* update
2024-06-19 22:24:31 +08:00
Zijie Li
5283df0078
LLM: Add RAGFlow with Ollama Example QuickStart ( #11338 )
...
* Create ragflow.md
* Update ragflow.md
* Update ragflow_quickstart
* Update ragflow_quickstart.md
* Upload RAGFlow quickstart without images
* Update ragflow_quickstart.md
* Update ragflow_quickstart.md
* Update ragflow_quickstart.md
* Update ragflow_quickstart.md
* fix typos in readme
* Fix typos in quickstart readme
2024-06-19 20:00:50 +08:00
Guancheng Fu
c9b4cadd81
fix vLLM/docker issues ( #11348 )
...
* fix
* fix
* ffix
2024-06-18 16:23:53 +08:00
hxsz1997
44f22cba70
add config and default value ( #11344 )
...
* add config and default value
* add config in taml
* remove lookahead and max_matching_ngram_size in config
* remove streaming and use_fp16_torch_dtype in test yaml
* update task in readme
* update commit of task
2024-06-18 15:28:57 +08:00
Qiyuan Gong
de4bb97b4f
Remove accelerate 0.23.0 install command in readme and docker ( #11333 )
...
*ipex-llm's accelerate has been upgraded to 0.23.0. Remove accelerate 0.23.0 install command in README and docker。
2024-06-17 17:52:12 +08:00
Yuwen Hu
9e4d87a696
Langchain-chatchat QuickStart small link fix ( #11317 )
2024-06-14 14:02:17 +08:00
Yuwen Hu
bfab294f08
Update langchain-chatchat QuickStart to include Core Ultra iGPU Linux Guide ( #11302 )
2024-06-13 15:09:55 +08:00
Guancheng Fu
3ef4aa98d1
Refine vllm_quickstart doc ( #11199 )
...
* refine doc
* refine
2024-06-04 18:46:27 +08:00
Yuwen Hu
f0aaa130a9
Update miniconda/anaconda -> miniforge in documentation ( #11176 )
...
* Update miniconda/anaconda -> miniforge in installation guide
* Update for all Quickstart
* further fix for docs
2024-05-30 17:40:18 +08:00
Ruonan Wang
83bd9cb681
add new version for cpp quickstart and keep an old version ( #11151 )
...
* add new version
* meet review
2024-05-28 15:29:34 +08:00
Guancheng Fu
daf7b1cd56
[Docker] Fix image using two cards error ( #11144 )
...
* fix all
* done
2024-05-27 16:20:13 +08:00
Xiangyu Tian
b3f6faa038
LLM: Add CPU vLLM entrypoint ( #11083 )
...
Add CPU vLLM entrypoint and update CPU vLLM serving example.
2024-05-24 09:16:59 +08:00
Zhao Changmin
15d906a97b
Update linux igpu run script ( #11098 )
...
* update run script
2024-05-22 17:18:07 +08:00
Zhao Changmin
bf0f904e66
Update level_zero on MTL linux ( #11085 )
...
* Update level_zero on MTL
---------
Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>
2024-05-22 11:01:56 +08:00
Guancheng Fu
f654f7e08c
Add serving docker quickstart ( #11072 )
...
* add temp file
* add initial docker readme
* temp
* done
* add fastchat service
* fix
* fix
* fix
* fix
* remove stale file
2024-05-21 17:00:58 +08:00
binbin Deng
7170dd9192
Update guide for running qwen with AutoTP ( #11065 )
2024-05-20 10:53:17 +08:00
Wang, Jian4
a2e1578fd9
Merge tgi_api_server to main ( #11036 )
...
* init
* fix style
* speculative can not use benchmark
* add tgi server readme
2024-05-20 09:15:03 +08:00
Guancheng Fu
dfac168d5f
fix format/typo ( #11067 )
2024-05-17 16:52:17 +08:00
Guancheng Fu
67db925112
Add vllm quickstart ( #10978 )
...
* temp
* add doc
* finish
* done
* fix
* add initial docker readme
* temp
* done fixing vllm_quickstart
* done
* remove not used file
* add
* fix
2024-05-17 16:16:42 +08:00
Wang, Jian4
00d4410746
Update cpp docker quickstart ( #11040 )
...
* add sample output
* update link
* update
* update header
* update
2024-05-16 14:55:13 +08:00
Ruonan Wang
1d73fc8106
update cpp quickstart ( #11031 )
2024-05-15 14:33:36 +08:00
Wang, Jian4
86cec80b51
LLM: Add llm inference_cpp_xpu_docker ( #10933 )
...
* test_cpp_docker
* update
* update
* update
* update
* add sudo
* update nodejs version
* no need npm
* remove blinker
* new cpp docker
* restore
* add line
* add manually_build
* update and add mtl
* update for workdir llm
* add benchmark part
* update readme
* update 1024-128
* update readme
* update
* fix
* update
* update
* update readme too
* update readme
* no change
* update dir_name
* update readme
2024-05-15 11:10:22 +08:00
Yuwen Hu
c34f85e7d0
[Doc] Simplify installation on Windows for Intel GPU ( #11004 )
...
* Simplify GPU installation guide regarding windows Prerequisites
* Update Windows install quickstart on Intel GPU
* Update for llama.cpp quickstart
* Update regarding minimum driver version
* Small fix
* Update based on comments
* Small fix
2024-05-15 09:55:41 +08:00
Shengsheng Huang
0b7e78b592
revise the benchmark part in python inference docker ( #11020 )
2024-05-14 18:43:41 +08:00
Shengsheng Huang
586a151f9c
update the README and reorganize the docker guides structure. ( #11016 )
...
* update the README and reorganize the docker guides structure.
* modified docker install guide into overview
2024-05-14 17:56:11 +08:00
Qiyuan Gong
c957ea3831
Add axolotl main support and axolotl Llama-3-8B QLoRA example ( #10984 )
...
* Support axolotl main (796a085).
* Add axolotl Llama-3-8B QLoRA example.
* Change `sequence_len` to 256 for alpaca, and revert `lora_r` value.
* Add example to quick_start.
2024-05-14 13:43:59 +08:00
Shaojun Liu
7f8c5b410b
Quickstart: Run PyTorch Inference on Intel GPU using Docker (on Linux or WSL) ( #10970 )
...
* add entrypoint.sh
* add quickstart
* remove entrypoint
* update
* Install related library of benchmarking
* update
* print out results
* update docs
* minor update
* update
* update quickstart
* update
* update
* update
* update
* update
* update
* add chat & example section
* add more details
* minor update
* rename quickstart
* update
* minor update
* update
* update config.yaml
* update readme
* use --gpu
* add tips
* minor update
* update
2024-05-14 12:58:31 +08:00
Ruonan Wang
04d5a900e1
update troubleshooting of llama.cpp ( #10990 )
...
* update troubleshooting
* small update
2024-05-13 11:18:38 +08:00
Ruonan Wang
5e0872073e
add version for llama.cpp and ollama ( #10982 )
...
* add version for cpp
* meet review
2024-05-11 09:20:31 +08:00
Ruonan Wang
b7f7d05a7e
update llama.cpp usage of llama3 ( #10975 )
...
* update llama.cpp usage of llama3
* fix
2024-05-09 16:44:12 +08:00
Shengsheng Huang
e3159c45e4
update private gpt quickstart and a small fix for dify ( #10969 )
2024-05-09 13:57:45 +08:00
Shengsheng Huang
11df5f9773
revise private GPT quickstart and a few fixes for other quickstart ( #10967 )
2024-05-08 21:18:20 +08:00
Keyan (Kyrie) Zhang
37820e1d86
Add privateGPT quickstart ( #10932 )
...
* Add privateGPT quickstart
* Update privateGPT_quickstart.md
* Update _toc.yml
* Update _toc.yml
---------
Co-authored-by: Shengsheng Huang <shengsheng.huang@intel.com>
2024-05-08 20:48:00 +08:00
Xiangyu Tian
02870dc385
LLM: Refine README of AutoTP-FastAPI example ( #10960 )
2024-05-08 16:55:23 +08:00
Qiyuan Gong
164e6957af
Refine axolotl quickstart ( #10957 )
...
* Add default accelerate config for axolotl quickstart.
* Fix requirement link.
* Upgrade peft to 0.10.0 in requirement.
2024-05-08 09:34:02 +08:00
Shengsheng Huang
d649236321
make images clickable ( #10939 )
2024-05-06 20:24:15 +08:00
Shengsheng Huang
64938c2ca7
Dify quickstart revision ( #10938 )
...
* revise dify quickstart guide
* update quick links and a small typo
2024-05-06 19:59:17 +08:00
Ruonan Wang
3f438495e4
update llama.cpp and ollama quickstart ( #10929 )
2024-05-06 15:01:06 +08:00
Wang, Jian4
0e0bd309e2
LLM: Enable Speculative on Fastchat ( #10909 )
...
* init
* enable streamer
* update
* update
* remove deprecated
* update
* update
* add gpu example
2024-05-06 10:06:20 +08:00
Zhicun
8379f02a74
Add Dify quickstart ( #10903 )
...
* add quick start
* modify
* modify
* add
* add
* resize
* add mp4
* add vedio
* add video
* video
* add
* modify
* add
* modify
2024-05-06 10:01:34 +08:00
Shengsheng Huang
c78a8e3677
update quickstart ( #10923 )
2024-04-30 18:19:31 +08:00
Shengsheng Huang
282d676561
update continue quickstart ( #10922 )
2024-04-30 17:51:21 +08:00
Yuwen Hu
71f51ce589
Initial Update for Continue Quickstart with Ollama backend ( #10918 )
...
* Initial continue quickstart with ollama backend updates
* Small fix
* Small fix
2024-04-30 15:10:30 +08:00
Shaojun Liu
d058f2b403
Fix apt install oneapi scripts ( #10891 )
...
* Fix apt install oneapi scripts
* add intel-oneapi-mkl-devel
* add apt pkgs
2024-04-26 16:39:37 +08:00
Qiyuan Gong
634726211a
Add video to axolotl quick start ( #10870 )
...
* Add video to axolotl quick start.
* Fix wget url.
2024-04-24 16:53:14 +08:00
Zhicun
a017bf2981
add quick start for dify ( #10813 )
...
* add quick start
* modify
* modify
* add
* add
* resize
* add mp4
* add vedio
* add video
* video
* add
2024-04-23 16:32:22 +08:00
Qiyuan Gong
bce99a5b00
Minior fix for quick start ( #10857 )
...
* Fix typo and space in quick start.
2024-04-23 15:22:01 +08:00
Qiyuan Gong
5eee1976ac
Add Axolotl v0.4.0 quickstart ( #10840 )
...
* Add Axolotl v0.4.0 quickstart
2024-04-23 14:57:34 +08:00
Ruonan Wang
2ec45c49d3
fix ollama quickstart( #10846 )
2024-04-22 22:04:49 +08:00
Ruonan Wang
c6e868f7ad
update oneapi usage in cpp quickstart ( #10836 )
...
* update oneapi usage
* update
* small fix
2024-04-22 11:48:05 +08:00