ipex-llm

Author	SHA1	Message	Date
Yuwen Hu	f60565adc7	Fix toc for vllm serving quickstart (#11068 )	2024-05-17 17:12:48 +08:00
Wang, Jian4	00d4410746	Update cpp docker quickstart (#11040 ) * add sample output * update link * update * update header * update	2024-05-16 14:55:13 +08:00
Wang, Jian4	86cec80b51	LLM: Add llm inference_cpp_xpu_docker (#10933 ) * test_cpp_docker * update * update * update * update * add sudo * update nodejs version * no need npm * remove blinker * new cpp docker * restore * add line * add manually_build * update and add mtl * update for workdir llm * add benchmark part * update readme * update 1024-128 * update readme * update * fix * update * update * update readme too * update readme * no change * update dir_name * update readme	2024-05-15 11:10:22 +08:00
Shengsheng Huang	0b7e78b592	revise the benchmark part in python inference docker (#11020 )	2024-05-14 18:43:41 +08:00
Shengsheng Huang	586a151f9c	update the README and reorganize the docker guides structure. (#11016 ) * update the README and reorganize the docker guides structure. * modified docker install guide into overview	2024-05-14 17:56:11 +08:00
Shengsheng Huang	11df5f9773	revise private GPT quickstart and a few fixes for other quickstart (#10967 )	2024-05-08 21:18:20 +08:00
Keyan (Kyrie) Zhang	37820e1d86	Add privateGPT quickstart (#10932 ) * Add privateGPT quickstart * Update privateGPT_quickstart.md * Update _toc.yml * Update _toc.yml --------- Co-authored-by: Shengsheng Huang <shengsheng.huang@intel.com>	2024-05-08 20:48:00 +08:00
Xiangyu Tian	02870dc385	LLM: Refine README of AutoTP-FastAPI example (#10960 )	2024-05-08 16:55:23 +08:00
Qiyuan Gong	5eee1976ac	Add Axolotl v0.4.0 quickstart (#10840 ) * Add Axolotl v0.4.0 quickstart	2024-04-23 14:57:34 +08:00
Ruonan Wang	a8df429985	QuickStart: Run Llama 3 on Intel GPU using llama.cpp and ollama with IPEX-LLM (#10809 ) * initial commit * update llama.cpp * add demo video at first * fix ollama link in readme * meet review * update * small fix	2024-04-19 17:44:59 +08:00
ZehuaCao	a7c12020b4	Add fastchat quickstart (#10688 ) * add fastchat quickstart * update * update * update	2024-04-16 14:02:38 +08:00
Shengsheng Huang	8924dbc3f9	revise open webui quickstart and some indexes (#10715 ) * update readme * update openwebui readme and update index	2024-04-09 22:44:03 +08:00
Shengsheng Huang	f84e72e7af	revise ollama quickstart (#10653 )	2024-04-03 20:35:34 +08:00
Shengsheng Huang	1ae519ec69	add langchain-chatchat quickstart (#10652 )	2024-04-03 19:23:09 +08:00
Shengsheng Huang	45437ddc9a	update indexes, move some sections in coding quickstart to webui (#10651 )	2024-04-03 18:18:49 +08:00
Yuwen Hu	c182acef3f	[Doc] Update IPEX-LLM Index Page (#10534 ) * Update readthedocs readme before Latest Update * Update before quick start section in index page * Update quickstart section * Further updates for Code Example * Small fix * Small fix * Fix migration guide style	2024-03-25 18:43:32 +08:00
Yuwen Hu	e0ea7b8244	[Doc] IPEX-LLM Doc Layout Update (#10532 ) * Fix navigation bar to 1 * Remove unnecessary python api * Fixed failed langchain native api doc * Change index page layout * Update quicklink for IPEX-LLM * Simplify toc and add bigdl-llm migration guide * Update readthedocs readme * Add missing index link for bigdl-llm migration guide * Update logo image and repo link * Update copyright * Small fix * Update copyright * Update top nav bar * Small fix	2024-03-25 16:23:56 +08:00
Kai Huang	92ee2077b3	Update Linux Quickstart (#10499 ) * fix quick start * update toc * expose docker	2024-03-21 20:13:21 +08:00
Ruonan Wang	8d0ea1b9b3	LLM: add initial QuickStart for linux cpp usage (#10418 ) * add first version * update content and add link * --amend * update based on new usage * update usage based on new pr * temp save * basic stable version * change to backend	2024-03-21 17:35:58 +08:00
WeiguangHan	db00e79cdf	LLM: add user guide for benchmarking (#10284 ) * add user guide for benchmarking * change the name and place of the benchmark user guide * resolve some comments * resolve new comments * modify some typo * resolve some new comments * modify some descriptions	2024-03-07 18:50:29 +08:00
Yuwen Hu	566e9bbb36	[LLM Doc] Restructure (#10322 ) * Add quick link guide to sidebar * Add QuickStart to TOC * Update quick links in main page * Hide some section in More for top nav bar * Resturct FAQ sections * Small fix	2024-03-05 14:35:55 +08:00
binbin Deng	4b92235bdb	LLM: add initial FAQ page (#10055 )	2024-02-01 09:43:39 +08:00
Xin Qiu	0f9a440b06	doc for Multi gpu selection (#9414 )	2023-11-20 09:25:58 +08:00
binbin Deng	7e96d3e79a	LLM: improve gpu supports key feature doc page (#9212 )	2023-10-19 18:40:48 +08:00
binbin Deng	760183bac6	LLM: update key feature and installation page of document (#9068 )	2023-09-27 15:44:34 +08:00
Yuwen Hu	cf6a620bae	[LLM] BigDL-LLM Documentation Initial Version (#8833 ) * Change order of LLM in header * Some updates to footer * Add BigDL-LLM index page and basic file structure * Update index page for key features * Add initial content for BigDL-LLM in 5 mins * Improvement to footnote * Add initial contents based on current contents we have * Add initial quick links * Small fix * Rename file * Hide cli section for now and change model supports to examples * Hugging Face format -> Hugging Face transformers format * Add placeholder for GPU supports * Add GPU related content structure * Add cpu/gpu installation initial contents * Add initial contents for GPU supports * Add image link to LLM index page * Hide tips and known issues for now * Small fix * Update based on comments * Small fix * Add notes for Python 3.9 * Add placehoder optimize model & reveal CLI; small revision * examples add gpu part * Hide CLI part again for first version of merging * add keyfeatures-optimize_model part (#1) * change gif link to the ones hosted on github * Small fix --------- Co-authored-by: plusbang <binbin1.deng@intel.com> Co-authored-by: binbin Deng <108676127+plusbang@users.noreply.github.com>	2023-09-06 15:38:45 +08:00
Kai Huang	1b65288bdb	Add api doc for LLM (#8605 ) * api doc initial * update desc	2023-08-08 18:17:16 +08:00
Heyang Sun	5ad5ac5356	LightGBM Spark Readthedocs (#8381 ) * LightGBM Spark Readthedocs * Update secure_lightgbm_on_spark.md * Update secure_lightgbm_on_spark.md * Update secure_lightgbm_on_spark.md * refine * Update secure_lightgbm_on_spark.md * add html link * Update secure_lightgbm_on_spark.md * Update secure_lightgbm_on_spark.md * add in yaml	2023-06-26 13:14:40 +08:00
Pingchuan Ma (Henry)	30367f5eb1	[Nano] Add how-to-guide for pytorch async pipeline (#8146 ) * add how-to-guide for pytorch async pipeline * revise introduction * resolve image issues	2023-05-06 22:15:42 +08:00
Pingchuan Ma (Henry)	c6eccbfcc2	[Nano] add pt dgpu inference how-to-guide (#8026 ) * docs for arc dgpu how-to-guide * minor adjustment + system info * minor adjustment for appearance * fix bugs * add system info * fix syntax errors * adjust docs according to comments * final adjustment * delete gpu workflow testing	2023-04-12 19:18:16 +08:00
Junwei Deng	2daaa6f7de	Nano: TF multi process how-to for customized training loop (#8006 ) * add how-to guide * add overview * fix doc * fix pep8 * update the notebook	2023-04-07 17:19:47 +08:00
Junwei Deng	a6f96521af	Chronos: AIOps User Guide (#7960 ) * add aiops doc * add image * fix typo	2023-03-31 10:15:00 +08:00
Qiyuan Gong	bcfcb3f12b	[PPML] Alibaba ECS Occlum CN Develop guide (#7297 ) * Alibaba ECS Occlum CN Develop guide with Occlum * Refine examples and style based on comments * Add to readthedocs	2023-02-17 14:06:41 +08:00
Yuwen Hu	6960c87b54	[Nano] Revise outdated info in PyTorch Inference how-to guides (#7453 ) * Change order of PyTorch Inference how-to guides order * Revise accelerate_pytorch_inference_jit_ipex.ipynb * Revise accelerat_pytorch_inference_onnx.ipynb * Revise accelerate_pytorch_inference_openvino.ipynb * Revise inference_optimizer_optimize.ipynb * Revise multi_instance_pytorch_inference.ipynb * Revise pytorch_context_manager.ipynb * Revise pytorch_save_and_load_ipex.ipynb and other small fixes * Revise pytorch_save_and_load_jit.ipynb and other small fixes * Revise pytorch_save_and_load_onnx.ipynb and other small fixese * Revise pytorch_save_and_load_openvino.ipynb * Revise quantize_pytorch_inference_inc.ipynb and other small fixes * Revise quantize_pytorch_inference_pot.ipynb and other small fixes * Fix failed links in the index page * Small fixes * Update based on comments	2023-02-09 17:50:50 +08:00
Yishuo Wang	f4700422e1	Nano: add OS support table (#7429 )	2023-02-06 18:01:31 +08:00
Yuwen Hu	c31136df0b	[Nano] Improve How-to Guides Navigations (#7396 ) * Remove deprecated option enable_auto_doc_ref for recommonmark * Add first level navigation structure for Nano how-to guides * Update navigation for How-to Training part * Update navigation for How-to Inference part * Update navigation for How-to Preprocessing/Install part and other small fixes * Fix wrong link path caused by position changes of how-to install related guides * Small fix	2023-02-03 09:37:10 +08:00
Yuwen Hu	c7674f52b2	[Nano] Add how-to guide for TensorFlow Keras inference using bf16 mixed precision (#7337 ) * Add basic structure for tf bf16 infer how-to guide * Add how-to use BFloat16 mixed precision for TensorFlow Keras inference * Small fixes * Add instruction requires box * Add outputs to show inference time diff * Small fixes regarding hardware requirements and others * Update based on comments * Small fixes * Small fixes	2023-02-01 13:33:54 +08:00
Junwei Deng	08a556239c	Chronos: add how to guide for forecaster.optimize (#7387 )	2023-02-01 11:39:27 +08:00
Sirui Tao	3543a58723	[Nano] How-to Guide: Inference via Openvino on Intel GPUs (#7212 ) * add openvino gpu inference guide * enable online doc for how-to * fix * fix layout error * update w.r.t. comments * fix * fix * fix error	2023-01-12 09:31:12 +08:00
Henry Ma	2858a1b5bf	[Nano] Add how-to-guide of load/save API for tensorflow inference (#7180 ) * feat(docs): add load/save onnx and opnevino model for tensorflow * fix bugs after previewing * fix order issues of insertion for toc.yml * change link title for tensorflow	2023-01-10 20:15:49 +08:00
Sirui Tao	d950992b91	[Nano] How-To Guides: Accelerate PyTorch Training with IPEX & Multi-instance & BF16 & Channels last (#7035 ) * add pytorch training ipex guide * add pytorch training multi-instance guide * add bf16 guide * small changes of presentation * add channels last guide * remove validation loader * hide code block * update based on comments * add guide for reference * update guides w.r.t. comments	2023-01-10 13:14:47 +08:00
Hu, Zhaojie	6ca8f8567f	Chronos: Add how to guide for export_jit (#7194 ) * add how to guide for export_jit * update * update * update code example	2023-01-09 14:05:49 +08:00
Yuwen Hu	5e9ef7b553	[Nano] Add a generalized how-to guide for accelerate PyTorch cv data process pipeline (#7125 ) * Restyle blockquote elements in web * Add a generalized how-to section for preprocessing, including the data process accelerastion for PyTorch * Small fix * Update based on comments and small typo fixes * Small fixes	2023-01-05 18:07:10 +08:00
Yuwen Hu	bfca337d09	[Nano] Add how-to guide for TensorFlow Keras training using bf16 (#7169 ) * Add basic doc structure for bf16 tf training how-to guide, and change the incorrect order of tf inference guides in toc * Add how-to guide for tf bf16 training * Add warning box for tf bf16 hardware limitations * Add a print message to show the default policy of model after unpatch * Small fixes * Small github action fixes for tf bf16 training how-to guide * diable action test for tf bf16 train for now, due to the core dumped problem on platforms without AVX512 * Updated based on comments	2023-01-05 17:59:43 +08:00
Henry Ma	18ae409b7c	[Nano] add how-to-guide for tensorflow inference by onnxruntime and openvino (#7149 ) * Feat(docs): add how-to-guide for tensorflow inference by onnxruntime and openvino * fix bugs for index.rst * revise according to PR comments * revise minor parts according to PR comments * revise bugs according to PR comments	2023-01-04 23:43:48 +08:00
Yishuo Wang	ab9a3b69fb	Nano: Add CUDA patch key feature (#7089 )	2023-01-04 10:53:27 +08:00
Junwei Deng	56e94eda59	Chronos: add forecaster alg choose guide and some cleaning for how to guide (#7139 ) * fix format * fix * update * update	2023-01-03 15:00:27 +08:00
Kai Huang	264451c6bd	Merge Orca quick starts and how to (#7133 ) * add tf2 to howto * update tf2 * remove * modify sidebar * remove quickstart * minor	2022-12-30 16:05:15 +08:00
Yuwen Hu	37b0c6fa50	[Nano] How-To Guides: Convert to TorchNano & use @nano decorator (#5984 ) * Add how to guide: How to convert your PyTorch code to use TorchNano for training acceleration * Small nano how-to index format update for openvino inference * Update based on comments * Updated based on comments * Add how-to guide: How to wrap a PyTorch training loop through @nano decorator * Add reference to TorchNano guide in @nano guide * Some small fixes and updates * Small typo fix: bulit -> built * Updates based on comments * Remove validation dataloader based on comments * Order change of two guides * Update based on comments	2022-12-26 15:36:53 +08:00
Ruonan Wang	14cf4aff72	Nano: Troubshooting guide for BigDL-Nano (#6910 ) * add troubshooting guide * add more cases * update style * update * update style * update two cases * update * add two issue	2022-12-26 10:02:44 +08:00

1 2

84 commits