ipex-llm

Author	SHA1	Message	Date
Shaojun Liu	bb9be70105	replace bigdl-llm with ipex-llm (#10545 )	2024-03-26 15:12:38 +08:00
Shaojun Liu	c563b41491	add nightly_build workflow (#10533 ) * add nightly_build workflow * add create-job-status-badge action * update * update * update * update setup.py * release * revert	2024-03-26 12:47:38 +08:00
binbin Deng	0a3e4e788f	LLM: fix mistral hidden_size setting for deepspeed autotp (#10527 )	2024-03-26 10:55:44 +08:00
Xin Qiu	1dd40b429c	enable fp4 fused mlp and qkv (#10531 ) * enable fp4 fused mlp and qkv * update qwen * update qwen2	2024-03-26 08:34:00 +08:00
Yuwen Hu	9367db7f2b	Small typo fix (#10535 )	2024-03-25 18:48:44 +08:00
Yuwen Hu	c182acef3f	[Doc] Update IPEX-LLM Index Page (#10534 ) * Update readthedocs readme before Latest Update * Update before quick start section in index page * Update quickstart section * Further updates for Code Example * Small fix * Small fix * Fix migration guide style	2024-03-25 18:43:32 +08:00
Shaojun Liu	93e6804bfe	update nightly test (#10520 ) * trigger nightly test * trigger perf test * update bigdl-llm to ipex-llm * revert	2024-03-25 18:22:05 +08:00
Yuwen Hu	e0ea7b8244	[Doc] IPEX-LLM Doc Layout Update (#10532 ) * Fix navigation bar to 1 * Remove unnecessary python api * Fixed failed langchain native api doc * Change index page layout * Update quicklink for IPEX-LLM * Simplify toc and add bigdl-llm migration guide * Update readthedocs readme * Add missing index link for bigdl-llm migration guide * Update logo image and repo link * Update copyright * Small fix * Update copyright * Update top nav bar * Small fix	2024-03-25 16:23:56 +08:00
Shengsheng Huang	de5bbf83de	update linux quickstart and formats of migration (#10530 ) * update linux quickstart and formats of migration * update quickstart * update format	2024-03-25 15:38:02 +08:00
Shaojun Liu	c8a1c76304	Update README.md (#10524 )	2024-03-25 13:47:45 +08:00
Jason Dai	5b76f88a8f	Update README.md (#10518 )	2024-03-25 13:37:01 +08:00
Shengsheng Huang	d7d0e66b18	move migration guide to quickstart (#10521 )	2024-03-25 11:50:49 +08:00
Dongjie Shi	c4dbd21cfc	update readthedocs project name (#10519 ) * update readthedocs project name * update readthedocs project name	2024-03-25 11:44:35 +08:00
Wang, Jian4	16b2ef49c6	Update_document by heyang (#30 )	2024-03-25 10:06:02 +08:00
Wang, Jian4	e2d25de17d	Update_docker by heyang (#29 )	2024-03-25 10:05:46 +08:00
Wang, Jian4	5dc121ee5e	Add guide for running bigdl-example using ipex-llm libs (#28 ) * add guide * update	2024-03-22 17:17:21 +08:00
Wang, Jian4	a1048ca7f6	Update setup.py and add new actions and add compatible mode (#25 ) * update setup.py * add new action * add compatible mode	2024-03-22 15:44:59 +08:00
Wang, Jian4	9df70d95eb	Refactor bigdl.llm to ipex_llm (#24 ) * Rename bigdl/llm to ipex_llm * rm python/llm/src/bigdl * from bigdl.llm to from ipex_llm	2024-03-22 15:41:21 +08:00
Jin Qiao	cc5806f4bc	LLM: add save/load example for hf-transformers (#10432 )	2024-03-22 13:57:47 +08:00
Ruonan Wang	a7da61925f	LLM: add windows related info in llama-cpp quickstart (#10505 ) * first commit * update * add image, update Prerequisites * small fix	2024-03-22 13:51:14 +08:00
Wang, Jian4	34d0a9328c	LLM: Speed-up mixtral in pipeline parallel inference (#10472 ) * speed-up mixtral * fix style	2024-03-22 11:06:28 +08:00
Cengguang Zhang	b9d4280892	LLM: fix baichuan7b quantize kv abnormal output. (#10504 ) * fix abnormal output. * fix style. * fix style.	2024-03-22 10:00:08 +08:00
Yishuo Wang	f0f317b6cf	fix a typo in yuan (#10503 )	2024-03-22 09:40:04 +08:00
Cheen Hau, 俊豪	a7d38bee94	WebUI quickstart: add instruct chat mode and tested models (#10436 ) * Add instruct chat mode and tested models * Fix table * Remove falcon from 'tested models' * Fixes * Open image in new window	2024-03-21 20:15:32 +08:00
Kai Huang	92ee2077b3	Update Linux Quickstart (#10499 ) * fix quick start * update toc * expose docker	2024-03-21 20:13:21 +08:00
Guancheng Fu	3a3756b51d	Add FastChat bigdl_worker (#10493 ) * done * fix format * add licence * done * fix doc * refactor folder * add license	2024-03-21 18:35:05 +08:00
Ruonan Wang	8d0ea1b9b3	LLM: add initial QuickStart for linux cpp usage (#10418 ) * add first version * update content and add link * --amend * update based on new usage * update usage based on new pr * temp save * basic stable version * change to backend	2024-03-21 17:35:58 +08:00
Xin Qiu	dba7ddaab3	add sdp fp8 for qwen llama436 baichuan mistral baichuan2 (#10485 ) * add sdp fp8 * fix style * fix qwen * fix baichuan 13 * revert baichuan 13b and baichuan2-13b * fix style * update	2024-03-21 17:23:05 +08:00
Kai Huang	30f111cd32	lm_head empty_cache for more models (#10490 ) * modify constraint * fix style	2024-03-21 17:11:43 +08:00
Yuwen Hu	1579ee4421	[LLM] Add nightly igpu perf test for INT4+FP16 1024-128 (#10496 )	2024-03-21 16:07:06 +08:00
Yuxuan Xia	3d59c74a0b	Linux quick start (#10391 ) * Fix Baichuan2 prompt format * Add linux quick start guide * Modify the linux installation quick start * Adjust Linux quick start * Adjust Linux quick start * Add linux quick start screenshots * Revert Baichuan2 changes * Fix linux quick start typo * Fix linux quick start typos * Remove linux quick start downgrade kernel * Change linux quick start bigdl install * Modify linux quick start	2024-03-21 16:02:29 +08:00
binbin Deng	2958ca49c0	LLM: add patching function for llm finetuning (#10247 )	2024-03-21 16:01:01 +08:00
hxsz1997	158a49986a	Add quickstart for install bigdl-llm in docker on windows with Intel GPU (#10421 ) * add quickstart for install bigdl in docker on window with Intel GPU * modify the inference command * add note of required disk space * add the issue of iGPU	2024-03-21 15:57:27 +08:00
Zhicun	5b97fdb87b	update deepseek example readme (#10420 ) * update readme * update * update readme	2024-03-21 15:21:48 +08:00
hxsz1997	a5f35757a4	Migrate langchain rag cpu example to gpu (#10450 ) * add langchain rag on gpu * add rag example in readme * add trust_remote_code in TransformersEmbeddings.from_model_id * add trust_remote_code in TransformersEmbeddings.from_model_id in cpu	2024-03-21 15:20:46 +08:00
Heyang Sun	c672e97239	Fix CPU finetuning docker (#10494 ) * Fix CPU finetuning docker * Update README.md	2024-03-21 11:53:30 +08:00
binbin Deng	85ef3f1d99	LLM: add empty cache in deepspeed autotp benchmark script (#10488 )	2024-03-21 10:51:23 +08:00
Xiangyu Tian	5a5fd5af5b	LLM: Add speculative benchmark on CPU/XPU (#10464 ) Add speculative benchmark on CPU/XPU.	2024-03-21 09:51:06 +08:00
Ruonan Wang	28c315a5b9	LLM: fix deepspeed error of finetuning on xpu (#10484 )	2024-03-21 09:46:25 +08:00
Kai Huang	021d77fd22	Remove softmax upcast fp32 in llama (#10481 ) * update * fix style	2024-03-20 18:17:34 +08:00
Yishuo Wang	cfdf8ad496	Fix `modules_not_to_convert` argument (#10483 )	2024-03-20 17:47:03 +08:00
Xiangyu Tian	cbe24cc7e6	LLM: Enable BigDL IPEX Int8 (#10480 ) Enable BigDL IPEX Int8	2024-03-20 15:59:54 +08:00
ZehuaCao	1d062e24db	Update serving doc (#10475 ) * update serving doc * add tob * update * update * update * update vllm worker	2024-03-20 14:44:43 +08:00
Cengguang Zhang	4581e4f17f	LLM: fix whiper model missing config. (#10473 ) * fix whiper model missing config. * fix style. * fix style. * style.	2024-03-20 14:22:37 +08:00
Jin Qiao	e41d556436	LLM: change fp16 benchmark to model.half (#10477 ) * LLM: change fp16 benchmark to model.half * fix	2024-03-20 13:38:39 +08:00
Yishuo Wang	749bedaf1e	fix rwkv v5 fp16 (#10474 )	2024-03-20 13:15:08 +08:00
Yuwen Hu	72bcc27da9	[LLM] Add `TransformersBgeEmbeddings` class in `bigdl.llm.langchain.embeddings` (#10459 ) * Add TransformersBgeEmbeddings class in bigdl.llm.langchain.embeddings * Small fixes	2024-03-19 18:04:35 +08:00
Cengguang Zhang	463a86cd5d	LLM: fix qwen-vl interpolation gpu abnormal results. (#10457 ) * fix qwen-vl interpolation gpu abnormal results. * fix style. * update qwen-vl gpu example. * fix comment and update example. * fix style.	2024-03-19 16:59:39 +08:00
Jin Qiao	e9055c32f9	LLM: fix fp16 mem record in benchmark (#10461 ) * LLM: fix fp16 mem record in benchmark * change style	2024-03-19 16:17:23 +08:00
Shaojun Liu	0e388f4b91	Fix Trivy Docker Image Vulnerabilities for BigDL Release 2.5.0 (#10447 ) * Update pypi version to fix trivy issues * refine	2024-03-19 14:52:15 +08:00

2643 commits