ipex-llm

Author	SHA1	Message	Date
Wang	a42c25436e	Merge remote-tracking branch 'upstream/main'	2023-10-09 10:55:18 +08:00
JIN Qiao	65373d2a8b	LLM: adjust portable zip content (#9054 ) * LLM: adjust portable zip content * LLM: adjust portable zip README	2023-10-09 10:51:19 +08:00
Guancheng Fu	df8df751c4	Modify readme for bigdl-llm-serving-cpu (#9105 )	2023-10-09 09:56:09 +08:00
Heyang Sun	2756f9c20d	XPU QLoRA Container (#9082 ) * XPU QLoRA Container * fix apt issue * refine	2023-10-08 11:04:20 +08:00
ZehuaCao	aad68100ae	Add trusted-bigdl-llm-serving-tdx image. (#9093 ) * add entrypoint in cpu serving * kubernetes support for fastchat cpu serving * Update Readme * add image to manually_build action * update manually_build.yml * update README.md * update manually_build.yaml * update attestation_cli.py * update manually_build.yml * update Dockerfile * rename * update trusted-bigdl-llm-serving-tdx Dockerfile	2023-10-08 10:13:51 +08:00
Xin Qiu	b3e94a32d4	change log4error import (#9098 )	2023-10-08 09:23:28 +08:00
Kai Huang	78ea7ddb1c	Combine apply_rotary_pos_emb for gpt-neox (#9074 )	2023-10-07 16:27:46 +08:00
Heyang Sun	0b40ef8261	separate trusted and native llm cpu finetune from lora (#9050 ) * seperate trusted-llm and bigdl from lora finetuning * add k8s for trusted llm finetune * refine * refine * rename cpu to tdx in trusted llm * solve conflict * fix typo * resolving conflict * Delete docker/llm/finetune/lora/README.md * fix --------- Co-authored-by: Uxito-Ada <seusunheyang@foxmail.com> Co-authored-by: leonardozcm <leonardo1997zcm@gmail.com>	2023-10-07 15:26:59 +08:00
Wang	4aee952b10	Merge remote-tracking branch 'upstream/main'	2023-10-07 09:53:52 +08:00
ZehuaCao	b773d67dd4	Add Kubernetes support for BigDL-LLM-serving CPU. (#9071 )	2023-10-07 09:37:48 +08:00
Yang Wang	36dd4afd61	Fix llama when rope scaling is not None (#9086 ) * Fix llama when rope scaling is not None * fix style * fix style	2023-10-06 13:27:37 -07:00
Yang Wang	fcb1c618a0	using bigdl-llm fused rope for llama (#9066 ) * optimize llama xpu rope * fix bug * fix style * refine append cache * remove check * do not cache cos sin * remove unnecessary changes * clean up * fix style * check for training	2023-10-06 09:57:29 -07:00
Jason Dai	50044640c0	Update README.md (#9085 )	2023-10-06 21:54:18 +08:00
Jiao Wang	aefa5a5bfe	Qwen kv cache (#9079 ) * qwen and aquila * update * update * style	2023-10-05 11:59:17 -07:00
Jiao Wang	d5ca1f32b6	Aquila KV cache optimization (#9080 ) * update * update * style	2023-10-05 11:10:57 -07:00
Jason Dai	7506100bd5	Update readme (#9084 )	2023-10-05 16:54:09 +08:00
Yang Wang	88565c76f6	add export merged model example (#9018 ) * add export merged model example * add sources * add script * fix style	2023-10-04 21:18:52 -07:00
Yang Wang	0cd8f1c79c	Use ipex fused rms norm for llama (#9081 ) * also apply rmsnorm * fix cpu	2023-10-04 21:04:55 -07:00
Cengguang Zhang	fb883100e7	LLM: support chatglm-18b convert attention forward in benchmark scripts. (#9072 ) * add chatglm-18b convert. * fix if statement. * fix	2023-09-28 14:04:52 +08:00
Yishuo Wang	6de2189e90	[LLM] fix chatglm main choice (#9073 )	2023-09-28 11:23:37 +08:00
binbin Deng	760183bac6	LLM: update key feature and installation page of document (#9068 )	2023-09-27 15:44:34 +08:00
Lilac09	c91b2bd574	fix:modify indentation (#9070 ) * modify Dockerfile * add README.md * add README.md * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl inference cpu image build * Add bigdl llm xpu image build * manually build * recover file * manually build * recover file * modify indentation	2023-09-27 14:53:52 +08:00
Wang	ddcd9e7d0a	modify indentation	2023-09-27 14:49:58 +08:00
Wang	9935772f24	recover file	2023-09-26 15:50:51 +08:00
Wang	efc2158215	manually build	2023-09-26 15:47:47 +08:00
Wang	fdc0e838df	Merge remote-tracking branch 'upstream/main'	2023-09-26 15:45:31 +08:00
Wang	b17e536a1b	recover file	2023-09-26 15:45:03 +08:00
Cengguang Zhang	ad62c58b33	LLM: Enable jemalloc in benchmark scripts. (#9058 ) * enable jemalloc. * fix readme.	2023-09-26 15:37:49 +08:00
Wang	9e03c5c7fc	manually build	2023-09-26 15:28:01 +08:00
Wang	2dc76dc358	manually build	2023-09-26 15:15:15 +08:00
Lilac09	ecee02b34d	Add bigdl llm xpu image build (#9062 ) * modify Dockerfile * add README.md * add README.md * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl inference cpu image build * Add bigdl llm xpu image build	2023-09-26 14:29:03 +08:00
Wang	d0ac0941a2	Add bigdl llm xpu image build	2023-09-26 14:25:10 +08:00
Wang	781bc5bc8d	Add bigdl inference cpu image build	2023-09-26 14:07:36 +08:00
Wang	390c90551e	Add bigdl inference cpu image build	2023-09-26 14:03:55 +08:00
Wang	7a69bee8d0	Modify Dockerfile	2023-09-26 13:58:42 +08:00
Wang	47996c29e4	Merge remote-tracking branch 'upstream/main'	2023-09-26 13:56:27 +08:00
Lilac09	9ac950fa52	Add bigdl llm cpu image build (#9047 ) * modify Dockerfile * add README.md * add README.md * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build	2023-09-26 13:22:11 +08:00
Wang	a50c11d326	Modify Dockerfile	2023-09-26 11:19:13 +08:00
Ziteng Zhang	a717352c59	Replace Llama 7b to Llama2-7b in README.md (#9055 ) * Replace Llama 7b with Llama2-7b in README.md Need to replace the base model to Llama2-7b as we are operating on Llama2 here. * Replace Llama 7b to Llama2-7b in README.md a llama 7b in the 1st line is missed * Update architecture graph --------- Co-authored-by: Heyang Sun <60865256+Uxito-Ada@users.noreply.github.com>	2023-09-26 09:56:46 +08:00
Guancheng Fu	cc84ed70b3	Create serving images (#9048 ) * Finished & Tested * Install latest pip from base images * Add blank line * Delete unused comment * fix typos	2023-09-25 15:51:45 +08:00
Wang	847af63e8e	Add bigdl llm cpu image build	2023-09-25 15:33:39 +08:00
Wang	7f2d2a5238	Add bigdl llm cpu image build	2023-09-25 15:14:23 +08:00
Wang	9cae4600da	Add bigdl llm cpu image build	2023-09-25 14:45:30 +08:00
Wang	ceed895c31	Add bigdl inference cpu image build	2023-09-25 14:31:43 +08:00
Cengguang Zhang	b4a1266ef0	[WIP] LLM: add kv cache support for internlm. (#9036 ) * LLM: add kv cache support for internlm * add internlm apply_rotary_pos_emb * fix. * fix style.	2023-09-25 14:16:59 +08:00
Wang	fc8bf6b0d5	Modify Dockerfile	2023-09-25 14:05:08 +08:00
Wang	e8f436453d	Merge remote-tracking branch 'upstream/main'	2023-09-25 13:59:19 +08:00
Ruonan Wang	975da86e00	LLM: fix gptneox kv cache (#9044 )	2023-09-25 13:03:57 +08:00
Heyang Sun	4b843d1dbf	change lora-model output behavior on k8s (#9038 ) Co-authored-by: leonardozcm <leonardo1997zcm@gmail.com>	2023-09-25 09:28:44 +08:00
Cengguang Zhang	26213a5829	LLM: Change benchmark bf16 load format. (#9035 ) * LLM: Change benchmark bf16 load format. * comment on bf16 chatglm. * fix.	2023-09-22 17:38:38 +08:00

1 2 3 4 5 ...

1474 commits