Commit graph

127 commits

Author SHA1 Message Date
Lilac09
326ef7f491 add README for llm-inference-cpu (#9147)
* add README for llm-inference-cpu

* modify README

* add README for llm-inference-cpu on Windows
2023-10-16 10:27:44 +08:00
Lilac09
e02fbb40cc add bigdl-llm-tutorial into llm-inference-cpu image (#9139)
* add bigdl-llm-tutorial into llm-inference-cpu image

* modify Dockerfile

* modify Dockerfile
2023-10-11 16:41:04 +08:00
Ziteng Zhang
4a0a3c376a Add stand-alone mode on cpu for finetuning (#9127)
* Added steps for finetuning on CPU in stand-alone mode

* Add stand-alone mode to bigdl-lora-finetuing-entrypoint.sh

* delete redundant docker commands

* Update README.md

Switch to intelanalytics/bigdl-llm-finetune-cpu:2.4.0-SNAPSHOT and append example outputs so users can verify the run

* Update bigdl-lora-finetuing-entrypoint.sh

Add some tunable parameters

* Add parameters --cpus and -e WORKER_COUNT_DOCKER

* Modified the CPU-count range parameters

* Set -ppn to CCL_WORKER_COUNT

* Add related configuration suggestions in README.md
2023-10-11 15:01:21 +08:00
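The stand-alone-mode commit above mentions a `--cpus` limit, a `WORKER_COUNT_DOCKER` environment variable, and `-ppn` being set to `CCL_WORKER_COUNT`. A minimal sketch of how these might fit together is below; the image tag comes from the commit body, but the flag values and the `mpirun` line are illustrative assumptions, not documented defaults:

```shell
# Hypothetical stand-alone CPU finetuning launch; values are assumptions.
docker run -d \
  --name bigdl-lora-finetune \
  --cpus 48 \
  -e WORKER_COUNT_DOCKER=2 \
  intelanalytics/bigdl-llm-finetune-cpu:2.4.0-SNAPSHOT

# Inside the container, the entrypoint presumably wires the worker count
# into Intel MPI, with -ppn taken from CCL_WORKER_COUNT, e.g.:
#   mpirun -n "$WORKER_COUNT_DOCKER" -ppn "$CCL_WORKER_COUNT" ...
```

Bounding `--cpus` and the worker count together keeps each MPI rank pinned to a predictable share of cores, which matches the "configuration suggestions" the commit adds to the README.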
Lilac09
30e3c196f3 Merge pull request #9108 from Zhengjin-Wang/main
Add instruction for chat.py in bigdl-llm-cpu
2023-10-10 16:40:52 +08:00
Lilac09
1e78b0ac40 Optimize LoRA Docker by Shrinking Image Size (#9110)
* modify dockerfile

* modify dockerfile
2023-10-10 15:53:17 +08:00
Heyang Sun
2c0c9fecd0 refine LLM containers (#9109) 2023-10-09 15:45:30 +08:00
Wang
a1aefdb8f4 modify README 2023-10-09 13:36:29 +08:00
Wang
3814abf95a add instruction for chat.py 2023-10-09 12:57:28 +08:00
Guancheng Fu
df8df751c4 Modify readme for bigdl-llm-serving-cpu (#9105) 2023-10-09 09:56:09 +08:00
Heyang Sun
2756f9c20d XPU QLoRA Container (#9082)
* XPU QLoRA Container

* fix apt issue

* refine
2023-10-08 11:04:20 +08:00
Heyang Sun
0b40ef8261 separate trusted and native llm cpu finetune from lora (#9050)
* separate trusted-llm and bigdl from lora finetuning

* add k8s for trusted llm finetune

* refine

* refine

* rename cpu to tdx in trusted llm

* solve conflict

* fix typo

* resolving conflict

* Delete docker/llm/finetune/lora/README.md

* fix

---------

Co-authored-by: Uxito-Ada <seusunheyang@foxmail.com>
Co-authored-by: leonardozcm <leonardo1997zcm@gmail.com>
2023-10-07 15:26:59 +08:00
ZehuaCao
b773d67dd4 Add Kubernetes support for BigDL-LLM-serving CPU. (#9071) 2023-10-07 09:37:48 +08:00
Lilac09
ecee02b34d Add bigdl llm xpu image build (#9062)
* modify Dockerfile

* add README.md

* add README.md

* Modify Dockerfile

* Add bigdl inference cpu image build

* Add bigdl llm cpu image build

* Add bigdl llm cpu image build

* Add bigdl llm cpu image build

* Modify Dockerfile

* Add bigdl inference cpu image build

* Add bigdl inference cpu image build

* Add bigdl llm xpu image build
2023-09-26 14:29:03 +08:00
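The image-build commits above add CPU and XPU inference images. A hedged sketch of the corresponding build commands is shown here; the Dockerfile paths and tags are assumptions inferred from the image names in the commit messages, not paths confirmed by the log:

```shell
# Hypothetical build commands for the CPU and XPU images added in these
# commits; -f paths and :2.4.0-SNAPSHOT tags are assumptions.
docker build \
  -t intelanalytics/bigdl-llm-cpu:2.4.0-SNAPSHOT \
  -f docker/llm/inference/cpu/docker/Dockerfile .

docker build \
  -t intelanalytics/bigdl-llm-xpu:2.4.0-SNAPSHOT \
  -f docker/llm/inference/xpu/docker/Dockerfile .
```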
Lilac09
9ac950fa52 Add bigdl llm cpu image build (#9047)
* modify Dockerfile

* add README.md

* add README.md

* Modify Dockerfile

* Add bigdl inference cpu image build

* Add bigdl llm cpu image build

* Add bigdl llm cpu image build

* Add bigdl llm cpu image build
2023-09-26 13:22:11 +08:00
Ziteng Zhang
a717352c59 Replace Llama 7b with Llama2-7b in README.md (#9055)
* Replace Llama 7b with Llama2-7b in README.md

Need to replace the base model with Llama2-7b, as we are operating on Llama2 here.

* Replace Llama 7b with Llama2-7b in README.md

a "Llama 7b" in the first line was missed

* Update architecture graph

---------

Co-authored-by: Heyang Sun <60865256+Uxito-Ada@users.noreply.github.com>
2023-09-26 09:56:46 +08:00
Guancheng Fu
cc84ed70b3 Create serving images (#9048)
* Finished & Tested

* Install latest pip from base images

* Add blank line

* Delete unused comment

* fix typos
2023-09-25 15:51:45 +08:00
Heyang Sun
4b843d1dbf change lora-model output behavior on k8s (#9038)
Co-authored-by: leonardozcm <leonardo1997zcm@gmail.com>
2023-09-25 09:28:44 +08:00
Lilac09
9126abdf9b add README.md for bigdl-llm-cpu image (#9026)
* modify Dockerfile

* add README.md

* add README.md
2023-09-22 09:03:57 +08:00
Guancheng Fu
3913ba4577 add README.md (#9004) 2023-09-21 10:32:56 +08:00
Guancheng Fu
b6c9198d47 Add xpu image for bigdl-llm (#9003)
* Add xpu image

* fix

* fix

* fix format
2023-09-19 16:56:22 +08:00
Guancheng Fu
7353882732 add Dockerfile (#8993) 2023-09-18 13:25:37 +08:00
Xiangyu Tian
52878d3e5f [PPML] Enable TLS in Attestation API Serving for LLM finetuning (#8945)
Add enableTLS flag to enable TLS in Attestation API Serving for LLM finetuning.
2023-09-18 09:32:25 +08:00
Heyang Sun
aeef73a182 Tell User How to Find Fine-tuned Model in README (#8985)
* Tell User How to Find Fine-tuned Model in README

* Update README.md
2023-09-15 13:45:40 +08:00
Xiangyu Tian
4dce238867 Fix incorrect usage in docs of Finetuning to enable TDX (#8932) 2023-09-08 16:03:14 +08:00
Xiangyu Tian
ea6d4148e9 [PPML] Add attestation for LLM Finetuning (#8908)
Add TDX attestation for LLM Finetuning in TDX CoCo

---------

Co-authored-by: Heyang Sun <60865256+Uxito-Ada@users.noreply.github.com>
2023-09-08 10:24:04 +08:00
Heyang Sun
2d97827ec5 fix typo in lora entrypoint (#8862) 2023-09-06 13:52:25 +08:00
Heyang Sun
b1ac8dc1bc BF16 Lora Finetuning on K8S with OneCCL and Intel MPI (#8775)
* BF16 Lora Finetuning on K8S with OneCCL and Intel MPI

* Update README.md

* format

* refine

* Update README.md

* refine

* Update README.md

* increase nfs volume size to improve IO performance

* fix bugs

* Update README.md

* Update README.md

* fix permission

* move output destination

* Update README.md

* fix wrong base model name in doc

* fix output path in entrypoint

* add a permission-precreated output dir

* format

* move output logs to a persistent storage
2023-08-31 14:56:23 +08:00
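The BF16 LoRA finetuning commit above targets Kubernetes with OneCCL and Intel MPI, adds an NFS-backed output volume, and moves logs to persistent storage. A minimal sketch of submitting such a job follows; the manifest path and namespace are assumptions for illustration, not paths stated in the log:

```shell
# Hypothetical job submission for the K8S BF16 LoRA finetuning setup;
# the manifest directory and namespace names are assumptions.
kubectl apply -f docker/llm/finetune/lora/kubernetes/ -n bigdl-lora-finetuning

# The commit notes that outputs and logs land on a pre-created,
# permission-fixed directory backed by an NFS persistent volume, so results
# survive pod restarts and can be inspected from outside the cluster.
kubectl logs -n bigdl-lora-finetuning -l app=bigdl-lora-finetune -f
```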