ipex-llm

Author	SHA1	Message	Date
Yuwen Hu	21631209a9	[LLM] Skip CPU performance test for now (#9291 ) * Skip llm cpu performance test for now * Add install for wheel package	2023-10-27 12:55:04 +08:00
Ziteng Zhang	46ab0419b8	Merge pull request #9279 from Jasonzzt/main Add bigdl-llm-finetune-cpu to manually_build to upload image on hub	2023-10-27 09:55:08 +08:00
Yuwen Hu	733df28a2b	[LLM] Migrate Arc UT to another runner (#9286 ) * Separate arc llm ut to another runner * Add dependency for einops	2023-10-26 19:08:57 +08:00
Ziteng Zhang	916ccc0779	Update manually_build_for_testing.yml	2023-10-26 16:26:14 +08:00
Ziteng Zhang	14a23015f8	Update manually_build.yml	2023-10-26 16:24:03 +08:00
Jasonzzt	37b1708d16	Add bigdl-llm-finetune-cpu to manually_build	2023-10-26 15:53:44 +08:00
Lilac09	4ed7f066d3	add bigdl-llm-finetune-xpu to manually_build (#9278 )	2023-10-26 15:30:05 +08:00
Cheen Hau, 俊豪	ab40607b87	Enable unit test workflow on Arc (#9213 ) * Add gpu workflow and a transformers API inference test * Set device-specific env variables in script instead of workflow * Fix status message --------- Co-authored-by: sgwhat <ge.song@intel.com>	2023-10-25 15:17:18 +08:00
SONG Ge	160a1e5ee7	[WIP] Add UT for Mistral Optimized Model (#9248 ) * add ut for mistral model * update * fix model path * upgrade transformers version for mistral model * refactor correctness ut for mustral model * refactor mistral correctness ut * revert test_optimize_model back * remove mistral from test_optimize_model * add to revert transformers version back to 4.31.0	2023-10-25 15:14:17 +08:00
WeiguangHan	ec9195da42	LLM: using html to visualize the perf result for Arc (#9228 ) * LLM: using html to visualize the perf result for Arc * deploy the html file * add python license * reslove some comments	2023-10-24 18:05:25 +08:00
Guancheng Fu	f37547249d	Refine README/CICD (#9253 )	2023-10-24 12:56:03 +08:00
Guancheng Fu	9faa2f1eef	Fix bigdl-llm-serving-tdx image (#9251 )	2023-10-24 10:49:35 +08:00
Guancheng Fu	6cb884d82d	Fix missing manually_build_for_testing entry (#9245 )	2023-10-23 16:35:09 +08:00
Guancheng Fu	2ead3f7d54	add manually build (#9244 )	2023-10-23 15:53:30 +08:00
WeiguangHan	f87f67ee1c	LLM: arc perf test for some popular models (#9188 )	2023-10-19 15:56:15 +08:00
ZehuaCao	65dd73b62e	Update manually_build.yml (#9138 ) * Update manually_build.yml fix llm-serving-tdx image build dir * Update manually_build.yml	2023-10-11 15:07:09 +08:00
Yuwen Hu	dc70fc7b00	Update performance tests for dependency of bigdl-core-xe-esimd (#9124 )	2023-10-10 19:32:17 +08:00
Yuwen Hu	0e09dd926b	[LLM] Fix example test (#9118 ) * Update llm example test link due to example layout change * Add better change detect	2023-10-10 13:24:18 +08:00
Zhengjin Wang	0dbb3a283e	amend manually_build	2023-10-10 10:03:23 +08:00
Zhengjin Wang	bb3bb46400	add llm-serving-xpu on github action	2023-10-10 09:48:58 +08:00
Yuwen Hu	65212451cc	[LLM] Small update to performance tests (#9106 ) * small updates to llm performance tests regarding model handling * Small fix	2023-10-09 16:55:25 +08:00
ZehuaCao	aad68100ae	Add trusted-bigdl-llm-serving-tdx image. (#9093 ) * add entrypoint in cpu serving * kubernetes support for fastchat cpu serving * Update Readme * add image to manually_build action * update manually_build.yml * update README.md * update manually_build.yaml * update attestation_cli.py * update manually_build.yml * update Dockerfile * rename * update trusted-bigdl-llm-serving-tdx Dockerfile	2023-10-08 10:13:51 +08:00
ZehuaCao	b773d67dd4	Add Kubernetes support for BigDL-LLM-serving CPU. (#9071 )	2023-10-07 09:37:48 +08:00
Lilac09	c91b2bd574	fix:modify indentation (#9070 ) * modify Dockerfile * add README.md * add README.md * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl inference cpu image build * Add bigdl llm xpu image build * manually build * recover file * manually build * recover file * modify indentation	2023-09-27 14:53:52 +08:00
Lilac09	ecee02b34d	Add bigdl llm xpu image build (#9062 ) * modify Dockerfile * add README.md * add README.md * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl inference cpu image build * Add bigdl llm xpu image build	2023-09-26 14:29:03 +08:00
Lilac09	9ac950fa52	Add bigdl llm cpu image build (#9047 ) * modify Dockerfile * add README.md * add README.md * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build	2023-09-26 13:22:11 +08:00
Yuwen Hu	c389e1323d	fix xpu performance tests by making sure that latest bigdl-core-xe is installed (#9001 )	2023-09-19 17:33:30 +08:00
Wang Jian	7563b26ca9	Occlum fastchat build Use nocache and update order (#8972 )	2023-09-14 14:05:15 +08:00
Yuwen Hu	ca35c93825	[LLM] Fix langchain UT (#8929 ) * Change dependency version for langchain uts * Downgrade pandas version instead; and update example readme accordingly	2023-09-08 13:51:04 +08:00
xingyuan li	704a896e90	[LLM] Add perf test on xpu for bigdl-llm (#8866 ) * add xpu latency job * update install way * remove duplicated workflow * add perf upload	2023-09-05 17:36:24 +09:00
xingyuan li	de6c6bb17f	[LLM] Downgrade amx build gcc version and remove avx flag display (#8856 ) * downgrade to gcc 11 * remove avx display	2023-08-31 14:08:13 +09:00
Shengsheng Huang	7b566bf686	[LLM] add new API for optimize any pytorch models (#8827 ) * add new API for optimize any pytorch models * change test util name * revise API and update UT * fix python style * update ut config, change default value * change defaults, disable ut transcribe	2023-08-30 19:41:53 +08:00
Wang Jian	954ef954b6	[PPML] Add occlum llm image munually build (#8849 )	2023-08-30 11:31:47 +08:00
xingyuan li	67052198eb	[LLM] Build with multiprocess (#8797 ) * build with multiprocess	2023-08-29 10:49:52 +09:00
xingyuan li	6a902b892e	[LLM] Add amx build step (#8822 ) * add amx build step	2023-08-28 17:41:18 +09:00
Song Jiaming	b8b1b6888b	[LLM] Performance test (#8796 )	2023-08-25 14:31:45 +08:00
SONG Ge	d2926c7672	[LLM] Unify Langchain Native and Transformers LLM API (#8752 ) * deprecate BigDLNativeTransformers and add specific LMEmbedding method * deprecate and add LM methods for langchain llms * add native params to native langchain * new imple for embedding * move ut from bigdlnative to casual llm * rename embeddings api and examples update align with usage updating * docqa example hot-fix * add more api docs * add langchain ut for starcoder * support model_kwargs for transformer methods when calling causalLM and add ut * ut fix for transformers embedding * update for langchain causal supporting transformers * remove model_family in readme doc * add model_families params to support more models * update api docs and remove chatglm embeddings for now * remove chatglm embeddings in examples * new refactor for ut to add bloom and transformers llama ut * disable llama transformers embedding ut	2023-08-25 11:14:21 +08:00
xingyuan li	9537194b4b	[LLM] Fix llm test workflow repeatedly download model files	2023-08-25 11:20:46 +09:00
Jin Hanyu	a73a3e5ff9	Fix bugs in manually_build_for_testing.yml. (#8792 )	2023-08-23 15:49:23 +08:00
xingyuan li	c94bdd3791	[LLM] Merge windows & linux nightly test (#8756 ) * fix download statement * add check before build wheel * use curl to upload files * windows unittest won't upload converted model * split llm-cli test into windows & linux versions * update tempdir create way * fix nightly converted model name * windows llm-cli starcoder test temply disabled * remove taskset dependency * rename llm_unit_tests_linux to llm_unit_tests	2023-08-23 12:48:41 +09:00
Shaojun Liu	394304b918	Re organize llm test (#8766 ) * run llm-example-test in llm-nightly-test.yml * comment out the schedule event	2023-08-17 09:42:25 +08:00
Shaojie Cui	0a8db3abe0	[PPML]refactor python toolkit (#8740 ) * add dependency and example * fix stage 3 * downgrade protobuf * reduce epc memory * add script * Readme reduction * delete unused note	2023-08-15 10:11:53 +08:00
xingyuan li	1cb8f5abbd	[LLM] Revert compile OS for llm build workflow (#8732 ) * use almalinux to build	2023-08-11 17:47:45 +09:00
xingyuan li	33d9ad234f	[LLM] Linux vnni build with ubuntu 18.04 (#8710 ) * move from almalinux	2023-08-10 19:04:03 +09:00
Song Jiaming	e717e304a6	LLM first example test and template (#8658 )	2023-08-10 10:03:11 +08:00
Yishuo Wang	710b9b8982	[LLM] add linux chatglm pybinding binary file (#8698 )	2023-08-08 11:16:30 +08:00
xingyuan li	4482ccb329	[LLM] Change build system from centos7 to ubuntu18.04 (#8686 ) * centos7 to ubuntu18 * ubuntu git version 2.17 need to update * use almalinux8 to build avx2 binaries	2023-08-07 19:09:58 +09:00
Yishuo Wang	5837cc424a	[LLM] add chatglm pybinding binary file release (#8677 )	2023-08-04 11:45:27 +08:00
xingyuan li	bc4cdb07c9	Remove conda for llm workflow (#8671 )	2023-08-04 12:09:42 +09:00
xingyuan li	110cfb5546	[LLM] Remove old windows nightly test code (#8668 ) Remove old Windows nightly test code triggered by task scheduler Add new Windows nightly workflow for nightly testing	2023-08-03 17:12:23 +09:00

1 2 3 4

200 commits