ipex-llm

Author	SHA1	Message	Date
Zhengjin Wang	0dbb3a283e	amend manually_build	2023-10-10 10:03:23 +08:00
Zhengjin Wang	bb3bb46400	add llm-serving-xpu on github action	2023-10-10 09:48:58 +08:00
Yuwen Hu	65212451cc	[LLM] Small update to performance tests (#9106 ) * small updates to llm performance tests regarding model handling * Small fix	2023-10-09 16:55:25 +08:00
ZehuaCao	aad68100ae	Add trusted-bigdl-llm-serving-tdx image. (#9093 ) * add entrypoint in cpu serving * kubernetes support for fastchat cpu serving * Update Readme * add image to manually_build action * update manually_build.yml * update README.md * update manually_build.yaml * update attestation_cli.py * update manually_build.yml * update Dockerfile * rename * update trusted-bigdl-llm-serving-tdx Dockerfile	2023-10-08 10:13:51 +08:00
ZehuaCao	b773d67dd4	Add Kubernetes support for BigDL-LLM-serving CPU. (#9071 )	2023-10-07 09:37:48 +08:00
Lilac09	c91b2bd574	fix:modify indentation (#9070 ) * modify Dockerfile * add README.md * add README.md * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl inference cpu image build * Add bigdl llm xpu image build * manually build * recover file * manually build * recover file * modify indentation	2023-09-27 14:53:52 +08:00
Lilac09	ecee02b34d	Add bigdl llm xpu image build (#9062 ) * modify Dockerfile * add README.md * add README.md * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl inference cpu image build * Add bigdl llm xpu image build	2023-09-26 14:29:03 +08:00
Lilac09	9ac950fa52	Add bigdl llm cpu image build (#9047 ) * modify Dockerfile * add README.md * add README.md * Modify Dockerfile * Add bigdl inference cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build * Add bigdl llm cpu image build	2023-09-26 13:22:11 +08:00
Yuwen Hu	c389e1323d	fix xpu performance tests by making sure that latest bigdl-core-xe is installed (#9001 )	2023-09-19 17:33:30 +08:00
Wang Jian	7563b26ca9	Occlum fastchat build Use nocache and update order (#8972 )	2023-09-14 14:05:15 +08:00
Yuwen Hu	ca35c93825	[LLM] Fix langchain UT (#8929 ) * Change dependency version for langchain uts * Downgrade pandas version instead; and update example readme accordingly	2023-09-08 13:51:04 +08:00
xingyuan li	704a896e90	[LLM] Add perf test on xpu for bigdl-llm (#8866 ) * add xpu latency job * update install way * remove duplicated workflow * add perf upload	2023-09-05 17:36:24 +09:00
xingyuan li	de6c6bb17f	[LLM] Downgrade amx build gcc version and remove avx flag display (#8856 ) * downgrade to gcc 11 * remove avx display	2023-08-31 14:08:13 +09:00
Shengsheng Huang	7b566bf686	[LLM] add new API for optimize any pytorch models (#8827 ) * add new API for optimize any pytorch models * change test util name * revise API and update UT * fix python style * update ut config, change default value * change defaults, disable ut transcribe	2023-08-30 19:41:53 +08:00
Wang Jian	954ef954b6	[PPML] Add occlum llm image munually build (#8849 )	2023-08-30 11:31:47 +08:00
xingyuan li	67052198eb	[LLM] Build with multiprocess (#8797 ) * build with multiprocess	2023-08-29 10:49:52 +09:00
xingyuan li	6a902b892e	[LLM] Add amx build step (#8822 ) * add amx build step	2023-08-28 17:41:18 +09:00
Song Jiaming	b8b1b6888b	[LLM] Performance test (#8796 )	2023-08-25 14:31:45 +08:00
SONG Ge	d2926c7672	[LLM] Unify Langchain Native and Transformers LLM API (#8752 ) * deprecate BigDLNativeTransformers and add specific LMEmbedding method * deprecate and add LM methods for langchain llms * add native params to native langchain * new imple for embedding * move ut from bigdlnative to casual llm * rename embeddings api and examples update align with usage updating * docqa example hot-fix * add more api docs * add langchain ut for starcoder * support model_kwargs for transformer methods when calling causalLM and add ut * ut fix for transformers embedding * update for langchain causal supporting transformers * remove model_family in readme doc * add model_families params to support more models * update api docs and remove chatglm embeddings for now * remove chatglm embeddings in examples * new refactor for ut to add bloom and transformers llama ut * disable llama transformers embedding ut	2023-08-25 11:14:21 +08:00
xingyuan li	9537194b4b	[LLM] Fix llm test workflow repeatedly download model files	2023-08-25 11:20:46 +09:00
Jin Hanyu	a73a3e5ff9	Fix bugs in manually_build_for_testing.yml. (#8792 )	2023-08-23 15:49:23 +08:00
xingyuan li	c94bdd3791	[LLM] Merge windows & linux nightly test (#8756 ) * fix download statement * add check before build wheel * use curl to upload files * windows unittest won't upload converted model * split llm-cli test into windows & linux versions * update tempdir create way * fix nightly converted model name * windows llm-cli starcoder test temply disabled * remove taskset dependency * rename llm_unit_tests_linux to llm_unit_tests	2023-08-23 12:48:41 +09:00
Shaojun Liu	394304b918	Re organize llm test (#8766 ) * run llm-example-test in llm-nightly-test.yml * comment out the schedule event	2023-08-17 09:42:25 +08:00
Shaojie Cui	0a8db3abe0	[PPML]refactor python toolkit (#8740 ) * add dependency and example * fix stage 3 * downgrade protobuf * reduce epc memory * add script * Readme reduction * delete unused note	2023-08-15 10:11:53 +08:00
xingyuan li	1cb8f5abbd	[LLM] Revert compile OS for llm build workflow (#8732 ) * use almalinux to build	2023-08-11 17:47:45 +09:00
xingyuan li	33d9ad234f	[LLM] Linux vnni build with ubuntu 18.04 (#8710 ) * move from almalinux	2023-08-10 19:04:03 +09:00
Song Jiaming	e717e304a6	LLM first example test and template (#8658 )	2023-08-10 10:03:11 +08:00
Yishuo Wang	710b9b8982	[LLM] add linux chatglm pybinding binary file (#8698 )	2023-08-08 11:16:30 +08:00
xingyuan li	4482ccb329	[LLM] Change build system from centos7 to ubuntu18.04 (#8686 ) * centos7 to ubuntu18 * ubuntu git version 2.17 need to update * use almalinux8 to build avx2 binaries	2023-08-07 19:09:58 +09:00
Yishuo Wang	5837cc424a	[LLM] add chatglm pybinding binary file release (#8677 )	2023-08-04 11:45:27 +08:00
xingyuan li	bc4cdb07c9	Remove conda for llm workflow (#8671 )	2023-08-04 12:09:42 +09:00
xingyuan li	110cfb5546	[LLM] Remove old windows nightly test code (#8668 ) Remove old Windows nightly test code triggered by task scheduler Add new Windows nightly workflow for nightly testing	2023-08-03 17:12:23 +09:00
Yina Chen	bd177ab612	[LLM] llm binary build linux add avx & avx2 (#8665 ) * llm add linux avx & avx2 release * fix name * update check	2023-08-03 14:38:31 +08:00
xingyuan li	610084e3c0	[LLM] Complete windows unittest (#8611 ) * add windows nightly test workflow * use github runner to run pr test * model load should use lowbit * remove tmp dir after testing	2023-08-03 14:48:42 +09:00
Xin Qiu	0714888705	build windows avx dll (#8657 ) * windows avx * add to actions	2023-08-03 02:06:24 +08:00
Yina Chen	15b3adc7ec	[LLM] llm linux binary make -> cmake (#8656 ) * llm linux make -> cmake * update * update	2023-08-02 16:41:54 +08:00
xingyuan li	769209b7f0	Chatglm unittest disable due to missing instruction (#8650 )	2023-08-02 10:28:42 +09:00
xingyuan li	cdfbe652ca	[LLM] Add chatglm support for llm-cli (#8641 ) * add chatglm build * add llm-cli support * update git * install cmake * add ut for chatglm * add files to setup * fix bug cause permission error when sf lack file	2023-08-01 14:30:17 +09:00
xingyuan li	3361b66449	[LLM] Revert llm-cli to disable selecting executables on Windows (#8630 ) * revert vnni file select * revert setup.py * add model-api.dll	2023-07-31 11:15:44 +09:00
xingyuan li	919791e406	Add needs to make sure run in order (#8621 )	2023-07-26 14:16:57 +09:00
xingyuan li	e3418d7e61	[LLM] Remove concurrency group for binary build workflow (#8619 ) * remove concurrency group for nightly test	2023-07-26 12:15:53 +09:00
xingyuan li	a98b3fe961	Fix cancel flag causing nightly builds to fail (#8618 )	2023-07-26 11:11:08 +09:00
xingyuan li	7d45233825	fix trigger enable flag (#8616 )	2023-07-26 10:53:03 +09:00
Guancheng Fu	07d1aee825	[PPML] add fastchat image for tdx (#8610 )	2023-07-25 15:23:41 +08:00
Song Jiaming	650b82fa6e	[LLM] add CausalLM and Speech UT (#8597 )	2023-07-25 11:22:36 +08:00
xingyuan li	9c897ac7db	[LLM] Merge redundant code in workflow (#8596 ) * modify workflow concurrency group * Add build check to avoid repeated compilation * remove redundant code	2023-07-25 12:12:00 +09:00
Yuwen Hu	bbde423349	[LLM] Add current Linux UT inference tests to nightly tests (#8578 ) * Add current inference uts to nightly tests * Change test model from chatglm-6b to chatglm2-6b * Add thread num env variable for nightly test * Fix urls * Small fix	2023-07-21 13:26:38 +08:00
Yuwen Hu	2266ca7d2b	[LLM] Small updates to transformers int4 ut (#8574 ) * Small fix to transformers int4 ut * Small fix	2023-07-20 13:20:25 +08:00
xingyuan li	2eeb653c75	fix llm build workflow misspell (#8575 )	2023-07-20 12:08:54 +09:00
Song Jiaming	411d896636	LLM first transformers UT (#8514 ) * ut * transformers api first ut * name * dir issue * use chatglm instead of chatglm2 * omp * set omp in sh * source * taskset * test * test omp * add test	2023-07-20 10:16:27 +08:00
Yishuo Wang	3bd1420b71	LLM: use MSVC to build avx-vnni binary files (#8570 )	2023-07-19 17:38:14 +08:00
Guancheng Fu	4f287df664	Fix manullay_build_for_testing (#8556 )	2023-07-18 16:21:39 +08:00
Guancheng Fu	3e0e370898	[PPML] Add bigdl-llm-demo dependencies to TDX image (#8551 ) * add bigdl-llm-demo dependencies to tdx image * use only one RUN command * Add bigdl-ppml * done	2023-07-18 14:23:07 +08:00
xingyuan li	c87853233b	[LLM] Add windows vnni binary build step (#8518 ) * add windows vnni build step * update build info * add download command	2023-07-14 17:24:39 +09:00
xingyuan li	903e9aee7a	Fix the problem of workflow cancellation after pr merge (#8530 ) * remove concurrency group for llm binary build workflow	2023-07-14 16:12:21 +09:00
Yuwen Hu	df97d39e29	Change thread_num in Linux inference actions (#8528 )	2023-07-14 10:46:03 +08:00
xingyuan li	60c2c0c3dc	Bug fix for merged pr #8503 (#8516 )	2023-07-13 17:26:30 +09:00
xingyuan li	4f152b4e3a	[LLM] Merge the llm.cpp build and the pypi release (#8503 ) * checkout llm.cpp to build new binary * use artifact to get latest built binary files * rename quantize * modify all release workflow	2023-07-13 16:34:24 +09:00
xingyuan li	04f2f04410	Add workflow_dispatch for llm unittest workflow (#8485 )	2023-07-10 13:16:18 +08:00
Guancheng Fu	a4ae132ef4	Add bigdl llm sgx image (#8480 ) * Add dockerfile for bigdl-llm-ppml * fix llm-cli multi-process * add workflow	2023-07-10 10:10:38 +08:00
Wang Jian	16c795158d	[PPML] Pull new deep-learning base image before build (#8469 ) * pull new base image before build * update	2023-07-06 14:29:09 +08:00
Yuwen Hu	936d21635f	[LLM] Extract tests to `.github/actions` to improve reusability (#8457 ) * Extract tests to .github/actions for better reusing in nightly tests * Small fix * Small fix	2023-07-05 10:09:10 +08:00
Guancheng Fu	e3e95e92ca	Add workflow for releasing TDX bigdl-llm image (#8455 )	2023-07-04 17:00:29 +08:00
Yuwen Hu	372c775cb4	[LLM] Change default runner for LLM Linux tests to the ones with AVX512 (#8448 ) * Basic change for AVX512 runner * Remove conda channel and action rename * Small fix * Small fix and reduce peak convert disk space * Define n_threads based on runner status * Small thread num fix * Define thread_num for cli * test * Add self-hosted label and other small fix	2023-07-04 14:53:03 +08:00
binbin Deng	146662bc0d	LLM: fix langchain windows failure (#8417 )	2023-06-30 09:59:10 +08:00
Yina Chen	6251ad8934	[LLM]Windows unittest (#8356 ) * win-unittest * update * update * try llama 7b * delete llama * update * add red-3b * only test red-3b * revert * add langchain * add dependency * delete langchain	2023-06-29 14:03:12 +08:00
Ruonan Wang	4be784a49d	LLM: add UT for starcoder (convert, inference) update examples and readme (#8379 ) * first commit to add path * update example and readme * update path * fix * update based on comment	2023-06-27 12:12:11 +08:00
Shengsheng Huang	c113ecb929	[LLM] langchain bloom, UT's, default parameters (#8357 ) * update langchain default parameters to align w/ api * add ut's for llm and embeddings * update inference test script to install langchain deps * update tests workflows --------- Co-authored-by: leonardozcm <changmin.zhao@intel.com>	2023-06-25 17:38:00 +08:00
Yuwen Hu	066f53232d	[LLM] Small nightly tests fix (#8364 ) * Test for install of tnftp * Improve install of tnftp	2023-06-20 11:19:13 +08:00
binbin Deng	ab1a833990	LLM: add basic uts related to inference (#8346 )	2023-06-19 10:25:51 +08:00
xingyuan li	daae7bd4e4	[LLM] Unittest for llm-cli (#8343 ) * add llm-cli test shell	2023-06-16 17:42:24 +08:00
Yuwen Hu	1aa33d35d5	[LLM] Refactor LLM Linux tests (#8349 ) * Small name fix * Add convert nightly tests, and for other llm tests, use stable ckpt * Small fix and ftp fix * Small fix * Small fix	2023-06-16 15:22:48 +08:00
Yuwen Hu	50dd9dd1c5	[LLM] Small improve for LLM base actions (#8344 ) * Hide ftp url for now * Small file name fix	2023-06-15 16:22:41 +08:00
Yuwen Hu	b30aa49c4e	[LLM] Add Actions for downloading & converting models (#8320 ) * First push to downloading and converting llm models for testing (Gondolin runner, avx2 for now) * Change yml file name	2023-06-15 13:43:47 +08:00
Ruonan Wang	8840dadd86	LLM: binary file version control on source forge (#8329 ) * support version control for llm based on date * update action	2023-06-15 09:53:27 +08:00
Pingchuan Ma (Henry)	773255e009	[LLM] Add dev wheel building and basic UT script for LLM package on Linux (#8264 ) * add wheel build for linux * test fix * test self-hosted runner * test fix * update runner * update runner * update fix * init cicd * init cicd * test conda * update fix * update no need manual python deps * test fix bugs * test fix bugs * test fix bugs * fix bugs	2023-06-08 00:49:57 +08:00
Pingchuan Ma (Henry)	2ed5842448	[LLM] add convert's python deps for LLM (#8260 ) * add python deps for LLM * update release.sh * change deps group name * update all * fix update * test fix * update	2023-06-06 16:01:17 +08:00
Pingchuan Ma (Henry)	c48d5f7cff	[LLM] Enable UT workflow logics for LLM (#8243 ) * check push connection * enable UT workflow logics for LLM * test fix * add licenses * test fix according to suggestions * test fix * update changes	2023-06-02 17:06:35 +08:00
Pingchuan Ma (Henry)	141febec1f	Add dev wheel building script for LLM package on Windows (#8238 ) * Add dev wheel building script for LLM package on Windows * delete conda * delete python version check * minor adjust * wheel name fixed * test check * test fix * change wheel name	2023-06-01 11:55:26 +08:00
Shaojie Cui	768b15881d	[PPML]CICD: build 32g bigdata image (#8205 ) * [PPML]CICD: build 32g bigdata image * fix	2023-05-17 11:30:10 +08:00
Xiangyu Tian	94f08edbb3	[PPML] Refactor BigDL Attestation Service Deployment of Docker and K8s (#8130 ) Refactor BigDL Attestation Service Deployment of Docker image, which split to base image and custom(reference) image. Update version to 2.4.0-SNAPSHOT. Refine documents.	2023-04-26 14:28:00 +08:00
Yao Li	d833a765fe	update manually build (#8129 )	2023-04-24 16:08:56 +08:00
Yao Li	8719674c92	update manually_build.yml (#8126 )	2023-04-24 15:29:24 +08:00
Le-Zheng	8b0876f238	add tdx image in action (#8125 ) * add tdx image in action * update * Update manually_build.yml * update Readme	2023-04-24 14:37:14 +08:00
Yao Li	f30215a77e	delete outdated bigdl kms (#8115 )	2023-04-23 15:45:55 +08:00
Yao Li	981bded4b3	fix bigdl-kms-reference (#8110 )	2023-04-23 14:08:53 +08:00
Yao Li	c7630f759a	[PPML] Add bigdl-kms into manually build (#8105 ) * update readme * update manually_build.yml * Update manually_build.yml * fix format * udpate * update	2023-04-23 11:51:00 +08:00
Wang Jian	96c9343ef2	[PPML] Update occlum production image build dir (#8098 )	2023-04-21 11:46:42 +08:00
Heyang Sun	814f5bd915	add no_proxy for bigdl-kms (#7996 )	2023-04-06 09:18:53 +08:00
Shaojie Cui	f885466475	[CICD]fix: typo in build (#7990 )	2023-04-04 14:30:44 +08:00
Shaojie Cui	8a24ae76a8	[CICD]fix image name in bigdata toolkit (#7978 )	2023-04-03 15:57:27 +08:00
Heyang Sun	e91cb31575	set no (#7974 )	2023-04-03 09:02:43 +08:00
Shaojie Cui	13717ee5c8	[CICD]add noattest bigdata toolkit image (#7968 )	2023-03-31 14:02:08 +08:00
Shaojie Cui	522e5ae35b	[CICD]add no_proxy arg when building bigdata (#7868 ) * [CICD]add no_proxy arg when building bigdata * fix	2023-03-14 16:34:52 +08:00
Shaojie Cui	57125dfcd4	[PPML]tag 8g epc bigdata as default image (#7789 ) * build 4g and tag 8g as latest * reduce script memory * doc	2023-03-08 10:49:40 +08:00
Shaojie Cui	f0fa26a8a8	[PPML]Reduce the epc usage of bigdata (#7606 ) * build bigdata with smaller epc * reduce epc in script * fix action * install pyarrow * delete unused part * reduce epc * fix * malloc_arena_max * set 16g to default and delete 4g * unused example * spark simplequery * pyspark simplequery * fix * fix * fix * code style fix	2023-03-01 14:59:27 +08:00
Guancheng Fu	3a57b97c2b	Add dl-serving manually_build workflow (#7719 )	2023-03-01 11:05:57 +08:00
Shaojie Cui	feacade38c	[CICD]fix: delete latest tag bigdata image built for testing (#7701 )	2023-02-27 17:01:22 +08:00
Shaojie Cui	b12f2e1902	[CICD]delete bigdata image with debug log level (#7657 )	2023-02-23 09:53:06 +08:00
Shaojie Cui	e5fb0a3315	[CICD]build bigdata toolkit with no cache (#7652 )	2023-02-22 16:32:28 +08:00

232 commits