Cheen Hau, 俊豪
8f23fb04dc
Add inference test for Whisper model on Arc ( #9330 )
...
* Add inference test for Whisper model
* Remove unnecessary inference time measurement
2023-11-03 10:15:52 +08:00
Ziteng Zhang
dd3cf2f153
LLM: Add python 3.10 & 3.11 UT
...
LLM: Add python 3.10 & 3.11 UT
2023-11-02 14:09:29 +08:00
Jasonzzt
d1bdc0ef72
spr & arc ut with python 3.9 & 3.10 & 3.11
2023-11-01 22:57:48 +08:00
Jasonzzt
687da21467
test 3.11
2023-11-01 19:14:53 +08:00
WeiguangHan
9722e811be
LLM: add more models to the arc perf test ( #9297 )
...
* LLM: add more models to the arc perf test
* remove some old models
* install some dependencies
2023-11-01 16:56:32 +08:00
Jasonzzt
3c3329010d
add conda update -n base conda
2023-11-01 16:36:35 +08:00
Jasonzzt
2fff0e8c21
use runner avx2 with linux
2023-11-01 16:28:29 +08:00
Jasonzzt
964a8e6dc1
update conda
2023-11-01 16:20:19 +08:00
Jasonzzt
cb7ef38e86
rerun
2023-11-01 15:30:34 +08:00
Jasonzzt
8f6e979fad
test again
2023-11-01 15:10:11 +08:00
Jasonzzt
b66584f23b
test
2023-11-01 14:51:23 +08:00
Jasonzzt
ba148ff3ff
test py311
2023-11-01 14:08:49 +08:00
Jasonzzt
6f1cee90a4
test
2023-11-01 13:58:03 +08:00
Jasonzzt
d51821e264
test
2023-11-01 13:49:32 +08:00
Jasonzzt
7c7a7f2ec1
spr & arc ut with python3,9&3.10&3.11
2023-11-01 13:17:13 +08:00
Jasonzzt
4f9fd0dffd
arc-ut with 3.10 & 3.11
2023-11-01 10:51:57 +08:00
Cengguang Zhang
d4ab5904ef
LLM: Add python 3.10 llm UT ( #9302 )
...
* add py310 test for llm-unit-test.
* add py310 llm-unit-tests
* add llm-cpp-build-py310
* test
* test
* test.
* test
* test
* fix deactivate.
* fix
* fix.
* fix
* test
* test
* test
* add build chatglm for win.
* test.
* fix
2023-11-01 10:15:32 +08:00
WeiguangHan
03aa368776
LLM: add the comparison between latest arc perf test and last one ( #9296 )
...
* add the comparison between latest test and last one to html
* resolve some comments
* modify some code logics
2023-11-01 09:53:02 +08:00
Cheen Hau, 俊豪
d638b93dfe
Add test script and workflow for qlora fine-tuning ( #9295 )
...
* Add test script and workflow for qlora fine-tuning
* Test fix export model
* Download dataset
* Fix export model issue
* Reduce number of training steps
* Rename script
* Correction
2023-11-01 09:39:53 +08:00
Yuwen Hu
21631209a9
[LLM] Skip CPU performance test for now ( #9291 )
...
* Skip llm cpu performance test for now
* Add install for wheel package
2023-10-27 12:55:04 +08:00
Ziteng Zhang
46ab0419b8
Merge pull request #9279 from Jasonzzt/main
...
Add bigdl-llm-finetune-cpu to manually_build to upload image on hub
2023-10-27 09:55:08 +08:00
Yuwen Hu
733df28a2b
[LLM] Migrate Arc UT to another runner ( #9286 )
...
* Separate arc llm ut to another runner
* Add dependency for einops
2023-10-26 19:08:57 +08:00
Ziteng Zhang
916ccc0779
Update manually_build_for_testing.yml
2023-10-26 16:26:14 +08:00
Ziteng Zhang
14a23015f8
Update manually_build.yml
2023-10-26 16:24:03 +08:00
Jasonzzt
37b1708d16
Add bigdl-llm-finetune-cpu to manually_build
2023-10-26 15:53:44 +08:00
Lilac09
4ed7f066d3
add bigdl-llm-finetune-xpu to manually_build ( #9278 )
2023-10-26 15:30:05 +08:00
Cheen Hau, 俊豪
ab40607b87
Enable unit test workflow on Arc ( #9213 )
...
* Add gpu workflow and a transformers API inference test
* Set device-specific env variables in script instead of workflow
* Fix status message
---------
Co-authored-by: sgwhat <ge.song@intel.com>
2023-10-25 15:17:18 +08:00
SONG Ge
160a1e5ee7
[WIP] Add UT for Mistral Optimized Model ( #9248 )
...
* add ut for mistral model
* update
* fix model path
* upgrade transformers version for mistral model
* refactor correctness ut for mustral model
* refactor mistral correctness ut
* revert test_optimize_model back
* remove mistral from test_optimize_model
* add to revert transformers version back to 4.31.0
2023-10-25 15:14:17 +08:00
WeiguangHan
ec9195da42
LLM: using html to visualize the perf result for Arc ( #9228 )
...
* LLM: using html to visualize the perf result for Arc
* deploy the html file
* add python license
* reslove some comments
2023-10-24 18:05:25 +08:00
Guancheng Fu
f37547249d
Refine README/CICD ( #9253 )
2023-10-24 12:56:03 +08:00
Guancheng Fu
9faa2f1eef
Fix bigdl-llm-serving-tdx image ( #9251 )
2023-10-24 10:49:35 +08:00
Guancheng Fu
6cb884d82d
Fix missing manually_build_for_testing entry ( #9245 )
2023-10-23 16:35:09 +08:00
Guancheng Fu
2ead3f7d54
add manually build ( #9244 )
2023-10-23 15:53:30 +08:00
WeiguangHan
f87f67ee1c
LLM: arc perf test for some popular models ( #9188 )
2023-10-19 15:56:15 +08:00
ZehuaCao
65dd73b62e
Update manually_build.yml ( #9138 )
...
* Update manually_build.yml
fix llm-serving-tdx image build dir
* Update manually_build.yml
2023-10-11 15:07:09 +08:00
Yuwen Hu
dc70fc7b00
Update performance tests for dependency of bigdl-core-xe-esimd ( #9124 )
2023-10-10 19:32:17 +08:00
Yuwen Hu
0e09dd926b
[LLM] Fix example test ( #9118 )
...
* Update llm example test link due to example layout change
* Add better change detect
2023-10-10 13:24:18 +08:00
Zhengjin Wang
0dbb3a283e
amend manually_build
2023-10-10 10:03:23 +08:00
Zhengjin Wang
bb3bb46400
add llm-serving-xpu on github action
2023-10-10 09:48:58 +08:00
Yuwen Hu
65212451cc
[LLM] Small update to performance tests ( #9106 )
...
* small updates to llm performance tests regarding model handling
* Small fix
2023-10-09 16:55:25 +08:00
ZehuaCao
aad68100ae
Add trusted-bigdl-llm-serving-tdx image. ( #9093 )
...
* add entrypoint in cpu serving
* kubernetes support for fastchat cpu serving
* Update Readme
* add image to manually_build action
* update manually_build.yml
* update README.md
* update manually_build.yaml
* update attestation_cli.py
* update manually_build.yml
* update Dockerfile
* rename
* update trusted-bigdl-llm-serving-tdx Dockerfile
2023-10-08 10:13:51 +08:00
ZehuaCao
b773d67dd4
Add Kubernetes support for BigDL-LLM-serving CPU. ( #9071 )
2023-10-07 09:37:48 +08:00
Lilac09
c91b2bd574
fix:modify indentation ( #9070 )
...
* modify Dockerfile
* add README.md
* add README.md
* Modify Dockerfile
* Add bigdl inference cpu image build
* Add bigdl llm cpu image build
* Add bigdl llm cpu image build
* Add bigdl llm cpu image build
* Modify Dockerfile
* Add bigdl inference cpu image build
* Add bigdl inference cpu image build
* Add bigdl llm xpu image build
* manually build
* recover file
* manually build
* recover file
* modify indentation
2023-09-27 14:53:52 +08:00
Lilac09
ecee02b34d
Add bigdl llm xpu image build ( #9062 )
...
* modify Dockerfile
* add README.md
* add README.md
* Modify Dockerfile
* Add bigdl inference cpu image build
* Add bigdl llm cpu image build
* Add bigdl llm cpu image build
* Add bigdl llm cpu image build
* Modify Dockerfile
* Add bigdl inference cpu image build
* Add bigdl inference cpu image build
* Add bigdl llm xpu image build
2023-09-26 14:29:03 +08:00
Lilac09
9ac950fa52
Add bigdl llm cpu image build ( #9047 )
...
* modify Dockerfile
* add README.md
* add README.md
* Modify Dockerfile
* Add bigdl inference cpu image build
* Add bigdl llm cpu image build
* Add bigdl llm cpu image build
* Add bigdl llm cpu image build
2023-09-26 13:22:11 +08:00
Yuwen Hu
c389e1323d
fix xpu performance tests by making sure that latest bigdl-core-xe is installed ( #9001 )
2023-09-19 17:33:30 +08:00
Wang Jian
7563b26ca9
Occlum fastchat build Use nocache and update order ( #8972 )
2023-09-14 14:05:15 +08:00
Yuwen Hu
ca35c93825
[LLM] Fix langchain UT ( #8929 )
...
* Change dependency version for langchain uts
* Downgrade pandas version instead; and update example readme accordingly
2023-09-08 13:51:04 +08:00
xingyuan li
704a896e90
[LLM] Add perf test on xpu for bigdl-llm ( #8866 )
...
* add xpu latency job
* update install way
* remove duplicated workflow
* add perf upload
2023-09-05 17:36:24 +09:00
xingyuan li
de6c6bb17f
[LLM] Downgrade amx build gcc version and remove avx flag display ( #8856 )
...
* downgrade to gcc 11
* remove avx display
2023-08-31 14:08:13 +09:00