Author | Commit | Subject (squashed commit body indented below) | Date
ZehuaCao | 51aa8b62b2 | add gradio_web_ui to llm-serving image (#9918) | 2024-01-25 11:11:39 +08:00
Lilac09 | de27ddd81a | Update Dockerfile (#9981) | 2024-01-24 11:10:06 +08:00
Lilac09 | a2718038f7 | Fix qwen model adapter in docker (#9969) | 2024-01-24 11:01:29 +08:00
    * fix qwen in docker
    * add patch for model_adapter.py in fastchat
    * add patch for model_adapter.py in fastchat
Lilac09 | 052962dfa5 | Using original fastchat and add bigdl worker in docker image (#9967) | 2024-01-23 14:17:05 +08:00
    * add vllm worker
    * add options in entrypoint
ZehuaCao | 05ea0ecd70 | add pv for llm-serving k8s deployment (#9906) | 2024-01-16 11:32:54 +08:00
Lilac09 | 2554ba0913 | Add usage of vllm (#9564) | 2023-11-30 14:19:23 +08:00
    * add usage of vllm
    * add usage of vllm
    * add usage of vllm
    * add usage of vllm
    * add usage of vllm
    * add usage of vllm
Lilac09 | 557bb6bbdb | add judgement for running serve (#9555) | 2023-11-29 16:57:00 +08:00
Guancheng Fu | 2b200bf2f2 | Add vllm_worker related arguments in docker serving image's entrypoint (#9500) | 2023-11-21 14:41:06 +08:00
    * fix entrypoint
    * fix missing long mode argument
Lilac09 | 566ec85113 | add stream interval option to entrypoint (#9498) | 2023-11-21 09:47:32 +08:00
Lilac09 | 13f6eb77b4 | Add exec bash to entrypoint.sh to keep container running after being booted. (#9471) | 2023-11-15 16:09:16 +08:00
    * add bigdl-llm-init
    * boot bash
Lilac09 | 24146d108f | add bigdl-llm-init (#9468) | 2023-11-15 14:55:33 +08:00
Lilac09 | b2b085550b | Remove bigdl-nano and add ipex into inference-cpu image (#9452) | 2023-11-14 10:50:52 +08:00
    * remove bigdl-nano and add ipex into inference-cpu image
    * remove bigdl-nano in docker
    * remove bigdl-nano in docker
Shaojun Liu | 0e5ab5ebfc | update docker tag to 2.5.0-SNAPSHOT (#9443) | 2023-11-13 16:53:40 +08:00
Lilac09 | 2c2bc959ad | add tools into previously built images (#9317) | 2023-10-31 16:35:18 +08:00
    * modify Dockerfile
    * manually build
    * modify Dockerfile
    * add chat.py into inference-xpu
    * add benchmark into inference-cpu
    * manually build
    * add benchmark into inference-cpu
    * add benchmark into inference-cpu
    * add benchmark into inference-cpu
    * add chat.py into inference-xpu
    * add chat.py into inference-xpu
    * change ADD to COPY in dockerfile
    * fix dependency issue
    * temporarily remove run-spr in llm-cpu
    * temporarily remove run-spr in llm-cpu
Guancheng Fu | 7f66bc5c14 | Fix bigdl-llm-serving-cpu Dockerfile (#9247) | 2023-10-23 16:51:30 +08:00
Shaojun Liu | 9dc76f19c0 | fix hadolint error (#9223) | 2023-10-19 16:22:32 +08:00
Guancheng Fu | df8df751c4 | Modify readme for bigdl-llm-serving-cpu (#9105) | 2023-10-09 09:56:09 +08:00
ZehuaCao | b773d67dd4 | Add Kubernetes support for BigDL-LLM-serving CPU. (#9071) | 2023-10-07 09:37:48 +08:00
Guancheng Fu | cc84ed70b3 | Create serving images (#9048) | 2023-09-25 15:51:45 +08:00
    * Finished & Tested
    * Install latest pip from base images
    * Add blank line
    * Delete unused comment
    * fix typos