Wang, Jian4
|
d9f71f1f53
|
Update benchmark util for example using (#11027)
* mv benchmark_util.py to utils/
* remove
* update
|
2024-05-15 14:16:35 +08:00 |
|
Xiangyu Tian
|
02870dc385
|
LLM: Refine README of AutoTP-FastAPI example (#10960)
|
2024-05-08 16:55:23 +08:00 |
|
Xiangyu Tian
|
13a44cdacb
|
LLM: Refine Deepspped-AutoTP-FastAPI example (#10916)
|
2024-05-07 09:37:31 +08:00 |
|
Xiangyu Tian
|
3d4950b0f0
|
LLM: Enable batch generate (world_size>1) in Deepspeed-AutoTP-FastAPI example (#10876)
Enable batch generate (world_size>1) in Deepspeed-AutoTP-FastAPI example.
|
2024-04-26 13:24:28 +08:00 |
|
ZehuaCao
|
599a88db53
|
Add deepsped-autoTP-Fastapi serving (#10748)
* add deepsped-autoTP-Fastapi serving
* add readme
* add license
* update
* update
* fix
|
2024-04-16 14:03:23 +08:00 |
|