Cengguang Zhang
|
3d2efe9608
|
LLM: update llm latency benchmark. (#8922)
|
2023-09-07 19:00:19 +08:00 |
|
binbin Deng
|
7897eb4b51
|
LLM: add benchmark scripts on GPU (#8916)
|
2023-09-07 18:08:17 +08:00 |
|
Xin Qiu
|
d8a01d7c4f
|
fix chatglm in run.pu (#8919)
|
2023-09-07 16:44:10 +08:00 |
|
Xin Qiu
|
e9de9d9950
|
benchmark for native int4 (#8918)
* native4
* update
* update
* update
|
2023-09-07 15:56:15 +08:00 |
|
Xin Qiu
|
5d9942a3ca
|
transformer int4 and native int4's benchmark script for 32 256 1k 2k input (#8871)
* transformer
* move
* update
* add header
* update all-in-one
* clean up
|
2023-09-07 09:49:55 +08:00 |
|
Song Jiaming
|
c06f1ca93e
|
[LLM] auto perf test to output to csv (#8846)
|
2023-09-01 10:48:00 +08:00 |
|