Ziteng Zhang
|
8b08ad408b
|
Add batch_size in all_in_one (#9999)
Add batch_size in all_in_one, except run_native_int4
|
2024-01-25 17:43:49 +08:00 |
|
Ziteng Zhang
|
4f4ce73f31
|
[LLM] Add transformer_autocast_bf16 into all-in-one (#9890)
* Add transformer_autocast_bf16 into all-in-one
|
2024-01-11 17:51:07 +08:00 |
|
Yuwen Hu
|
48b85593b3
|
Update all-in-one benchmark readme (#9618)
|
2023-12-07 10:32:09 +08:00 |
|
Cheen Hau, 俊豪
|
3e39828420
|
Update all in one benchmark readme (#9496)
* Add gperftools install to all in one benchmark readme
* Update readme
|
2023-11-21 14:57:16 +08:00 |
|
Ruonan Wang
|
7e73c354a6
|
LLM: decoupling bigdl-llm and bigdl-nano (#9306)
|
2023-11-01 11:00:54 +08:00 |
|
Ruonan Wang
|
4f34557224
|
LLM: support num_beams in all-in-one benchmark (#9141)
* support num_beams
* fix
|
2023-10-12 13:35:12 +08:00 |
|
Ruonan Wang
|
ad7d9231f5
|
LLM: add benchmark script for Max gpu and ipex fp16 gpu (#9112)
* add pvc bash
* meet code review
* rename to run-max-gpu.sh
|
2023-10-10 10:18:41 +08:00 |
|
Kai Huang
|
78ea7ddb1c
|
Combine apply_rotary_pos_emb for gpt-neox (#9074)
|
2023-10-07 16:27:46 +08:00 |
|
Cengguang Zhang
|
ad62c58b33
|
LLM: Enable jemalloc in benchmark scripts. (#9058)
* enable jemalloc.
* fix readme.
|
2023-09-26 15:37:49 +08:00 |
|
Kai Huang
|
6981745fe4
|
Optimize kv_cache for gpt-neox model family (#9015)
* override gptneox
* style
* move to utils
* revert
|
2023-09-20 19:59:19 +08:00 |
|
Cengguang Zhang
|
8299b68fea
|
update readme. (#8996)
|
2023-09-18 17:06:15 +08:00 |
|
Cengguang Zhang
|
cca84b0a64
|
LLM: update llm benchmark scripts. (#8943)
* update llm benchmark scripts.
* change tranformer_bf16 to pytorch_autocast_bf16.
* add autocast in transformer int4.
* revert autocast.
* add "pytorch_autocast_bf16" to doc
* fix comments.
|
2023-09-13 12:23:28 +08:00 |
|
Cengguang Zhang
|
3d2efe9608
|
LLM: update llm latency benchmark. (#8922)
|
2023-09-07 19:00:19 +08:00 |
|
binbin Deng
|
7897eb4b51
|
LLM: add benchmark scripts on GPU (#8916)
|
2023-09-07 18:08:17 +08:00 |
|
Song Jiaming
|
c06f1ca93e
|
[LLM] auto perf test to output to csv (#8846)
|
2023-09-01 10:48:00 +08:00 |
|