[LLM] auto performance test fix specific settings to template (#8876)

This commit is contained in:
Song Jiaming 2023-09-01 15:49:04 +08:00 committed by GitHub
parent 242c9d6036
commit 7b3ac66e17
2 changed files with 1 additions and 4 deletions

View file

@ -1 +1 @@
llama2_path: model: /path/to/model

View file

@ -1,3 +0,0 @@
,model,1st token avg latency (ms/token),2+ avg latency (ms/token),input/output tokens
0,llama2,232.42,56.19,32/32
1,llama2,9465.57,68.67,1024/128
1 model 1st token avg latency (ms/token) 2+ avg latency (ms/token) input/output tokens
2 0 llama2 232.42 56.19 32/32
3 1 llama2 9465.57 68.67 1024/128