Skip to content

Commit

Permalink
fixed comments
Browse files Browse the repository at this point in the history
  • Loading branch information
Edwinhr716 committed Aug 31, 2024
1 parent abfc66a commit e7278e8
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -25,6 +25,6 @@ for request_rate in $(echo $REQUEST_RATES | tr ',' ' '); do
output_file="latency-profile-${timestamp}.txt"
python3 benchmark_serving.py --host="$IP" --port="$PORT" --model="$TOKENIZER" --dataset=ShareGPT_V3_unfiltered_cleaned_split.json --tokenizer="$TOKENIZER" --request-rate=$request_rate --backend="$BACKEND" --num-prompts=$((request_rate * 30)) --max-input-length=$INPUT_LENGTH --max-output-length=$OUTPUT_LENGTH > $output_file
cat $output_file
sleep 5 # wait 5 minutes before next run
sleep 5 # wait 5 seconds before next run
done

0 comments on commit e7278e8

Please sign in to comment.