Skip to content

Commit

Permalink
readded comment
Browse files Browse the repository at this point in the history
  • Loading branch information
Edwinhr716 committed Aug 31, 2024
1 parent b982e4a commit cd95fd1
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ export IP=$IP
huggingface-cli login --token "$HF_TOKEN" --add-to-git-credential

for request_rate in $(echo $REQUEST_RATES | tr ',' ' '); do

# TODO: Check if profile already exists, if so then skip
timestamp=$(date +"%Y-%m-%d_%H-%M-%S")
output_file="latency-profile-${timestamp}.txt"
python3 benchmark_serving.py --host="$IP" --port="$PORT" --model="$TOKENIZER" --dataset=ShareGPT_V3_unfiltered_cleaned_split.json --tokenizer="$TOKENIZER" --request-rate=$request_rate --backend="$BACKEND" --num-prompts=$((request_rate * 30)) --max-input-length=$INPUT_LENGTH --max-output-length=$OUTPUT_LENGTH > $output_file
Expand Down

0 comments on commit cd95fd1

Please sign in to comment.