fix bug in python benchmark script #1206

thevishalagarwal · 2025-01-29T13:59:30Z

Bug: if we use random token ids (with flag --use_random_token), decoding and encoding generates a different set of tokens which is not equal to the original set. This changes the number of prompt tokens and generates incorrect result during benchmarking. e.g.

original_tokens = np.random.randint(100, size=(1, 50))
prompt = tokenizer.decode(original_tokens )
new_tokens = tokenizer.encode(prompt)

Earlier the number of tokens was 50 but in new_tokens it may not be 50.

I have removed the step of encoding the prompt again to solve for this but the tokenization latency also need to be removed.
IMO, it is not an important metric as tokenizer is never a bottleneck and this easily resolves the above problem/bug.

Alternatively, if we do not use the --use_random_token, then the default way of generating tokens/prompt is very slow.

thevishalagarwal · 2025-02-12T08:28:53Z

@baijumeswani Can you please review this? Thanks!

thevishalagarwal added 2 commits January 29, 2025 19:17

remove tokenization metric and fix random tokens inconsistency

26c3b50

remove unwanted changes

b9c67ae

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix bug in python benchmark script #1206

fix bug in python benchmark script #1206

thevishalagarwal commented Jan 29, 2025 •

edited

Loading

thevishalagarwal commented Feb 12, 2025

fix bug in python benchmark script #1206

Are you sure you want to change the base?

fix bug in python benchmark script #1206

Conversation

thevishalagarwal commented Jan 29, 2025 • edited Loading

thevishalagarwal commented Feb 12, 2025

thevishalagarwal commented Jan 29, 2025 •

edited

Loading