Skip to content

Commit

Permalink
fix vocab_size
Browse files Browse the repository at this point in the history
  • Loading branch information
lzhangzz committed Oct 26, 2023
1 parent c50f1cc commit 4294bff
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/turbomind/models/llama/LlamaBatch.cc
Original file line number Diff line number Diff line change
Expand Up @@ -525,7 +525,7 @@ bool LlamaBatch<T>::generate()
decoder_output_buf_,
batch_size_);

CheckValues(logits_buf_, batch_size_ * llama_->vocab_size_, "post_decode_embedding", stream_);
CheckValues(logits_buf_, batch_size_ * llama_->vocab_size_padded_, "post_decode_embedding", stream_);

// stop-words & bad-words require the matched tokens to be contiguous, so item size > 1 is
// not supported yet.
Expand Down

0 comments on commit 4294bff

Please sign in to comment.