Skip to content

Commit

Permalink
cast before decoding candidate response tensors back to string
Browse files Browse the repository at this point in the history
  • Loading branch information
lucidrains committed Jan 25, 2024
1 parent 5fbe322 commit 67922a1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion self_rewarding_lm_pytorch/self_rewarding_lm_pytorch.py
Original file line number Diff line number Diff line change
Expand Up @@ -427,7 +427,7 @@ def forward(self) -> DPODataset:
)
)

candidate_responses: List[str] = [*map(self.tokenizer_decode, candidate_responses_tensor)]
candidate_responses: List[str] = [*map(self.tokenizer_decode, candidate_responses_tensor.long().tolist())]

# get rewards

Expand Down

0 comments on commit 67922a1

Please sign in to comment.