Skip to content

Commit

Permalink
Merge pull request #44 from ChanderG/bugfix-mock-dataset
Browse files Browse the repository at this point in the history
bugfix: make mock data loader usable
  • Loading branch information
aldopareja authored Jun 18, 2024
2 parents 25cea8d + 01a1060 commit 1cb03c5
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/instructlab/training/token_dataset.py
Original file line number Diff line number Diff line change
Expand Up @@ -63,7 +63,7 @@ def __getitem__(self, idx):
}

def get_lengths(self):
np.array([len(self.input_ids[0])] * len(self.input_ids))
return np.array([len(self.input_ids[0])] * len(self.input_ids))


def make_collate_fn(pad_token_id, is_granite=False, max_batch_len=60000):
Expand Down

0 comments on commit 1cb03c5

Please sign in to comment.