Skip to content

Commit

Permalink
Merge pull request #255 from Modalities/megatron_dataset_removal
Browse files Browse the repository at this point in the history
refactor: removed legacy megatron dataset code
  • Loading branch information
le1nux authored Sep 17, 2024
2 parents 8158de7 + 52172be commit ebb75bc
Show file tree
Hide file tree
Showing 3 changed files with 0 additions and 169 deletions.
77 changes: 0 additions & 77 deletions benchmarks/dataloader/README.md

This file was deleted.

87 changes: 0 additions & 87 deletions benchmarks/dataloader/launch_benchmark.sh

This file was deleted.

5 changes: 0 additions & 5 deletions src/modalities/config/config.py
Original file line number Diff line number Diff line change
Expand Up @@ -285,11 +285,6 @@ class PackedMemMapDatasetMegatronConfig(BaseModel):
sample_key: str


class MMapIndexedDatasetConfig(BaseModel):
path: Path
skip_warmup: bool


class BatchSamplerConfig(BaseModel):
sampler: PydanticSamplerIFType
batch_size: Annotated[int, Field(strict=True, gt=0)]
Expand Down

0 comments on commit ebb75bc

Please sign in to comment.