-
Notifications
You must be signed in to change notification settings - Fork 154
Issues: mosaicml/streaming
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Streaming Dataset Simulator stops working beyond 10,000 batches
bug
Something isn't working
#877
opened Feb 12, 2025 by
mcrchopra
NCCL Timeouts from Adding More Datasets + Proportion Sampling
bug
Something isn't working
#876
opened Feb 11, 2025 by
schopra8
[Bug] Incorrect local file path in HfUploader
bug
Something isn't working
#875
opened Feb 9, 2025 by
Abhinay1997
FileNotFoundError: [Errno 2] No such file or directory: '/tmp/mds_data/train/index.json'
#873
opened Feb 7, 2025 by
jitendra986
dataframe_to_mds fails with non-nullable ArrayType
bug
Something isn't working
#870
opened Jan 30, 2025 by
sthudium25
Facing issue with resuming training for saved dataset state (>1 epoch)
bug
Something isn't working
#869
opened Jan 28, 2025 by
rodosingh
Facing issue with improper file renaming while multi-node training in ROCm
bug
Something isn't working
#868
opened Jan 27, 2025 by
rodosingh
how about read speed when training, just save on local. compare to use pytorch dataloader
#864
opened Jan 20, 2025 by
yja1
Error in Streaming Dataset Decompression in Distributed Setting
bug
Something isn't working
#863
opened Jan 15, 2025 by
jasonkrone
Expose StreamingDataset world (or world_size and rank) as argument
enhancement
New feature or request
#854
opened Dec 19, 2024 by
lukemelas
Will cache eviction logic take previously-existing shards into account?
#844
opened Dec 5, 2024 by
jamin-chen
Pipeline Parallelism (Supported? How to?)
enhancement
New feature or request
#827
opened Nov 14, 2024 by
casper-hansen
UnicodeDecodeError: ... Efficient way to debug the dataset with streaming?
enhancement
New feature or request
#820
opened Nov 1, 2024 by
TAYmit
Choose JPEG compression level
enhancement
New feature or request
#811
opened Oct 24, 2024 by
cabreraalex
Support for on-the-fly filtering
enhancement
New feature or request
#800
opened Oct 9, 2024 by
ColinToft
Make New feature or request
epoch_sample_ids
cachable
enhancement
#792
opened Sep 28, 2024 by
janEbert
Dataset does not work after stopping training
bug
Something isn't working
#781
opened Sep 15, 2024 by
gluonfield
JointWriter: Allow shard file appending
bug
Something isn't working
#775
opened Sep 5, 2024 by
janEbert
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.