issues
search
mosaicml
/
streaming
A Data Streaming Library for Efficient Neural Network Training
https://streaming.docs.mosaicml.com
Apache License 2.0
1.01k
stars
125
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Upgrade ci_testing, remove codeql
#714
snarayan21
closed
10 hours ago
0
enable adaptive retry for s3 download
#713
bigning
closed
15 hours ago
0
Remove duplicate `dbfs:` prefix from error message
#712
vanshcsingh
closed
4 days ago
0
Add HF File System Support to Streaming
#711
orionw
opened
4 days ago
6
Bump pytest-split from 0.8.2 to 0.9.0
#710
dependabot[bot]
closed
4 days ago
1
Optional dependency for different storages?
#709
huxuan
opened
5 days ago
2
fix convert imagenet
#708
Hprairie
closed
1 week ago
0
AttributeError when trying to convert Imagenet1k
#707
Hprairie
closed
1 week ago
3
Fix `drop_first` checking in partitioning to account for `world_size` divisibility
#706
snarayan21
closed
1 week ago
0
Fix linting issues with numpy 2
#705
snarayan21
closed
1 week ago
0
Bump pydantic from 2.7.3 to 2.7.4
#704
dependabot[bot]
closed
1 week ago
1
Error writing to databricks UC volume
#703
JK87iab
closed
1 week ago
5
Fix edge cases with scalar or empty numpy array encoding
#702
snarayan21
closed
2 weeks ago
0
Raise IndexError in `Spanner` object instead of `ValueError`
#701
snarayan21
closed
2 weeks ago
1
Enable correct resumption from the end of an epoch
#700
snarayan21
closed
1 week ago
0
[QUESTION] Ask about some detailed questions regarding the shuffle algorithm in official website.
#699
yanghua
closed
2 weeks ago
3
Different batch_size for different streams
#698
huxuan
closed
2 weeks ago
2
Bump pytest from 8.2.1 to 8.2.2
#697
dependabot[bot]
closed
2 weeks ago
1
Bump pydantic from 2.7.2 to 2.7.3
#696
dependabot[bot]
closed
2 weeks ago
0
Handle zero-sized ndarray more gracefully
#695
huxuan
closed
2 weeks ago
1
fix: expand user path for Writer's output directory.
#694
huxuan
closed
2 weeks ago
1
Make sure epoch_size is an int
#693
snarayan21
closed
3 weeks ago
0
Bump pydantic from 2.7.1 to 2.7.2
#692
dependabot[bot]
closed
3 weeks ago
1
Bump uvicorn from 0.29.0 to 0.30.1
#691
dependabot[bot]
closed
3 weeks ago
1
DeltaTorch Compatability?
#690
rangi513
opened
4 weeks ago
3
Bug that causes FileExistsError in shm
#689
Shade5
closed
4 weeks ago
6
Warning condition changed for Sequence Parallelism
#688
XiaohanZhangCMU
closed
1 month ago
0
Bump databricks-sdk from 0.27.1 to 0.28.0
#687
dependabot[bot]
closed
3 weeks ago
2
Suboptimal usage of 8xH100 GPUs - Streaming dataloader speed significantly fluctuates across batches
#686
VSehwag
opened
1 month ago
4
Fix node calculation in `replication` for `World` object
#685
snarayan21
closed
1 month ago
0
Heterogeneous
#684
XiaohanZhangCMU
opened
1 month ago
0
Improve local temp directory error when only `remote` is specified
#683
snarayan21
closed
1 month ago
4
Fix `batch_size` typo for `Stream` object in docs
#682
snarayan21
closed
1 month ago
0
Update CODEOWNERS
#681
karan6181
closed
1 month ago
0
Bump pytest from 8.2.0 to 8.2.1
#680
dependabot[bot]
closed
1 month ago
1
Bump databricks-sdk from 0.27.0 to 0.27.1
#679
dependabot[bot]
closed
1 month ago
2
Reading all formats (parquet, csv, tsv, json) etc natively without conversion steps
#678
abhijithneilabraham
closed
1 month ago
2
Last entry in the dataset is causing "Relative sample index $x is not present" error
#677
isidentical
opened
1 month ago
2
Using minio with StreamingDataset
#676
abhijithneilabraham
closed
1 month ago
1
Update platform references
#675
aspfohl
closed
1 month ago
1
Use IndexError instead of ValueError in __getitem__
#674
keaganlong
closed
2 weeks ago
1
Helpful error on `py1e` for improperly written datasets
#673
snarayan21
closed
1 month ago
0
Ensure shards cannot be larger than 4GB
#672
snarayan21
closed
1 month ago
0
Shard maximum size should be 4GB for MDS
#671
smspillaz
closed
1 month ago
1
Bump fastapi from 0.110.2 to 0.111.0
#670
dependabot[bot]
closed
1 month ago
1
Version bump to v0.7.6
#669
snarayan21
closed
1 month ago
0
Fix: having zero bytes files after converting spark dataframe to MDS saved on dbfs:/Volumes
#668
XiaohanZhangCMU
closed
1 month ago
2
Bump databricks-sdk from 0.23.0 to 0.27.0
#667
dependabot[bot]
closed
1 month ago
1
Bump pydantic from 2.7.0 to 2.7.1
#666
dependabot[bot]
closed
1 month ago
3
Bump databricks-sdk from 0.23.0 to 0.26.0
#665
dependabot[bot]
closed
1 month ago
1
Next