issues
search
argonne-lcf
/
dlio_benchmark
An I/O benchmark for deep Learning applications
https://dlio-benchmark.readthedocs.io
Apache License 2.0
65
stars
30
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Change `max` to `abs` for preprocess time
#240
rayandrew
closed
1 week ago
0
enable option to disable pin_memory in pytorch
#239
rayandrew
closed
1 week ago
2
fix wrong tracing location of fetch data
#238
rayandrew
closed
1 week ago
0
Fix wrong configuration for hdf5 chunking
#237
rayandrew
closed
3 weeks ago
0
fix last step is not executed
#236
rayandrew
closed
3 weeks ago
3
Fix last step is not executed with/without user specifying `total_training_steps`
#235
rayandrew
closed
3 weeks ago
0
Issue with Accelerator Utilization in UNet3D for ( n =13 )
#234
userAmber
opened
3 weeks ago
0
fix negative value of computation time when stdev exists
#233
rayandrew
closed
3 weeks ago
0
Negative computation time when stdev of computation time is provided
#232
rayandrew
closed
3 weeks ago
1
fix misleading generator message
#231
rayandrew
closed
3 weeks ago
0
New improved modelling for LLM Deepspeed.
#230
hariharan-devarajan
opened
1 month ago
4
Bugfix: fix type of number for offset and size
#229
hariharan-devarajan
closed
3 weeks ago
1
Update overview.rst
#228
mannreis
closed
1 week ago
0
Improve CI Performance.
#227
hariharan-devarajan
closed
2 months ago
0
For sample indexing we fix the uneven sampling
#226
hariharan-devarajan
closed
1 month ago
2
IndexError: list index out of range when running custom.yaml file with custom num_files_train parameter
#225
anrahman4
closed
1 month ago
5
Fix PyPI Publish Issue and Improve Project Metadata
#224
izzet
closed
2 months ago
0
Fix missing import for chunking.
#223
hariharan-devarajan
closed
2 months ago
0
Changing logging levels
#222
zhenghh04
opened
2 months ago
7
Redundant shuffling
#221
zhenghh04
opened
2 months ago
0
Fixed mocking for DFTracer
#220
hariharan-devarajan
closed
2 months ago
1
Is computation_time only related to the GPU model?
#219
LLJING-ER
opened
2 months ago
3
Adding version fix restricts matching on python 3.9 environment.
#218
hariharan-devarajan
closed
2 months ago
0
OOM Fix
#217
zhenghh04
closed
3 months ago
0
Fixed iterator to only store data for that rank.
#216
hariharan-devarajan
closed
2 months ago
0
Ignore file indexing for native data loader.
#215
hariharan-devarajan
closed
2 months ago
0
Only intialize and finalize on DLIOMPI
#214
hariharan-devarajan
closed
2 months ago
0
Bug fix for MPI and HDF5 workloads on LC clusters
#213
hariharan-devarajan
closed
3 months ago
1
Fix README CI badge
#212
izzet
closed
3 months ago
0
Publish on PyPI
#211
izzet
closed
3 months ago
4
Refactor `setup.py` to Enable PyPI Publishing
#210
izzet
closed
4 months ago
0
DFTracer CI environment variables fixed.
#209
izzet
closed
4 months ago
0
Switch DLIO Profiler to DFTracer.
#208
hariharan-devarajan
closed
3 months ago
1
Fixed the MPI initialization issue
#207
zhenghh04
closed
4 months ago
0
Mlperf storage v1.0
#206
zhenghh04
closed
4 months ago
0
sync up
#205
zhenghh04
closed
4 months ago
0
Fix requirements file
#204
johnugeorge
closed
4 months ago
0
sync up mlperf_storage_v1.0
#203
zhenghh04
closed
5 months ago
0
Bring v1.0 to the most recent commit
#202
zhenghh04
closed
5 months ago
0
Mlperf requests
#201
zhenghh04
closed
5 months ago
0
Fixed potential insufficient samples due to num_files is not divisible by comm.size
#200
zhenghh04
closed
5 months ago
3
Request changes from MLPerf Storage
#199
zhenghh04
closed
5 months ago
0
Fix # of files not divisible by # of accelerators
#198
LouisDDN
closed
5 months ago
6
Shard filenames instead of images (tfreader)
#197
LouisDDN
closed
5 months ago
0
Update config.py
#196
zhenghh04
closed
5 months ago
0
Metric updates
#195
zhenghh04
opened
5 months ago
0
Improve tfreader parsing performance (batch)
#194
LouisDDN
closed
5 months ago
3
Looks like the benchmark is not calculating client memory sizes correctly in the datasize command for scale out client testing
#193
danlchilton
opened
5 months ago
0
reduced tensorflow version
#192
zhenghh04
closed
5 months ago
0
Packaging
#191
zhenghh04
closed
5 months ago
0
Next