argonne-lcf dlio_benchmark issues

argonne-lcf / dlio_benchmark

An I/O benchmark for deep Learning applications

https://dlio-benchmark.readthedocs.io

Apache License 2.0

70 stars 30 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

upgrade pydftracer package

#242 rayandrew opened 2 weeks ago
0
Add user config to specify type of distribution of time configuration

#241 rayandrew opened 2 weeks ago
0
Change `max` to `abs` for preprocess time

#240 rayandrew closed 1 month ago
0
enable option to disable pin_memory in pytorch

#239 rayandrew closed 1 month ago
2
fix wrong tracing location of fetch data

#238 rayandrew closed 1 month ago
0
Fix wrong configuration for hdf5 chunking

#237 rayandrew closed 1 month ago
0
fix last step is not executed

#236 rayandrew closed 1 month ago
3
Fix last step is not executed with/without user specifying `total_training_steps`

#235 rayandrew closed 1 month ago
0
Issue with Accelerator Utilization in UNet3D for ( n =13 )

#234 userAmber opened 1 month ago
0
fix negative value of computation time when stdev exists

#233 rayandrew closed 1 month ago
0
Negative computation time when stdev of computation time is provided

#232 rayandrew closed 1 month ago
1
fix misleading generator message

#231 rayandrew closed 1 month ago
0
New improved modelling for LLM Deepspeed.

#230 hariharan-devarajan opened 1 month ago
4
Bugfix: fix type of number for offset and size

#229 hariharan-devarajan closed 1 month ago
1
Update overview.rst

#228 mannreis closed 1 month ago
0
Improve CI Performance.

#227 hariharan-devarajan closed 3 months ago
0
For sample indexing we fix the uneven sampling

#226 hariharan-devarajan closed 2 months ago
2
IndexError: list index out of range when running custom.yaml file with custom num_files_train parameter

#225 anrahman4 closed 2 months ago
5
Fix PyPI Publish Issue and Improve Project Metadata

#224 izzet closed 3 months ago
0
Fix missing import for chunking.

#223 hariharan-devarajan closed 3 months ago
0
Changing logging levels

#222 zhenghh04 opened 3 months ago
7
Redundant shuffling

#221 zhenghh04 opened 3 months ago
0
Fixed mocking for DFTracer

#220 hariharan-devarajan closed 3 months ago
1
Is computation_time only related to the GPU model?

#219 LLJING-ER opened 3 months ago
3
Adding version fix restricts matching on python 3.9 environment.

#218 hariharan-devarajan closed 3 months ago
0
OOM Fix

#217 zhenghh04 closed 3 months ago
0
Fixed iterator to only store data for that rank.

#216 hariharan-devarajan closed 3 months ago
0
Ignore file indexing for native data loader.

#215 hariharan-devarajan closed 3 months ago
0
Only intialize and finalize on DLIOMPI

#214 hariharan-devarajan closed 3 months ago
0
Bug fix for MPI and HDF5 workloads on LC clusters

#213 hariharan-devarajan closed 4 months ago
1
Fix README CI badge

#212 izzet closed 4 months ago
0
Publish on PyPI

#211 izzet closed 4 months ago
4
Refactor `setup.py` to Enable PyPI Publishing

#210 izzet closed 4 months ago
0
DFTracer CI environment variables fixed.

#209 izzet closed 5 months ago
0
Switch DLIO Profiler to DFTracer.

#208 hariharan-devarajan closed 4 months ago
1
Fixed the MPI initialization issue

#207 zhenghh04 closed 5 months ago
0
Mlperf storage v1.0

#206 zhenghh04 closed 5 months ago
0
sync up

#205 zhenghh04 closed 5 months ago
0
Fix requirements file

#204 johnugeorge closed 5 months ago
0
sync up mlperf_storage_v1.0

#203 zhenghh04 closed 5 months ago
0
Bring v1.0 to the most recent commit

#202 zhenghh04 closed 5 months ago
0
Mlperf requests

#201 zhenghh04 closed 5 months ago
0
Fixed potential insufficient samples due to num_files is not divisible by comm.size

#200 zhenghh04 closed 5 months ago
3
Request changes from MLPerf Storage

#199 zhenghh04 closed 5 months ago
0
Fix # of files not divisible by # of accelerators

#198 LouisDDN closed 5 months ago
6
Shard filenames instead of images (tfreader)

#197 LouisDDN closed 6 months ago
0
Update config.py

#196 zhenghh04 closed 6 months ago
0
Metric updates

#195 zhenghh04 opened 6 months ago
0
Improve tfreader parsing performance (batch)

#194 LouisDDN closed 6 months ago
3
Looks like the benchmark is not calculating client memory sizes correctly in the datasize command for scale out client testing

#193 danlchilton opened 6 months ago
0