uber / petastorm

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Apache License 2.0
1.78k stars 285 forks source link

tf_dataset: add unit test to verify repeat() works after cache() #674

Closed chongxiaoc closed 3 years ago

chongxiaoc commented 3 years ago

We can cache first iteration result then use repeat() to do multiple epochs.

codecov[bot] commented 3 years ago

Codecov Report

Merging #674 (ba7e23d) into master (211600b) will not change coverage. The diff coverage is n/a.

Impacted file tree graph

@@           Coverage Diff           @@
##           master     #674   +/-   ##
=======================================
  Coverage   85.79%   85.79%           
=======================================
  Files          84       84           
  Lines        4928     4928           
  Branches      779      779           
=======================================
  Hits         4228     4228           
  Misses        561      561           
  Partials      139      139           
Impacted Files Coverage Δ
petastorm/tf_utils.py 85.10% <ø> (ø)

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update 211600b...ba7e23d. Read the comment docs.