googleinterns / amaranth

Apache License 2.0
2 stars 0 forks source link

Add dtypes to CSV reading code #5

Open tommylau-exe opened 4 years ago

tommylau-exe commented 4 years ago

Expected Behavior

CSV reading code tells Tensorflow what datatypes to expect for each column of every CSV file.

Actual Behavior

CSV reading code doesn't provide this information to Tensorflow, resulting in a slowdown for CSV reading code and unnecessary memory usage.

Steps to Reproduce the Problem

  1. Switch branch to model-dev
  2. Run ml/main.py
  3. Observe warning sys:1: DtypeWarning: Columns (9) have mixed types.Specify dtype option on import or set low_memory=False.