Closed ghost closed 8 years ago
Unfortunately, preprocesor-shards.py
still lags behind in terms of features due to heavy code duplication with preprocess.lua
. In the mean time, you can use the updated implementation from @mdasadul:
See #49.
Thanks!
Due to large training dataset I had to use the preprocess_shards in order to split it. When running the train.lua i get the following error: loading data... /home/sergio/torch/install/bin/luajit: /home/sergio/torch/install/share/lua/5.1/hdf5/group.lua:312: HDF5Group:read() - no such child 'num_source_features' for [HDF5Group 33554432 /] Seems like 'num_source_features' is used in preprocess but not in shards. Could you please advice? Thanks