thomas-a-neil closed 4 years ago
This should also help with https://github.com/songlab-cal/tape/issues/8
Should we merge this? I don't think the rinokeras changes have been merged to master?
It depends on the rinokeras changes, so I don't think we can merge it yet.
Closing since both this and rinokeras are in basic maintenance mode now, so no major changes will be made.
This builds on https://github.com/CannyLab/rinokeras/pull/12. As it stands, the data consolidation step reads the entire output dataset into memory, which crashes even for relatively small datasets if we include all encoder outputs, especially for the LSTM.
HDF5 lets us write the outputs iteratively and avoids the memory overhead of pickle.
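
For reference, a minimal sketch of the iterative-write idea using h5py. This is not the code in this PR: the function name `write_batches_to_hdf5`, the dataset name `encoder_output`, the fixed `hidden_dim`, and the `fake_batches` generator are all hypothetical and only stand in for the real model outputs. It assumes each batch is a 2-D float array with the same second dimension.

```python
import os
import tempfile

import h5py
import numpy as np


def write_batches_to_hdf5(path, batches, hidden_dim, name="encoder_output"):
    """Append batches of shape (n, hidden_dim) to one resizable dataset.

    Only the current batch is held in memory; rows are flushed to disk
    as the dataset grows along axis 0.
    """
    with h5py.File(path, "w") as f:
        dset = f.create_dataset(
            name,
            shape=(0, hidden_dim),
            maxshape=(None, hidden_dim),  # unlimited along axis 0
            dtype="float32",
            chunks=True,  # resizable datasets must be chunked
        )
        for batch in batches:
            batch = np.asarray(batch, dtype="float32")
            start = dset.shape[0]
            dset.resize(start + batch.shape[0], axis=0)
            dset[start:] = batch
    return path


def fake_batches(num_batches=3, batch_size=4, hidden_dim=8):
    """Hypothetical stand-in for model outputs: batch i is filled with i."""
    for i in range(num_batches):
        yield np.full((batch_size, hidden_dim), i, dtype="float32")


out_path = os.path.join(tempfile.mkdtemp(), "outputs.h5")
write_batches_to_hdf5(out_path, fake_batches(), hidden_dim=8)

# Read back to check that every batch was appended in order.
with h5py.File(out_path, "r") as f:
    data = f["encoder_output"][:]

total_rows = data.shape[0]
last_row_value = float(data[-1, 0])
```

With pickle, the whole list of outputs has to exist in memory before it can be dumped; here the peak memory is roughly one batch. Variable-length sequence outputs would need padding or one dataset per example, which this sketch does not cover.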
Upon reflection, the documentation should probably be updated as well, because I think we reference pickle a few times.