Closed esantamariavazquez closed 2 years ago
I just discovered that I can speedup deepdish disabling the compression with:
dd.io.save('test.h5', {'data': data}, None)
Sorry for the inconvenience, issue closed!
Glad you discovered the solution! Yes, it is quite a slow compression that is used as default. In <=3.2.0, the default was the much faster Blosc (CHANGELOG.rst). As the changelog suggests, it had some interoperability issues. It was a key requirement that the files saved can (by default) be loaded across any combination of Python 2/3, and Mac/Linux/Windows (e.g. Saved in Python 3 on Mac, load on Python 2 on Linux). I don't remember exactly which combinations Blosc failed at, but I do remember it did. It's too bad, because it is much fast. If interoperability is not a concern for you, Blosc is an excellent choice.
More info here: https://www.pytables.org/usersguide/optimization.html
Hi guys!
I just came accross deepdish and I really love it, thank you very much for this great work!
However, I've noticed that there is a speed problem compared to h5py. Here is a very simple piece of code that shows it:
In my computer:
I know that the strong point of deepdish is its capability to save complex data (dicts and so on), and perhaps performance is not the key goal here. But still, I think it would be great to achieve similar speed, especially when no complex data is involved.
Whant do you think? It can be done with some optimization?
Cheers, Eduardo