Open andersy005 opened 4 years ago
The Dask performance reports show that they are almost the same! That's great!
BTW, I want to write a blog post about this. đ
So, FUSE needs to be installed for this to run. Does this work on Cheyenne? Can you install libfuse
on HPC systems?
BTW, I want to write a blog post about this. đ
Me too.
Can you install libfuse on HPC systems?
libfuse is available on conda-forge: https://anaconda.org/conda-forge/libfuse
Does this work on Cheyenne?
Haven't tested it yet.
Hmm. I tried installing libfuse with conda and it failed.
âšī¸
I think libfuse
is only installable on linux. And you need to install osxfuse on Mac by hand?
Hmm. I tried installing libfuse with conda and it failed.
đI didn't realize that the conda-forge version is linux only.
And you need to install osxfuse on Mac by hand?
I think so. I installed osxfuse on my local machine a while ago, and that's why I didn't even notice that FUSE was a dependency when I ran the notebook
Installing osxfuse is pretty easy and it works once installed. So, that's not too big of a deal. I wonder if installing ratarmount
on a linux system will install libfuse
automatically.
Some notes on testing this on Cheyenne:
The system install of libfuse
is old (version 2.9.3). It was not until version 2.9.9 (or version 3.3.0 for fuse3
) that support for Lustre and GPFS filesystems was added to libfuse
. So, you will get an error if you try ratarmount
with the system install of libfuse
.
If you use conda
to install the conda-forge libfuse
package (currently version 3.9.1), you need to tell the fusepy
package where to find this new library by setting the FUSE_LIBRARY_PATH
environment variable equal to the absolute path to the libfuse3.so
file.
Currently, the libfuse
conda-forge package does not install the fusermount3
utility function needed by fusepy
. As a result, if you do the above, you still encounter an error:
fuse: failed to exec fusermount3: No such file or directory
I have opened an issue in the libfuse-feedstock
repo to ask if they can build the fusermount3
utility with the libfuse
package:
Thanks a lot @kmpaul and @andersy005 Great to see just talking about it in the evening, and waking up in Europe, work is done! I'll check on the HPC site here if there is libfuse installed in the right version for lustre/gpfs.
I just wanted to chime in here and give an update.
The libfuse
issue has stalled, and I suspect it will not get completed. It turns out is was substantially more complicated than I previously thought. I've asked out sysadmins to install an updated version of libfuse on our system, but they chose not to do it. So, we're stuck testing this on Cheyenne.
@kmpaul & @tinaok
I just put together a simple notebook demonstrating how to mount the tar archive via ratarmount. I then ran some smoke tests to see what the performance looks like:
Native/Original netCDF files
netCDF files in the tar archive
Here's a rendered version of the notebook: https://nbviewer.jupyter.org/github/NCAR/xtar/blob/master/notebooks/tar-netcdf.ipynb. I included dask's performance report in the notebook itself.