Closed yarikoptic closed 2 years ago
On https://unix.stackexchange.com/questions/67568/mount-http-server-as-file-system found mentioning of other FUSE based solutions
NB will edit here later when get to try avfs
@bendichter do I have a correct memory in that you seemed mentioned that you had some progress in establishing "online" access to HDF5 files? If so, could you please reference here to anything along those efforts?
@yarikoptic Yes, check out HDF5Zarr. I'd be happy to walk you through it
yep yep, range requests work, used by HDF5 ROS3, fsspec, and using it datalad-fuse, etc. FOI could be closed
We have touched upon this topic a few times -- shouldn't it be possible to access hdf5 files over the network (i.e. over http) since http does support range requests and for basic querying we would need just a bit of binary blobs from there and there, and even for access to specific datasets it should be feasible to get just needed ranges. So, I apt-get installed
httpfs2
which is probably a descendant of even older httpfs which had last (or may be first as well? ;)) vital signs in 2006.So, here is a brief attempt:
ifconfig was just to provide crude assessment of how much traffic there was (nsntrace failed to help), while I was
dandi ls
'ing that mounted file. It took a while:which seems to cause no more than 21MB (comparing ifconfig outputs ;)) of traffic (the full size of the file > 400MB). Sure thing it is slower from running on a local copy (
dandi ls 7.32s user 0.14s system 100% cpu 7.452 total
) but caused less traffic than getting a full copy (took 1:08.03 total ;) ).Well, we already know that pynwb is a bit "too non-lazy" which causes possibly some traffic which could have been avoided in this particular case. But it also hints on possibility to use access over regular http for at least some use cases with .nwb files.
Unfortunately httpfs2 had some indigestion for me to try it on https:// urls to buzsaki's data with Gbs in size.