Open TomAugspurger opened 3 years ago
Any chance you can recreate this without azure? I have no account to try to replicate this.
Mmm this this dataset should be public. Let me make sure I wasn't accidentally using some keys.
Yeah, I was able to reproduce on another machine without credentials.
Sorry, I thought specifying an "account_name" meant I would need an account. I can confirm the crash.
The cause is the lack of an async _cat_file
in AzureBlobFileSystem
. ReferenceFileSystem only works with async right now, and abfs is mostly async; but we implicitly require an async _cat_file(url, start, end) - and this ought not to use open/seek, but result in a direct call like https://github.com/dask/s3fs/blob/master/s3fs/core.py#L737 (note that start/end is directly encoded in the call headers).
presumably the hard crash happens because what would ordinarily be an ordinary stack exception is inside the (C) event loop.
Thanks for that info. I'll see if I can make an async cat_file
for adlfs.
Note the leading underscore in the method name
Is this fixed?
probably an issue with
fsspec.implementation.reference
, but I'm not sure.I'm trying this out on the Daymet Archive at https://azure.microsoft.com/en-us/services/open-datasets/catalog/daymet/.
I generate the offsets with
Works great. But when I try to read the file I get a stack overflow exception.
Looking into the implementation. I see that
cat_file
recursively callscat_file
. Putting in some debug,I see
Haven't looked any further than this.