NERC-CEH / plankton_ml

A project for image processing and analysis pipelines for plankton sampling
GNU General Public License v3.0

Reading an image collection from s3 with intake #3

Open metazool opened 1 month ago

metazool commented 1 month ago

All the scivision examples, including the CEFAS plankton one, seem to keep their image collections in a single zipfile.

In theory you can use a wildcard in a url_path, but for me this throws an s3 error on the directory listing. I'm not sure whether this is a permissions issue or whether wildcards simply aren't a feature of intake.
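
A rough sketch of why a wildcard depends on the directory listing, assuming the pattern is expanded client-side by listing keys under a literal prefix and then filtering them (this is an assumption about the mechanism, not a confirmed description of what intake does). The bucket name, keys and both helper functions are hypothetical, and the listing is passed in as a plain list so the example needs no network access. Note that `fnmatchcase` lets `*` match `/`, which is looser than a real glob.

```python
from fnmatch import fnmatchcase


def split_glob(urlpath):
    """Split 's3://bucket/prefix/*.tif' into (bucket, listing prefix, key pattern)."""
    without_scheme = urlpath.split("://", 1)[1]
    bucket, _, key_pattern = without_scheme.partition("/")
    # Everything before the first wildcard character is a literal prefix
    # that could be handed to the store's listing call.
    positions = [key_pattern.find(ch) for ch in "*?["]
    first_wild = min((p for p in positions if p != -1), default=len(key_pattern))
    return bucket, key_pattern[:first_wild], key_pattern


def expand(urlpath, listed_keys):
    """Filter an already-fetched key listing against the wildcard pattern."""
    bucket, prefix, pattern = split_glob(urlpath)
    return [
        f"s3://{bucket}/{key}"
        for key in listed_keys
        if key.startswith(prefix) and fnmatchcase(key, pattern)
    ]


# Hypothetical listing; in practice this is the step that needs list permission.
keys = ["images/a.tif", "images/b.tif", "images/readme.txt", "other/c.tif"]
matches = expand("s3://example-bucket/images/*.tif", keys)
```

If that is how the expansion works, a bucket policy that allows reading individual objects but not listing them would fail at the listing step while direct object URLs still load, which would be consistent with the error being a permissions issue.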

At the moment, it's somewhat moot due to:

So for prototyping purposes intake returns a CSV of file locations in s3, but we lose the benefit of the neatly packaged ImageSource. That's one to bear in mind when refactoring the image preprocessing and improving the metadata. It's also worth requesting an object store from JASMIN for testing, to experiment with the bucket policies.
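
For reference, a minimal sketch of the CSV workaround, assuming the index file has one row per image and a column holding the object key. The column name `Filename`, the bucket name and the helper function are all hypothetical, not taken from the actual catalogue.

```python
import csv
import io


def s3_urls_from_csv(csv_text, bucket, column="Filename"):
    """Turn a CSV index of object keys into full s3:// URLs."""
    reader = csv.DictReader(io.StringIO(csv_text))
    # Skip rows where the key column is missing or empty.
    return [
        f"s3://{bucket}/{row[column].lstrip('/')}"
        for row in reader
        if row.get(column)
    ]


# Hypothetical index content standing in for the CSV that intake returns.
index = "Filename,Label\nimages/a.tif,copepod\n/images/b.tif,detritus\n,unknown\n"
urls = s3_urls_from_csv(index, "example-bucket")
```

Each URL then has to be opened individually, which is the per-file bookkeeping that ImageSource would otherwise handle.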