Clay-foundation / model

The Clay Foundation Model (in development)
https://clay-foundation.github.io/model/
Apache License 2.0
299 stars 38 forks source link

Allow ClayDataModule to load GeoTIFF files directly from s3 #92

Closed weiji14 closed 8 months ago

weiji14 commented 8 months ago

Similar to work done in #85 on the GeoTIFFDataPipeModule, this PR implements similar functionality in ClayDataModule to load GeoTIFF files from an s3 bucket. Plus a few more minor tweaks to align both LightningDataModules.

Implementation uses torchdata's S3FileLister to get the files, but instead of returning an iterator, a list is returned.

TODO:

Continuing on from #91, this PR is part 2/3 of working towards generating new embeddings from the model developed at #47.