amy-langley closed this 5 years ago
The ones I've used/would suggest are:
Look into Faktory for a queue system as well: https://www.mikeperham.com/2019/01/08/using-faktory-with-python/
Pillow was my solution of choice as well, but it looks like it can't support 16-bit/channel TIFF files, which is the format of the LANDSAT data channels. I may need to either find something else or augment Pillow with our existing method of using ImageMagick.
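If we do end up mixing Pillow and ImageMagick, one option is to check each file's bit depth up front and route it accordingly. Here is a rough, stdlib-only sketch that reads the BitsPerSample tag (258) from the first IFD of a classic TIFF. The function names and the "more than 8 bits goes to ImageMagick" rule are my own assumptions for illustration, not something either library prescribes, and it does not handle BigTIFF.

```python
import struct

BITS_PER_SAMPLE_TAG = 258  # BitsPerSample tag ID in the TIFF 6.0 spec
SHORT_TYPE = 3             # TIFF field type SHORT (unsigned 16-bit)


def tiff_bits_per_sample(data: bytes) -> tuple:
    """Return the BitsPerSample values from the first IFD of a classic TIFF."""
    if data[:2] == b"II":
        endian = "<"  # little-endian ("Intel") byte order
    elif data[:2] == b"MM":
        endian = ">"  # big-endian ("Motorola") byte order
    else:
        raise ValueError("not a TIFF: bad byte-order mark")

    magic, ifd_offset = struct.unpack_from(endian + "HI", data, 2)
    if magic != 42:
        raise ValueError("not a classic TIFF (magic number is not 42)")

    (entry_count,) = struct.unpack_from(endian + "H", data, ifd_offset)
    for i in range(entry_count):
        # Each IFD entry is 12 bytes: tag, type, count, value-or-offset.
        entry_offset = ifd_offset + 2 + 12 * i
        tag, field_type, count = struct.unpack_from(
            endian + "HHI", data, entry_offset
        )
        if tag != BITS_PER_SAMPLE_TAG:
            continue
        if field_type != SHORT_TYPE:
            raise ValueError("unexpected field type for BitsPerSample")
        if count * 2 <= 4:
            # Values small enough to fit are stored inline in the entry.
            values_offset = entry_offset + 8
        else:
            # Otherwise the last 4 bytes hold an offset to the values.
            (values_offset,) = struct.unpack_from(
                endian + "I", data, entry_offset + 8
            )
        return struct.unpack_from(endian + f"{count}H", data, values_offset)

    return (1,)  # TIFF 6.0 default when the tag is absent


def needs_imagemagick(data: bytes) -> bool:
    """Hypothetical routing rule: anything deeper than 8 bits/channel."""
    return any(bits > 8 for bits in tiff_bits_per_sample(data))
```

In practice you would only need to pass in the header and first IFD of the file rather than the whole image, though where the IFD sits depends on how the file was written.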
I'm no longer delighted about Faktory, knowing that it bakes its own persistence mechanism into the server. I'd much rather stick with Celery/Redis for future flexibility. Thoughts?
Faktory is moving from RocksDB to an embedded Redis instance:
https://github.com/contribsys/faktory/wiki/Redis
but note that they are explicitly not enabling shared tenancy or replication
Last note: we require several features that are in Faktory's paid tier but not its free tier, so I think Celery is probably going to be our best bet.
Current plan is:
ORM: Django
Queue: Celery
Images: Pillow/ImageMagick where necessary
GIS: libgdal + Python bindings
REST/OAuth: requests / requests-oauthlib
Testing: pytest + pspec
Since we're going to tentatively build this pipeline out in Python, we need to select Python versions of our familiar tools:
As well as tools for handling some novel challenges: