wildlife-dynamics / ecoscope-workflows

An extensible task specification and compiler for local and distributed workflows.
BSD 3-Clause "New" or "Revised" License

Data connections #31

Closed. cisaacstern closed this 5 days ago

cisaacstern commented 2 weeks ago

Closes #32 #33

cisaacstern commented 2 weeks ago

TODO:

cisaacstern commented 1 week ago

I have presented the proposed solution to the Pydantic community for feedback here: https://github.com/pydantic/pydantic/discussions/3205#discussioncomment-9799424

cisaacstern commented 1 week ago

I now have a draft of an interactive local "keystore" generator (i.e. a generator for a .env file). Remaining TODO:

cisaacstern commented 1 week ago

@Yun-Wu @atmorling @walljcg, this is ready to go IMHO.

It's obviously a bit of a hard PR to review given its size and relative lack of documentation 🤭. But it's also an important one, so if you all want to and are able to give it a quick skim before merging, perhaps something will jump out at you that you'd like to ask about.

I'll leave this open for now and move on to other work, and then we can merge early next week once anyone who'd like to has given it a quick read.

To clarify though: while I'm very open to criticism on this, and to the possibility that I've overlooked something, I'm also quite confident in the behavior at this point, and would feel comfortable merging without review as well if you happen to be busy with other things!

To list a few things here for our collective future reference:

Yun-Wu commented 1 week ago

Thank you Charles! It looks great! Added some minor comments for clarification here.

cisaacstern commented 1 week ago

Thanks so much for the review Yun! I really appreciate it!! Left you responses inline above. 🙏

atmorling commented 6 days ago

I really like this implementation 🥳 Do you see the gsm solution being tenable when we get to storing user configured data connections (and therefore potentially user credentials to those data connections) via ecoscope-web and then being retrieved wherever the task is executed?

cisaacstern commented 5 days ago

> I really like this implementation 🥳 Do you see the gsm solution being tenable when we get to storing user configured data connections (and therefore potentially user credentials to those data connections) via ecoscope-web and then being retrieved wherever the task is executed?

Great question @atmorling. Short answer: I'm not sure. Slightly longer answer: the GSM solution here is at least a reference for how a GCP-based backend can eventually be plugged into this PR. At most, it's the actual solution we'd want to use, but that part I'm not sure about.

I've opened https://github.com/wildlife-dynamics/ecoscope-workflows/issues/48 to track this further. Please add to the discussion there if you have further thoughts or if I haven't captured the questions correctly!
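
For future reference, here is a minimal sketch of what "plugging in" a backend could mean in general. It is not the code in this PR: `SecretsBackend`, `EnvBackend`, `InMemoryRemoteBackend`, `resolve_connection`, and the `CONNECTION__FIELD` naming scheme are all hypothetical names made up for illustration. The idea is only that task code asks an interface for a named secret, so a local .env-style store and a remote store (GSM or otherwise) can be swapped without touching the task.

```python
from __future__ import annotations

import os
from typing import Mapping, Optional, Protocol


class SecretsBackend(Protocol):
    """Hypothetical interface: anything that can resolve a named secret."""

    def get(self, name: str) -> str: ...


class EnvBackend:
    """Reads secrets from environment-style variables, e.g. loaded from a .env file."""

    def __init__(self, environ: Optional[Mapping[str, str]] = None) -> None:
        # Default to the real process environment; accept a mapping for testing.
        self._environ = os.environ if environ is None else environ

    def get(self, name: str) -> str:
        return self._environ[name]


class InMemoryRemoteBackend:
    """Stand-in for a remote secret store; a real one would call a cloud API here."""

    def __init__(self, secrets: Mapping[str, str]) -> None:
        self._secrets = dict(secrets)

    def get(self, name: str) -> str:
        return self._secrets[name]


def resolve_connection(
    backend: SecretsBackend, connection: str, fields: tuple[str, ...]
) -> dict[str, str]:
    """Collect the fields of one named connection from whichever backend is given.

    Assumes a made-up key scheme of the form CONNECTION__FIELD.
    """
    prefix = connection.upper()
    return {field: backend.get(f"{prefix}__{field.upper()}") for field in fields}


# The same call works against either backend.
local = EnvBackend({"DEMO__SERVER": "https://example.org", "DEMO__TOKEN": "abc"})
remote = InMemoryRemoteBackend({"DEMO__SERVER": "https://example.org", "DEMO__TOKEN": "xyz"})
local_conn = resolve_connection(local, "demo", ("server", "token"))
remote_conn = resolve_connection(remote, "demo", ("server", "token"))
```

Whether a remote backend like this is the right fit for user-supplied credentials is exactly the open question in the issue linked above.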

cisaacstern commented 5 days ago

Thank you @Yun-Wu and @atmorling for the reviews!!