leap-stc / data-management

Collection of code to manually populate the persistent cloud bucket with data
https://catalog.leap.columbia.edu/
Apache License 2.0
0 stars 6 forks source link

Add 'external' field to catalog? #58

Open jbusecke opened 11 months ago

jbusecke commented 11 months ago

Wondering if we should add a bool field external to the catalog to distinguish datasets ingested 'in-house' compared to external ones like in #34?

cisaacstern commented 11 months ago

I agree some differentiation might be helpful, but what will be most useful for users to know, I wonder? Perhaps where ingestion happened is less important than access considerations (region, provider, requester-pays, etc.) which could be captured in something like tags?

jbusecke commented 11 months ago

Ah interesting. I was more thinking of this as an internal field (for LEAP reporting for instance), but I think your points are good ones to @cisaacstern