There are some use cases not covered by the rough first pass:
Some users will be interested in in-progress data, or in data that has only been run through a pipeline, as a simple means of data discovery.
Solution
Add tabs to the summary page:
Complete or Ground Truth (name to be decided): a list of high-quality, human-reviewed data suitable for inclusion in training, for marketing and demos, and for presentation material during workshops.
In Progress (name to be decided): a list of all public data that people can browse.
It would be nice if we could somehow motivate people to publish their data once they're sufficiently happy with it.
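As a rough illustration of how the two tabs might partition the data, here is a minimal sketch. The `Dataset` record and its `is_public` and `human_reviewed` flags are hypothetical and are not taken from the existing service. It also assumes the In Progress tab lists every public dataset, including the reviewed ones.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    # Hypothetical record; field names are assumptions for illustration only.
    name: str
    is_public: bool
    human_reviewed: bool


def summary_tabs(datasets):
    """Group datasets into the two proposed summary-page tabs."""
    # Private data never appears on the summary page.
    public = [d for d in datasets if d.is_public]
    return {
        # Only public data that a human has reviewed counts as complete.
        "Complete": [d for d in public if d.human_reviewed],
        # Assumption: this tab shows all public data, reviewed or not.
        "In Progress": public,
    }
```

Whether reviewed data should also appear under In Progress is an open design choice; filtering it out would be a one-line change.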
Data moderation
Because this feature makes data more discoverable, data moderation becomes important. In the worst case, a user could upload illegal content or tag data with offensive information that then appears in the summary. How will we deal with this? Who is responsible for it? How much time is too much time to spend on this?
There will need to be a way for someone to disallow spam data from this list.
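One possible shape for such a mechanism is a moderator-maintained hide list that is applied before the summary is rendered. The sketch below is only an assumption about how this could work; `ModerationList` and its methods are hypothetical names, and it does not answer the open questions about who moderates or how much time to spend.

```python
from dataclasses import dataclass, field


@dataclass
class ModerationList:
    # Hypothetical structure: maps a dataset id to (moderator, reason).
    hidden: dict = field(default_factory=dict)

    def hide(self, dataset_id, moderator, reason):
        """Record that a moderator removed a dataset from the public list."""
        self.hidden[dataset_id] = (moderator, reason)

    def restore(self, dataset_id):
        """Undo a hide; does nothing if the dataset was not hidden."""
        self.hidden.pop(dataset_id, None)

    def visible(self, dataset_ids):
        """Return the ids that may still be shown, in their original order."""
        return [i for i in dataset_ids if i not in self.hidden]
```

Keeping the moderator and reason alongside each hidden entry leaves a simple audit trail, which may help if a decision is later questioned.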
Terms of service
We should be more transparent about our data use policy and terms of service when people upload data to this service. For most of our government customers, this isn't an issue because the data should be public anyway. But it's better to have clear terms than to have a problem later.