src-d / ml-backlog

Issues belonging to source{d}'s Machine Learning team which cannot be related to a specific repository.
0 stars 3 forks source link

Describe how we publish datasets #58

Open zurk opened 5 years ago

zurk commented 5 years ago

We do not have any guide about datasets publishing and it is worth to create it. The point of this PR is to describe how do we publish datasets right now to help newcomers better understand how to do it.

Resources that should be taken into account:

  1. Draft design document "Documenting Models, Datasets, and Algorithms"
  2. Proposal guide https://github.com/src-d/guide/pull/163/files

The similar issue about models: https://github.com/src-d/ml-backlog/issues/59

vmarkovtsev commented 5 years ago

This happened in "source{d} datasets, a blog series" post which is scheduled for tomorrow.