src-d / ml-backlog

Issues belonging to source{d}'s Machine Learning team which cannot be related to a specific repository.

Try DVC as a collaboration workflow in RML #79

Open m09 opened 5 years ago

m09 commented 5 years ago

During our recent exploration of ML collaboration tool suites, we came across dvc, a well-established open-source solution developed by, among others, the folks at Open Data Science, a community beloved by Russian-speaking data scientists.

We'd like to give it a try since it fits really well with our values at source{d} and solves the core part of our problems when we experiment:

To try dvc, the first step is to use it individually in one or two projects:

The second step is to have the ability to share the large data files and results for good teamwork and collaboration. To test this, two things are needed:

vmarkovtsev commented 5 years ago

@m09 I guess we've done everything and can close this?

m09 commented 5 years ago

Even though both @glimow and I have used it extensively for single-user repos, we have not tried collaborating much yet. We'll do it this week with @r0mainK and close the issue with concluding comments after that :)