Open m09 opened 5 years ago
@m09 I guess we've done everything and can close this?
Even though we used it extensively for single-user repos (both @glimow and I), we did not try to collaborate much yet. We'll do it this week with @r0mainK and close the issue with concluding comments after that :)
During our recent exploration of ML collaboration tool-suites, we came across
dvc
, a well-established open source solution developed among others by the folks at Open Data Science, a community beloved by Russian speaking data scientists.We'd like to give it a try since it fits really well with our values at source{d} and solves the core part of our problems when we experiment:
To try
dvc
, the first step is to use it individually in one or two projects:dvc
for the CodRep task in https://github.com/src-d/formatml/;dvc
for the topic modeling experiments (if @m09's first experiments are promising) in https://github.com/src-d/tm-experiments.The second step is to have the ability to share the large data files and results for good teamwork and collaboration. To test this, two things are needed:
dvc
remote on our ML Cluster;