Closed padpadpadpad closed 7 years ago
You're right, this is unclear, the data is in the repository: https://github.com/spholmes/F1000_workflow/tree/master/data
We should probably automate the download like we did for the other data.
Especially given the slight difficulty of downloading a single GitHub file. Finally managed it in terminal as:
wget https://raw.githubusercontent.com/spholmes/F1000_workflow/master/data/MIMARKS_Data_combined.csv
after changing to the desired directory. Thanks Dan
I'm not understanding the issue. The expectation is that you would download/clone the whole repository, in which case all the files, including in data/
, would be available locally, in the (portable) relative path of the scripts you're running.
The entire workflow is completely reproducible as a repository, as-is.
@spholmes am I missing something?
Ah this is a good point.
Maybe you should write that in the README.md to make it clearer. I came across this from the manuscript.
There is no obvious mention of GitHub repository on there and no mention of the desire to clone it. I came by the GitHub repository after I had already started the tutorial analysis.
Thanks for clarifying. You may not the only person to miss this.
However, there is a Data Availability section toward the end of the manuscript:
Intermediary data for the analyses are made available both on GitHub at https://github.com/spholmes/F1000_workflow and at the Stanford digital repository permanent url for this paper: http://purl.stanford.edu/wh250nn9648. All other data have been previously published and the links are included in the paper.
Software availability Bioconductor packages at https://www.bioconductor.org/. CRAN packages at https://cran.r-project.org/.
Permanent repository for the data and program source of this paper:
https://purl.stanford.edu/wh250nn9648
Latest source code as at the time of publication:
https://github.com/spholmes/F1000_workflow
Archived source as at the time of publication: Zenodo: F1000_workflow: MicrobiomeWorkflowv0.9, doi: 10.5281/zenodo.5454436
Given that this is in the manuscript, I think I will close this issue.
The walkthrough states:
"The last bit of information needed is the sample data contained in a .csv file."
But then just sets the path and reads it in and there I cannot find where the file would be created and what information needs to be in it.
Thanks Dan