R-ODAF / R-ODAF_Health_Canada

Health Canada's version of the R-ODAF pipeline, which includes additional visualization and pipelining features
MIT License
2 stars 3 forks source link

Establish sets of reference data #3

Open mattjmeier opened 3 years ago

mattjmeier commented 3 years ago

For testing purposes we need to have two relatively small datasets, with corresponding metadata and contrasts files, to test

cookkate commented 3 years ago

My thoughts:

IMO the first criteria is the most important, and maybe #3.

mattjmeier commented 2 years ago

I think this is where we can discuss continuous integration and development of tests.

To set this up we will really need to define dependencies carefully (which is a good thing anyway).

This guide shows a good strategy. This will also help define a VM/container when that time comes too. Their overall workflow can be seen here. More than one way to skin this cat but this seems like a good starting point for the discussion anyway. I don't think we have to use the "tox" tool, but it provides a useful framework.

Furthermore - this example shows how R can be used within workflows. They have a rather different goal there, but we can still use some of that code to help us.

So, I think the overall tasks for the workflow to make continuous integration possible are:

I'll try to start getting this done in a new branch... I don't know if this all the best strategy but it's a start!

mattjmeier commented 2 years ago

This also seems useful

mattjmeier commented 2 years ago

And act seems like what we should use to run actions locally. Looks easy to use and should save issues with computational power.

mattjmeier commented 2 years ago

I think to close this issue all we should do is establish a set of test data for RNASeq and create a separate entry in the test matrix for it.

Later we can think about adding in other temposeq platforms or different genomes or whatever, but I will feel good about just having the two main types of analysis tested regularly.