Open blahah opened 9 years ago
This yeast dataset, generated for the Trinity paper, is an option:
Advantages of this dataset include:
Doing some preliminary analysis on the yeast dataset, I downloaded it and ran assemblotron without subsampling. A single assembly with SOAPdenovoTrans + transrate takes about 4 minutes on 24 cores of our cluster, so this looks promising for a sweep of perhaps 4 major parameters (K, d, e, t). Unfortunately there's some weird combination of the environment on the cluster and the build of Salmon that is making it segfault on some data. I am working through this with Rob Patro (Salmon developer).
Turns out there was a bug in Salmon that was triggered by this dataset. It's now fixed in Salmon 0.4.2 and transrate 1.0.0, so I've set the parameter sweep running again.
Possible arabidopsis read datasets:
Features of the dataset: