Open blahah opened 9 years ago
Yeah, I agree: do at least triplicate for the sampling. 5 or more times would be even better.
OK, let's go with 5 to start with. We need to balance robustness against computation time. Perhaps we don't need to take large samples, like 80%. We could focus on smaller ones, perhaps starting with 5, 10, and 20%? My thinking is that we don't really care whether larger samples are representative, because they don't help much. We just want to show whether the sample sizes we might actually use are useful.
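For reference, here is a rough sketch of what that sampling plan could look like: 5 replicates at each of 5, 10, and 20%, with a distinct seed per run so each replicate is reproducible. The names `sampling_plan` and `subsample`, the seed scheme, and the use of an in-memory list are all assumptions for illustration, not the code used for the actual sweeps.

```python
import random

# Fractions and replicate count proposed in the thread.
FRACTIONS = [0.05, 0.10, 0.20]
REPLICATES = 5


def sampling_plan(fractions=FRACTIONS, replicates=REPLICATES, base_seed=0):
    """Return one (fraction, replicate, seed) tuple per run.

    The seed scheme is a hypothetical choice: every run gets a unique,
    reproducible seed derived from its position in the plan.
    """
    plan = []
    for i, fraction in enumerate(fractions):
        for rep in range(replicates):
            seed = base_seed + i * replicates + rep
            plan.append((fraction, rep, seed))
    return plan


def subsample(items, fraction, seed):
    """Draw a random subsample without replacement from a list of items."""
    rng = random.Random(seed)
    k = max(1, round(len(items) * fraction))
    return rng.sample(items, k)


if __name__ == "__main__":
    reads = list(range(1000))  # stand-in for real read identifiers
    for fraction, rep, seed in sampling_plan():
        sample = subsample(reads, fraction, seed)
        print(f"fraction={fraction:.2f} replicate={rep} n={len(sample)}")
```

With three fractions and five replicates this gives 15 runs per dataset, which is the number to weigh against computation time.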
OK, I've done sweeps of both the yeast and Arabidopsis datasets (see #7). Results to follow.
Experiment: