A workflow for construction of Gene Expression count Matrices (GEMs). Useful for Differential Gene Expression (DGE) analysis and Gene Co-Expression Network (GCN) construction
GEMmaker currently adds failed runs to an HTML report, and does nothing else. Most of the time, these runs fail diue to an issue with NCBI.
Most users will want to try to run these again. If they did not implement a retry, this will need to be done after the first GEMmaker run completes. If they do not want to re-run the entire run, they are forced to copy-paste and create a new SRA.txt file to re-run.
It would be nice if there was a failed runs file which could just be used as this file, rather than dealing with the html file. Currently the HTML file is not user friendly to copy from, due to multiple columns and unneccesary $ signs:
When copied from HTML:
$SRR2931055 | $SRX1423949 | $Failed to fetch sample url.
-- | -- | --
$SRR2931063 | $SRX1423957 | $Failed to fetch sample url.
$SRR2931062 | $SRX1423956 | $Failed to fetch sample url.
$SRR2931065 | $SRX1423959 | $Failed to fetch sample url.
$SRR2931068 | $SRX1423962 | $Failed to fetch sample url.
$SRR2931069 | $SRX1423963 | $Failed to fetch sample url.
$SRR2931083 | $SRX1423977 | $Failed to fetch sample url.
$SRR2931084 | $SRX1423978 | $Failed to fetch sample url.
$SRR2931086 | $SRX1423980 | $Failed to fetch sample url.
$SRR2931079 | $SRX1423973 | $Failed to fetch sample url.
$SRR2931077 | $SRX1423971 | $Failed to fetch sample url.
$SRR2931087 | $SRX1423981 | $Failed to fetch sample url.
$SRR2931080 | $SRX1423974 | $Failed to fetch sample url.
$SRR2931045 | $SRX1423939 | $Failed to fetch sample url.
$SRR2931050
I could either write a bash command to get the list of failed runs, or manually modify this list by deleting unnecesary information. Would be nice if the failed run file had a partner that looked like this:
GEMmaker currently adds failed runs to an HTML report, and does nothing else. Most of the time, these runs fail diue to an issue with NCBI.
Most users will want to try to run these again. If they did not implement a retry, this will need to be done after the first GEMmaker run completes. If they do not want to re-run the entire run, they are forced to copy-paste and create a new SRA.txt file to re-run.
It would be nice if there was a failed runs file which could just be used as this file, rather than dealing with the html file. Currently the HTML file is not user friendly to copy from, due to multiple columns and unneccesary $ signs:
When copied from HTML:
I could either write a bash command to get the list of failed runs, or manually modify this list by deleting unnecesary information. Would be nice if the failed run file had a partner that looked like this: