Open victorlin opened 3 years ago
I think this is just a straight bug, the first line should be starting with SUMZER_COMMENT=
, I totally agree
Is something me and @rcedgar have argued about. I disliked the sra=XXXX
on every line quite a bit originally as it looks ugly, but in practice it's incredibly pragmatic since we grep
these files very often for spot checking and development. If someone were to work with the summary files in bulk I think the same point is true there, it's very useful to have the sra=
on each line. It solves some ugly problems with working with millions of files on a linux file-system. I'd opt to retain it.
Good point about the grep
. Would it be equally beneficial to have the sra=XXXX
for nucleotide summary files as well? That way it's more consistent.
Yes, equally beneficial.
Examples for ERR2756788.
Original summary header line [S3 link]:
New summary header line [S3 link]:
New psummary contents [S3 link]:
2 questions:
SUMZER_COMMENT=
as it was originally?sra=ERR2756788;
be removed from the beginning of every line?I know these files have already been uploaded, so this is more a note for any future reprocessing.