Closed tnguyensanger closed 6 years ago
@tnguyensanger I've not looked at the alignment_filter before. Do you know what this means? What are you using it for?
@podpearson I haven't read any official documentation for it, but inferring from its usage, it specifies the reference the reads were aligned against. It only seems to exist for plasmodium, anopheles samples. It doesn't exist for the combined Human/plamodium samples (eg /seq/15235/15235_2.cram ). I don't know what's more reliable, using the alignment_filter column or simply filtering out any filenames with phix or _human in the filename.
This was a deep one...
alignment_filter was not set because it didn't appear in the baton JSON.
It didn't appear in the baton JSON because I didn't have the baton JSON stored.
I didn't have the baton JSON stored because the samples for these files had not Multi-LIMS warehouse ID.
Added MLWH IDs, got the baton iRODs, populated alignment_filter. Your original query now returns 0 results. As things should be.
@tnguyensanger to review
@magnusmanske Thanks for fixing this Magnus! Would it be possible to get a link to the commit that addresses the issue? Ad-hoc manual testing shows the issue has been resolved.
The commit would be: https://github.com/malariagen/fits/commit/08d87ceaf8c34e396b345a6ce8a328c624d068ab
Sometimes the alignment_filter is empty in FITS, but populated in IRODS.
EG) FIts query:
IRODS query: