Closed olgabot closed 4 years ago
To make the output protein/dna sequences unique and indexable, they need to not have spaces where there is unique information, e.g. for the translation frame. This PR adds the translation frame to the sequence name, but without spaces.
E.g. here's a current output:
>read1/tr|A0A024R1R8|ENSP00000491117;mate1Start:1;mate2Start:1 translation_frame: 1
This would be changed to:
>read1/tr|A0A024R1R8|ENSP00000491117;mate1Start:1;mate2Start:1__translation-frame:1
Many thanks to contributing to czbiohub/sencha!
Please fill in the appropriate checklist below (delete whatever is not relevant). These are the most common things requested on pull requests (PRs).
pytest
make coverage
black . --check
usage.md
README.md
the tests are failing as the expected sequence names have spaces in them, otherwise this PR looks good
Closed in favor of #93
To make the output protein/dna sequences unique and indexable, they need to not have spaces where there is unique information, e.g. for the translation frame. This PR adds the translation frame to the sequence name, but without spaces.
E.g. here's a current output:
This would be changed to:
Many thanks to contributing to czbiohub/sencha!
Please fill in the appropriate checklist below (delete whatever is not relevant). These are the most common things requested on pull requests (PRs).
PR checklist
pytest
ormake coverage
if you want to see which lines don't have tests yet)black . --check
).usage.md
is updatedREADME.md
is updated