lh3 / miniprot

Align proteins to genomes with splicing and frameshift
https://lh3.github.io/miniprot/
MIT License
323 stars 17 forks source link

Add an option to not to print ##PAF line to gff file #6

Closed xiekunwhy closed 2 years ago

xiekunwhy commented 2 years ago

Hi,

1) Would you please add an option to not to print ##PAF line to gff file? Those lines may cause some errors when using gffread(https://github.com/gpertea/gffread) to do some downstream screening. gffread Opl.gff -T -o Opl.gtf image

2) The ID prefix now is MP, would please add an option to accept custom ID prefix (and accept custom source name if possible) ? We have difference protein sets with differece weight need to mapping some times, for example, preparing homolog annotation results for evm (https://evidencemodeler.github.io/).

Best, Kun

lh3 commented 2 years ago
  1. Will add an option to disable ##PAF lines. For now, use grep -v ^##PAF to postprocess the output. PS: even with the new option to add, it is recommended to keep ##PAF lines. These lines give detailed alignment information which gff doesn't provide.

  2. I also want to have an option to change the prefix. It just hasn't been implemented yet. As you mention it, I will implement this feature soon.

lh3 commented 2 years ago
  1. Use --gff-only
  2. Use -P MyPrefix