kblin / merge-gbk-records

Merge multiple GenBank records using a defined spacer sequence
Apache License 2.0

Question: Missing LOCUS identification #3

Open ne1al opened 2 years ago

ne1al commented 2 years ago

Hello all, I understand that merge-gbk-records concatenates all gbk files into one, but is there a way to keep the LOCUS metadata in the new merged file? It only keeps the LOCUS of the first gbk record in the loop, and I don't want to lose the loci of the other records. Thanks in advance.

kblin commented 2 years ago

That's not possible at the moment, but depending on how you want the other locus tags it will be easy to add. Would having the LOCUS id of the subsequent records on the spacers preceding these records work for you?
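To make the suggestion above concrete, here is a minimal sketch of the coordinate bookkeeping it implies. This is not merge-gbk-records' actual code; `Segment` and `layout_merged` are hypothetical names invented for illustration, and the input is assumed to be a plain list of (LOCUS id, sequence length) pairs rather than parsed GenBank records. Each spacer is labelled with the LOCUS id of the record that follows it.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    kind: str   # "record" or "spacer"
    label: str  # LOCUS id this segment refers to
    start: int  # 0-based start in the merged sequence
    end: int    # 0-based, exclusive end in the merged sequence


def layout_merged(records, spacer_len):
    """Compute where each record and spacer lands in the merged sequence.

    `records` is a list of (locus_id, sequence_length) pairs (an assumed,
    simplified input). A spacer is placed before every record except the
    first and carries the LOCUS id of the record that follows it.
    """
    segments = []
    pos = 0
    for i, (locus_id, length) in enumerate(records):
        if i > 0:
            # Spacer preceding this record, tagged with this record's LOCUS id.
            segments.append(Segment("spacer", locus_id, pos, pos + spacer_len))
            pos += spacer_len
        segments.append(Segment("record", locus_id, pos, pos + length))
        pos += length
    return segments


if __name__ == "__main__":
    # Hypothetical LOCUS ids and lengths, for illustration only.
    example = [("cluster_A", 1200), ("cluster_B", 800)]
    for seg in layout_merged(example, spacer_len=20):
        print(seg.kind, seg.label, seg.start, seg.end)
```

With a layout like this, each spacer could be written out as a feature whose label names the original LOCUS, so the boundaries between records remain visible in the merged file. Note that the first record has no preceding spacer, so its id would still only appear in the merged record's own LOCUS line.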

ne1al commented 2 years ago

I think your suggestion might be helpful. The thing is that I have to manually annotate those loci in Clinker, a visualization tool for biosynthetic gene clusters. The way merge-gbk-records merges the records makes me lose track of which locus each original record belongs to. I might give your suggestion a try.

kblin commented 2 years ago

I think Clinker supports loading multiple separate GenBank records, so I don't think you need to merge them using merge-gbk-records.

ne1al commented 2 years ago

Yes, it does. But I am trying to merge GenBank records for different gene clusters present in one sample to compare them to other samples.