fulcrumgenomics / fgpyo

Quality of life improvements for Bioinformatics in Python.
https://fgpyo.readthedocs.io/en/latest/
Other
24 stars 3 forks source link

Add function to combine headers from multiple AlignmentFiles #56

Open msto opened 10 months ago

msto commented 10 months ago

It is sometimes useful to combine headers (particularly the reference sequence dictionaries) in order to write alignments from multiple input files to a single output file.

Suggestions for validations from @nh13 :

nh13 commented 10 months ago

It might be useful to see what others (reference impls) have done to merge headers and seqdicts, as well as tools themselves (e.g. samtools merge). You may

tfenne commented 10 months ago

It's also worth noting that except in very limited cases you can't just merge the headers without then transforming all the reads too. E.g.