Closed suryasaha closed 3 years ago
Possibly something I'm missing. Why do we remove the XML header lines? @amcooksey https://github.com/AgBase/InterProScan/blob/08ae6068f4771b168426a3ccd8c3032f4c49da92/5.45-80/iprs_wrapper.sh#L227
Looks like the protein-matches
closing tag should also be removed from all split XML files and then added to the final XML file
Missing opening tag
It looks like the merging skips the opening tag
<?xml version="1.0" encoding="UTF-8"?><protein-matches xmlns="http://www.ebi.ac.uk/interpro/resources/schemas/interproscan5" interproscan-version="5.45-80.0">
So we have a bunch of dangling
</protein-matches>
tags