Closed aldendirks closed 1 year ago
Hi @aldendirks,
Sorry for the late reply.
We published a new version of InterProScan last week (version 5.64-96.0
) and PANTHER GO terms now have the category and the name, e.g.
<?xml version="1.0" encoding="UTF-8"?><protein-matches xmlns="https://ftp.ebi.ac.uk/pub/software/unix/iprscan/5/schemas" interproscan-version="5.64-96.0">
<protein>
<sequence md5="9e4b4cc8d93c10ef376100d7ebfa07d0">MAPTKYTPLTLHFSDAVTNVYPRQVEKLVANDGSYEYFRALGENEQKDILWRSKIAKALVEKYLKNAKGDRLTETDTAKDYIFKTLPENYKLYEHVKGKRDEKSGGTISERRDTYLFGHPTGKRFRSPAEFVPHILHLAAQDDRPCECWICTGSKHGNPPTSVKKPTKRETEVTQARKVVALEERQREQETAGWVLRKGEVVWVWLSDNPEAEEASDDALIDGDGGLWVAGVVAERPSFTPPYQKVRKTTGNAFADIDMDDTPPTWQQEGGNVPEKTYIIQLCSDPPKLGQILKGVPQHHVKPWLSRQECAQAPPSYSGKIEHPSIPRARRVAETFSLFDRVSEPSDPPSASDPSPDAPKIANFQGVFLGAEKIYIHEPVRISSANEDEIEDVLVVDKIYTCTTTSESASSGSDGKKKTLTTTQFRGNVYTAYPSTTCTPLSSHQFTELPFRMRRGSGTGEIIKWFIRNVPEERGECSLKMILGRWYEPQAVNEWIGSTGFSGGLPSSKETAMCQKDVKRWVKNRADALGLVSVNGIDLKSEGEVKIQPGKLTSPLKPKPADATAEAMDVDEPPQVTPERGFKSVNLRISSVTPGSASSLKITPRTEADDAGIDGGDIEEEEQVEGDEDEEDEDDEATMSDDKYHQPGPEVLSRSPTKRLSK</sequence>
<xref id="FUN_001952-T1" name="FUN_001952-T1 FUN_001952"/>
<matches>
<panther-match ac="PTHR38046:SF1" evalue="5.9E-54" graft-point="PTN002866222" name="CRYPTIC LOCI REGULATOR 2" score="195.5">
<signature ac="PTHR38046" name="CRYPTIC LOCI REGULATOR 2">
<entry ac="IPR038986" desc="Cryptic loci regulator 2" name="Clr2" type="FAMILY">
<go-xref category="CELLULAR_COMPONENT" db="GO" id="GO:0070824" name="SHREC complex"/>
<go-xref category="BIOLOGICAL_PROCESS" db="GO" id="GO:0031507" name="heterochromatin formation"/>
</entry>
<signature-library-release library="PANTHER" version="17.0"/>
</signature>
<model-ac>PTHR38046:SF1</model-ac>
<locations>
<panther-location env-start="2" env-end="546" hmm-start="16" hmm-end="548" hmm-length="0" hmm-bounds="INCOMPLETE" start="4" end="494">
<location-fragments>
<panther-location-fragment start="4" end="494" dc-status="CONTINUOUS"/>
</location-fragments>
</panther-location>
</locations>
<go-xref category="BIOLOGICAL_PROCESS" db="GO" id="GO:0016575" name="histone deacetylation"/>
<go-xref category="BIOLOGICAL_PROCESS" db="GO" id="GO:0030466" name="silent mating-type cassette heterochromatin formation"/>
<go-xref category="CELLULAR_COMPONENT" db="GO" id="GO:0033553" name="rDNA heterochromatin"/>
</panther-match>
</matches>
</protein>
</protein-matches>
You can download the latest version of InterProScan there: https://www.ebi.ac.uk/interpro/download/InterProScan/.
I am attempting to run Funannotate to annotate my fungal genomes, which makes use of interproscan. When parsing the interproscan.xml file, Funannotate is giving me the error
Error parsing XML GO terms: None is not a valid term
. Looking more closely at the XML file, I see withing thepanther-match
carrots there arego-xref
lines without any category information. For example, the first protein of aninterproscan.xml
file is pasted below. Is this list of GO IDs without any more information normal (towards the end)?