husonlab / megan-ce

MEGAN Community Edition
GNU General Public License v3.0
65 stars 22 forks source link

Importing from RPD Standalone format is not operational. #6

Open ialbert opened 7 years ago

ialbert commented 7 years ago

When using MEGAN 6 we were unsuccessful in importing data in the "RDP Standalone" classification format.

Notably this same file in RDP Standalone format loads correctly in MEGAN 5.

Looking at the code below:

https://github.com/danielhuson/megan-ce/blob/master/src/megan/parsers/blast/RDPStandalone2SAMIterator.java

It does not appear that it attempts to do the right thing. As the MEGAN 6 manual states the RDP Standalone format is:

read-name [-] [taxon-name rank-name disjointScore] [taxon-name rank-name disjointScore] 

whereas the code above attempts to split by ; among other things.