Open zx0223winner opened 2 years ago
Hi this is a common error in HSDFinder, especially for species rich of gene duplicates. I detailed the reason and solution for users with the link below, let me know if you still have trouble with the new merged Blast result file. Please go to the link and search for the question below:
https://github.com/zx0223winner/HSDFinder#how-to-deal-with-error-require-length-of-gene-
How to deal with Error: require length of gene ?
~Xi
Hi
This is same error and can be easily fixed Looks like in your new blast all-against-all file, you are still missing the length info lines like below if you search it . Make sure you run the Unix command on your protein data to acquire all the length info lines for each protein gene.
XP_034417599.1 XP_034417599.1 100 30256
awk '/^>/{if (l!="") print l; print; l=0; next}{l+=length($0)}END{print l}' '/.../.../protein.fa' |paste - - |sed 's/>//g'|awk -F'\t' '{print $1"\t"$1"\t"100"\t"$2}' >##.protein.length.aa
~Xi
I'm trying to run the HSD finder but ending up with an error, suggestions please. Are there any python packages to be installed?