Closed mradz19 closed 1 month ago
Can you go to the results/bins directory and run the following code?
python3 -c """
from glob import glob
fs = glob('*tax')
for f in fs:
    for line in open(f):
        if line.startswith('Consensus'):
            print(f, line.split('\t')[1])
            break
    else:
        print(f, 'NO RESULT')
"""
You should get the size for each bin. Let me know if some bin shows NO RESULT instead of the size.
Thanks for your help! The very last bin says no result:
concoct.386.fa_sub.contigs.fa.tax NO RESULT
Can you share that file with us?
Sure here is a link to the file:
Any chance that step 16 (16.addtax2.pl) died midway?
Can you check whether all the fasta files for the bins in /data/full_project_2/results/bins have an associated *.fa.tax file?
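One way to run that check, as a minimal sketch. It assumes the bin fasta files end in `.fa` and that each tax file is named by appending `.tax` to the fasta file name, as in `concoct.386.fa_sub.contigs.fa.tax`. The function name `bins_missing_tax` is only for illustration and is not part of SqueezeMeta.

```python
from pathlib import Path

def bins_missing_tax(bin_dir):
    """Return the bin fasta files in bin_dir that have no '<name>.tax' file next to them."""
    missing = []
    # Assumption: bin fasta files end in '.fa'; '*.fa' does not match '*.fa.tax'
    for fasta in sorted(Path(bin_dir).glob('*.fa')):
        tax = fasta.with_name(fasta.name + '.tax')
        if not tax.exists():
            missing.append(fasta.name)
    return missing

# Run from inside results/bins
for name in bins_missing_tax('.'):
    print(name, 'NO TAX FILE')
```

If step 16 died midway, the bins it never reached should show up in this list.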
Most don't have an associated tax file; only about 25% of them do. Will I have to repeat from step 16?
Try running step 16 alone (16.addtax2.pl /path/to/project), then we can double-check whether everything looks correct and, if so, restart from step 17.
Ok I will run that and update when it is finished
Ok it ran for a little while and stopped part way through:
Adding SqueezeMeta taxonomy for concoct.381.fa.contigs.fa
Calculating consensus taxonomy using mode s
Adding SqueezeMeta taxonomy for concoct.382.fa_sub.contigs.fa
Calculating consensus taxonomy using mode s
Adding SqueezeMeta taxonomy for concoct.383.fa_sub.contigs.fa
Calculating consensus taxonomy using mode s
Adding SqueezeMeta taxonomy for concoct.386.fa_sub.contigs.fa
Killed
It seems to be stopping at the same bin:
concoct.386.fa_sub.contigs.fa.tax NO RESULT
How many contigs are there in that Bin?
Never mind, I believe it is 176004:
grep -c "^>" concoct.386.fa_sub.contigs.fa
176004
It seems the tax file for this bin is predominantly Chordata. Could I try just deleting this bin and then re-running step 16, or would that cause issues?
Not 100% sure, but it will probably work. The error happens because the bin is too large for our disparity calculation algorithm. We actually added a check for this a while ago, but the fix hasn't made it into an official release yet. This is the link to the updated code, in case you want to replace your script with the fixed one: https://raw.githubusercontent.com/jtamames/SqueezeMeta/binonly/scripts/16.addtax2.pl
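To spot oversized bins before rerunning step 16, one could count the contigs in every bin, the same way the `grep -c "^>"` call does for a single file. This is only a sketch and not the check from the updated script: `count_contigs`, `large_bins`, and the `max_contigs` cutoff are hypothetical, and the cutoff value is a placeholder chosen for illustration.

```python
from pathlib import Path

def count_contigs(fasta_path):
    """Count fasta header lines (lines starting with '>')."""
    with open(fasta_path) as handle:
        return sum(1 for line in handle if line.startswith('>'))

def large_bins(bin_dir, max_contigs):
    """Return (file name, contig count) for bins with more than max_contigs contigs."""
    result = []
    # Assumption: bin fasta files end in '.fa'
    for fasta in sorted(Path(bin_dir).glob('*.fa')):
        n = count_contigs(fasta)
        if n > max_contigs:
            result.append((fasta.name, n))
    return result

# Placeholder cutoff, not the value used by SqueezeMeta
for name, n in large_bins('.', max_contigs=100000):
    print(name, n)
```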
Just an update: I used the new code you linked and step 16 completed successfully; it just skipped the larger bin. I then restarted from step 18 and the run finished without issue. Thanks for your help.
Glad to hear, closing issue!
My project is stopping at step 18 with the error:
I saw that #815 had the same issue, but I really don't want to start the binning again as this is a very large project. I have confirmed that none of the bins in results/bins are 0 bytes in size; they are all at least 1MB.
I have also attached the syslog file: syslog.txt