Closed Rob-murphys closed 1 year ago
Hi thanks for using our tool could you send us ur data (without sensitive info) ?
sample data.zip Here is a sample of the data
thx will do this in the weekend
Many thanks :) I am really excited to see the predicted substrates and gene families!
Thx
@linnabrown Hey Le, I was wondering if you had managed to find the cause of the issue?
Sorry I forgot to resolve it since I was busy these 3 weeks. Let me do this next week when I back to school.
Hey @linnabrown, No worries, I hope you settled back into work after the summer break 😄 I have tried running the tool on subsets of the data and after freshly reinstalling and still get the same error. unfortunately.
Hi Rob, the problem might be from the location file (gff), specified by --cluster $locations. It might be that it's not in an acceptable format. you just shared your the protein seq file (faa), can you also share the gff file for us to look at where the problem is?
Hi @yinlabniu, here is the associated gff file 😄 sample_data_gff.zip
okay, your gff file is clearly not in gff format. You may see https://www.ncbi.nlm.nih.gov/genbank/genomes_gff/ or https://www.youtube.com/watch?v=hiRXtxcTn_I for the gff format.
Opps, I forgot to request gff from prokka but thought I had. My bad!
Given the fantastic new capabilities of dbcan 4 I am wanting to rerun some data I previously ran through dbcan 2. I have installed the latest version of the tool and attempted to run my samples through it but once
CGC-Finder
I get this error:My running command is:
run_dbcan $input protein --db_dir $database --out_dir $outdir --hmm_cpu 10 --dia_cpu 10 --tf_cpu 10 --stp_cpu 10 --cluster $locations --cgc_substrate
I included a few of the lines that come before it in case that is helpful. Any idea how I can solve this issue?