Closed Shirley-Q closed 4 years ago
Hi @qiaoxuejiao,
most likely, something is wrong with your coverage file. Check that it looks correct and ensure that it has tab separated columns. If you find no error, I will have to look into this error more closely, preferably using input files where you can replicate the problem.
Best Wishes, Johannes
Hi @alneberg, I can not find error, Could you help me look into this error more closely, thanks!
sample1_L1_concoct_inputtableR.pdf
Best wishes, Xuejiao
Woah! A thousand pages pdf! I think the problem could be that the contig ids are digits only. We've had this issue before, could you try adding for example "contig_" before all contig ids.
Hope this will work. Johannes
I adding "contig_" before all contig ids. But it does not work.
I think I prepare a wrong inputfile, I do the whole process again. But I have another error.
After I do this command:
And I have this error:
Thanks very much!
Xuejiao
Hi,
I want to know where are bins.
This is my content of concoct-output file:
Thanks very much!
Hi again @qiaoxuejiao,
the ClusterPlot.R issue is a know issue which is fixed in the latest online version of the script: https://github.com/BinPro/CONCOCT/blob/master/scripts/ClusterPlot.R
Your clustering results are available in the clustering_gt1000.csv file.
In order to get the clusters in fasta format you can use the script:
https://github.com/BinPro/CONCOCT/blob/master/scripts/extract_fasta_bins.py
But if you used the cut_up_fasta.py script you probably want to merge the clustering first, you can do that using: https://github.com/EnvGen/toolbox/blob/master/scripts/concoct/merge_cutup_clustering.py
Best Wishes, Johannes
Hi,
When I python merge_cutup_clustering.py, it have something wrong.
Are there something wrong about my usage?
Thank you very much!
Yes you should run that script on the clustering file and not the original fasta file.
Johannes
Hi,
I run the command using those two scripts.
1.
The following is result:
The results displayed on the terminals directly.Should I generate a new file?what should I do?
2.
For the extract_fasta_bins.py script.
I use file with velvet_71_c10K.fa and clustering_gt1000.csv,right?
Thanks very much!
Hi @alneberg, I want to determine the most suitable number of genomes. What should I do?
In most cases you're fine running with the default parameter '-c 400'. If you have both highly diverse samples AND very deeply sequenced samples, you could increase this number.
I do this command:concoct -c 40 --coverage_file concoct-input/concoct_inputtableR.tsv --composition_file contigs/velvet_71_c10K.fa -b concoct-output/ Then I have something error:
And I can not deal with it,I need you help,thanks!