Closed Stephaniedv closed 8 years ago
Hi @Stephaniedv,
I can't figure out completely what's going on based on the information you gave me but there seems to be some problem reading your coverage file. More specifically concoct has troubles identifying any columns in the coverage file. Is it tab separated?
I can make a more thorough debug effort if you send me some test data that can replicate the error.
Cheers, Johannes Alneberg
thank you, that was exactly it! I changed it and it worked :)
From: "Johannes Alneberg" notifications@github.com To: "BinPro/CONCOCT" CONCOCT@noreply.github.com Cc: "stephanie devos" stephanie.devos@ugent.be, "Mention" mention@noreply.github.com Sent: Friday, June 3, 2016 1:22:04 PM Subject: Re: [BinPro/CONCOCT] IndexError: index 0 is out of bounds for axis 0 with size 0 (#160)
Hi @Stephaniedv ,
I can't figure out completely what's going on based on the information you gave me but there seems to be some problem reading your coverage file. More specifically concoct has troubles identifying any columns in the coverage file. Is it tab separated?
I can make a more thorough debug effort if you send me some test data that can replicate the error.
Cheers, Johannes Alneberg
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub , or mute the thread .
Glad I could help!
Dear Johannes,
One more question, I hope you don't mind. Concoct is running, but has been running for over 4 days now. In the output folder, there are these files: -rw-r--r--+ 1 stvos aflp 500 Jun 3 13:51 args.txt -rw-r--r--+ 1 stvos aflp 693 Jun 3 14:03 log.txt -rw-r--r--+ 1 stvos aflp 276M Jun 3 14:02 original_data_gt1000.csv -rw-r--r--+ 1 stvos aflp 140K Jun 3 14:03 PCA_components_data_gt1000.csv -rw-r--r--+ 1 stvos aflp 133M Jun 3 14:03 PCA_transformed_data_gt1000.csv
No sign of a clustering file yet. Concoct stays within memory settings (8Gb) and uses quite a lot of CPU, but this cluster can handle that. I must say our input files are quite big: -rw-r--r--+ 1 stvos aflp 742M Jun 1 11:28 final_contigs_rm_mt_extractedscaffolds_contigs_cuttofragments.fasta -rwxr-xr-x+ 1 stvos aflp 6.7M Jun 3 13:48 final_contigs_rm_mt_extractedscaffolds_cuttofragments.fasta_PE_and_SE_sorted_duplicatesremoved.coverage.percontig.tsv
Does this long run seem normal to you, regarding the size of the input files? I submitted the concoct command with default parameters, without setting the amount of cluster or iterations. We are expecting one large cluster (main arthropod genome) and one cluster containing bacteria, algae, etc.
With kind regards and thank you in advance for your opinion,
Stephanie
From: "Johannes Alneberg" notifications@github.com To: "BinPro/CONCOCT" CONCOCT@noreply.github.com Cc: "stephanie devos" stephanie.devos@ugent.be, "Mention" mention@noreply.github.com Sent: Friday, June 3, 2016 2:11:16 PM Subject: Re: [BinPro/CONCOCT] IndexError: index 0 is out of bounds for axis 0 with size 0 (#160)
Glad I could help!
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub , or mute the thread .
Sure!
Yes its not a completely strange amount of time used, although I cannot really tell from the file sizes only how long time I would expect. I don't spot anything abnormal in your output files so I would just recommend you to wait for concoct to finish.
Cheers, johannes
ok!
Thanks loads, also for such a quick response.
Stéphanie
From: "Johannes Alneberg" notifications@github.com To: "BinPro/CONCOCT" CONCOCT@noreply.github.com Cc: "stephanie devos" stephanie.devos@ugent.be, "Mention" mention@noreply.github.com Sent: Tuesday, June 7, 2016 4:45:39 PM Subject: Re: [BinPro/CONCOCT] IndexError: index 0 is out of bounds for axis 0 with size 0 (#160)
Sure!
Yes its not a completely strange amount of time used, although I cannot really tell from the file sizes only how long time I would expect. I don't spot anything abnormal in your output files so I would just recommend you to wait for concoct to finish.
Cheers, johannes
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub , or mute the thread .
Hi,
I submit this module load python/x86_64/2.7.2 concoct -c 40 --coverage_file final_contigs_cuttofragments_PE_and_SE_sorted_duplicatesremoved.coverage.percontig.tsv --composition_file final_contigs_contigs_cuttofragments.fasta -b concoct-output/
and get this error: /software/shared/apps/x86_64/python/2.7.2/lib/python2.7/site-packages/biopython-1.65-py2.7-linux-x86_64.egg/Bio/Seq.py:342: BiopythonDeprecationWarning: This method is obsolete; please use str(my_seq) instead of my_seq.tostring(). BiopythonDeprecationWarning) Traceback (most recent call last): File "/software/shared/apps/x86_64/python/2.7.2/bin//concoct", line 5, in
pkg_resources.run_script('concoct==0.4.0', 'concoct')
File "/software/shared/apps/x86_64/python/2.7.2/lib/python2.7/site-packages/setuptools-14.3-py2.7.egg/pkg_resources/init.py", line 698, in run_script
self.require(requires)[0].run_script(script_name, ns)
File "/software/shared/apps/x86_64/python/2.7.2/lib/python2.7/site-packages/setuptools-14.3-py2.7.egg/pkg_resources/init.py", line 1616, in run_script
exec(code, namespace, namespace)
File "/shared/clssoft/apps/x86_64/python/2.7.2/lib/python2.7/site-packages/concoct-0.4.0-py2.7-linux-x86_64.egg/EGG-INFO/scripts/concoct", line 76, in
results = main(args)
File "/shared/clssoft/apps/x86_64/python/2.7.2/lib/python2.7/site-packages/concoct-0.4.0-py2.7-linux-x86_64.egg/EGG-INFO/scripts/concoct", line 20, in main
composition, cov, cov_range = load_data(args)
File "/software/shared/apps/x86_64/python/2.7.2/lib/python2.7/site-packages/concoct-0.4.0-py2.7-linux-x86_64.egg/concoct/input.py", line 25, in load_data
read_length = args.read_length
File "/software/shared/apps/x86_64/python/2.7.2/lib/python2.7/site-packages/concoct-0.4.0-py2.7-linux-x86_64.egg/concoct/input.py", line 87, in load_coverage
cov_range = (cov.columns[0],cov.columns[-1])
File "/software/shared/apps/x86_64/python/2.7.2/lib/python2.7/site-packages/pandas/core/index.py", line 1149, in getitem
return getitem(key)
IndexError: index 0 is out of bounds for axis 0 with size 0
Does anyone know what could be causing this? I have a fasta file and a 2-column file with contig names and coverages as input. Should the coverages be integers? Should I build an index? Thank you for any ideas