Closed jvelotta closed 5 years ago
Hi Jon,
Can you give some more details on how you run Clust and the error? Can you please copy and paste here the entire terminal output including the error? Are you running it over one or multiple datasets collectively?
I will make sure I provide the required assistant to make clust run for you.
All the best Basel
Hi Basel,
Thank you. I am running Clust on cpm normalized RNAseq data. I have a .txt data file (11,104 rows of gene names and 26 columns of individuals), and a replicates file. This is a single dataset.
Below is the code and the error message. The list of integers after the error message goes to 288730!
Thanks for your help.
Jonathans-MacBook-Pro:clust jonathanvelotta1$ python clust-1.8.12/clust.py gastroc_norm_counts.txt -r gastroc_replicates_file.txt
/===========================================================================\
| Clust |
| (Optimised consensus clustering of multiple heterogenous datasets) |
| Python package version 1.8.12 (2018) Basel Abu-Jamous |
+---------------------------------------------------------------------------+
| Analysis started at: Friday 22 February 2019 (09:37:39) |
| 1. Reading dataset(s) |
Traceback (most recent call last):
File "clust-1.8.12/clust.py", line 6, in
Update: I uninstalled and reinstalled clust and pandas, and that did not change the error message. Thanks again! J
Hi and sorry for being late in replying.
I guess I have seen this error before with someone whose dataset does not use the correct newline character '\n' or '\r\n'; rather it only uses the carriage return character '\r'. I don't think that any modern proper operating system uses '\r' alone, as it technically does not define a new line.
In other words, to an operating system, your data file looks like a very very very long SINGLE LINE string!
The solution would be to replace every '\r' in your data file with '\r\n'. I am happy to do it for you if you like to email me the dataset confidentially @ basel.abu-jamous@sensynehealth.com.
Best wishes and please let me know if any further help is needed :)
Basel
Thanks Basel, that did the trick. My .tsv files were just one long line! Thanks again.
Hi Basel,
I keep getting this error when running the script using python clust.py (I was not able to execute the program using the first two methods described). Any idea if this is because of a file formatting issue?
Thanks,
Jon