Open smorzechowski opened 8 months ago
Here are files to reproduce:
plot_mutations.py.txt plot_mutations_jobscript.txt subset_inact_mut_data.txt subset_query_annotation.txt
Hi @smorzechowski
Thank you for highlighting the issue and providing the code fixes and files to reproduce the issue. When we initially developed the tool, transcript identifiers with numerous dots were not common. Nonetheless, it's clear they shouldn't pose a problem in a properly written script. Once I write an update it will appear in the comments to this issue
Hi,
this is a very good point. We never encountered this, because we always strip any .1 or something like that from scaffold names. I guess the best approach is to check at the beginning whether gene / transcript names contain '.' or replace them with "_" or another character. Thx for reporting it.
I see, sounds good, thanks to you both!
Hello! I am a big fan of this tool, thank you! I want to plot inactivating mutations for various selected transcripts and although I can plot some successfully, others throw an index error.
First, I should note I've had to modify the plot_mutations.py in several places in order to plot anything at all. I had to modify line 711 to append the chain value to the trans_line string:
trans_line = line_data[0][2:]+"."+line_data[1]
, otherwise no lines would be selected from the inact_mut_data.txt file.Similarly, I had to do the same at line 1025 for trans string
trans = defect_line_dat[0]+"."+defect_line_dat[1]
:These modifications work with the following files and for some transcripts, such as ENSGALT00000067177.3.23 and NOG_ENSGALT00000004916.2.48. However, for other transcripts, like rna-XM_025148119.1.26806 and rna-XM_025144897.1.149, an index error is thrown:
I'm not familiar enough with this script, or with python in general, to troubleshoot this further, unfortunately!
Also, if you have any insight as to why the original plot_mutations.py doesn't work with my data, I'd be grateful. The original script doesn't work regardless of whether I use an isoform file and gene id instead of transcript id. Appending the chain number is necessary to select anything from the inact_mut_data.txt file.
Thanks very much for any insight you might have!