itmat / rum

RNA-Seq Unified Mapper
http://cbil.upenn.edu/RUM
MIT License
26 stars 4 forks source link

another broken RUM job #161

Open safisher opened 11 years ago

safisher commented 11 years ago

Below is an error file for another RUM job that crapped out. Note that the error file has lots of control characters. That seems somewhat unexpected. Do you think I might have a corrupted data file?


Thu Jan 3 09:57:47 2013 16856 FATAL RUM::Death - Unmatched ) in regex; marked by <-- HERE in m/ 43$,-2>16&7,9%;53072"% > 676&? acvvraegv:gvttevt8fgq=2!7>,2 ,vvjr, 106.0=,) <-- HERE 6'9 +%,3&67%&1-:7612& avartartvccetggaatatgtgca/ at /home/fisher/local/src/RUM-Pipeline-v2.0.3_04/bin/../lib/RUM/Sort.pm line 75, line 71437762. Thu Jan 3 09:57:47 2013 6473 FATAL RUM::Death - Error running "perl /home/fisher/local/src/RUM-Pipeline-v2.0.3_04/bin/../bin/rum2quantifications.pl --genes-in /lab/repo/resources/rum2/drosophila/drosophila_modencode_gene_info.txt --unique-in /unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks/RUM_Unique.sorted.28 --non-unique-in /unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks/RUM_NU.sorted.28 -o /unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks/quant.28.tmp.7OAWQ51D -countsonly"

The stderr from that command is

Unmatched ) in regex; marked by <-- HERE in m/ 43$,-2>16&7,9%;53072"% > 676&? acvvraegv:gvttevt8fgq=2!7>,2 ,vvjr, 106.0=,) <-- HERE 6'9 +%,3&67%&1-:7612& avartartvccetggaatatgtgca/ at /home/fisher/local/src/RUM-Pipeline-v2.0.3_04/bin/../lib/RUM/Sort.pm line 75, line 71437762.

The error log file Sample_H4D15/rum.trim/log/rum_errors_28.log may have more details. Thu Jan 3 10:18:53 2013 16941 FATAL RUM::Death - Unmatched ) in regex; marked by <-- HERE in m/ 43$,-2>16&7,9%;53072"% > 676&? acvvraegv:gvttevt8fgq=2!7>,2 ,vvjr, 106.0=,) <-- HERE 6'9 +%,3&67%&1-:7612& avartartvccetggaatatgtgca/ at /home/fisher/local/src/RUM-Pipeline-v2.0.3_04/bin/../lib/RUM/Sort.pm line 75, line 71437762. Thu Jan 3 10:18:53 2013 16908 FATAL RUM::Death - Error running "perl /home/fisher/local/src/RUM-Pipeline-v2.0.3_04/bin/../bin/rum2quantifications.pl --genes-in /lab/repo/resources/rum2/drosophila/drosophila_modencode_gene_info.txt --unique-in /unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks/RUM_Unique.sorted.28 --non-unique-in /unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks/RUM_NU.sorted.28 -o /unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks/quant.28.tmp.gsXqlkRH -countsonly"

The stderr from that command is

Unmatched ) in regex; marked by <-- HERE in m/ 43$,-2>16&7,9%;53072"% > 676&? acvvraegv:gvttevt8fgq=2!7>,2 ,vvjr, 106.0=,) <-- HERE 6'9 +%,3&67%&1-:7612& avartartvccetggaatatgtgca/ at /home/fisher/local/src/RUM-Pipeline-v2.0.3_04/bin/../lib/RUM/Sort.pm line 75, line 71437762.

The error log file Sample_H4D15/rum.trim/log/rum_errors_28.log may have more details. Thu Jan 3 10:40:16 2013 17080 FATAL RUM::Death - Unmatched ) in regex; marked by <-- HERE in m/ 43$,-2>16&7,9%;53072"% > 676&? acvvraegv:gvttevt8fgq=2!7>,2 ,vvjr, 106.0=,) <-- HERE 6'9 +%,3&67%&1-:7612& avartartvccetggaatatgtgca/ at /home/fisher/local/src/RUM-Pipeline-v2.0.3_04/bin/../lib/RUM/Sort.pm line 75, line 71437762. Thu Jan 3 10:40:16 2013 17049 FATAL RUM::Death - Error running "perl /home/fisher/local/src/RUM-Pipeline-v2.0.3_04/bin/../bin/rum2quantifications.pl --genes-in /lab/repo/resources/rum2/drosophila/drosophila_modencode_gene_info.txt --unique-in /unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks/RUM_Unique.sorted.28 --non-unique-in /unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks/RUM_NU.sorted.28 -o /unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks/quant.28.tmp.jxPIetp1 -countsonly"

The stderr from that command is

Unmatched ) in regex; marked by <-- HERE in m/ 43$,-2>16&7,9%;53072"% > 676&? acvvraegv:gvttevt8fgq=2!7>,2 ,vvjr, 106.0=,) <-- HERE 6'9 +%,3&67%&1-:7612& avartartvccetggaatatgtgca/ at /home/fisher/local/src/RUM-Pipeline-v2.0.3_04/bin/../lib/RUM/Sort.pm line 75, line 71437762.

The error log file Sample_H4D15/rum.trim/log/rum_errors_28.log may have more details.

mdelaurentis commented 11 years ago

I'm almost positive this is due to a corrupt intermediate file. I believe I've seen a similar error before, and it was due to a corrupt file. The error about the unmatched ')' in a regular expression is thrown from the code that compares two chromosome names to each other, which is used in sorting alignments by location. The sort function creates a regular expression based on the chromosome names, and uses that for part of the comparison. In this case, one of the chromosome names looks like it's junk.

The script that failed (rum2quantifications) takes three input files: the gene annotation file in the index, the RUM_Unique.CHUNK file, and the RUM_NU.CHUNK file. My guess is that either the RUM_NU or RUM_Unique file is corrupted in some way.

I don't think I have access to the directory of the failed job (I don't have read permissions on /unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks). If you want to give me access to it I'll take a look and see which file is messed up. I think you'll probably just need to rerun this one.

On Fri, Jan 4, 2013 at 2:57 PM, safisher notifications@github.com wrote:

/unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks

safisher commented 11 years ago

I added you to the lab group so you should now be able to access that directory.

Thanks, Stephen

On Jan 7, 2013, at 10:25 AM, Mike DeLaurentis notifications@github.com wrote:

I'm almost positive this is due to a corrupt intermediate file. I believe I've seen a similar error before, and it was due to a corrupt file. The error about the unmatched ')' in a regular expression is thrown from the code that compares two chromosome names to each other, which is used in sorting alignments by location. The sort function creates a regular expression based on the chromosome names, and uses that for part of the comparison. In this case, one of the chromosome names looks like it's junk.

The script that failed (rum2quantifications) takes three input files: the gene annotation file in the index, the RUM_Unique.CHUNK file, and the RUM_NU.CHUNK file. My guess is that either the RUM_NU or RUM_Unique file is corrupted in some way.

I don't think I have access to the directory of the failed job (I don't have read permissions on /unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks). If you want to give me access to it I'll take a look and see which file is messed up. I think you'll probably just need to rerun this one.

On Fri, Jan 4, 2013 at 2:57 PM, safisher notifications@github.com wrote:

/unsafe14/fisher/e.53/Sample_H4D15/rum.trim/chunks

— Reply to this email directly or view it on GitHub.