Open AmirArgoetti opened 7 years ago
Hi, I have solved some of the problems running the example data, but it still does not run till completion. Amir, I am pretty sure you are using a version of samtools that is too recent. Newer versions of samtools changed the output from samtools sort. Try using the one that comes with the FAST-iCLIP software in dependecies/
Also, I found that the GRCh38 bowtie2 index is not downloaded when one runs ./configure. I had to download it (from here: http://bowtie-bio.sourceforge.net/bowtie2/index.shtml) and rename to fit the hard-coded index name: docs/GRCh38/GRCh38_bowtie/GRCh38
Furthermore, the hard-coded snoRNA file is named incorrectly. I had to rename docs/GRCh38/snoRNA_coordinates.bed to docs/GRCh38/snoRNA_coordinates_15exp.bed
With these additions I was able to start the example, which ran for about 25 minutes, but failed. When I check the output files in the results folder it seems like the snoRNAs might be the problem. The first empty file generated is "clipGenes_snoRNA_reads.bed"
Here is how I ran the example data and the error messages:
~/software/FAST-iCLIP> fasticlip -i rawdata/A_Human_Test_R1.fastq rawdata/A_Human_Test_R2.fastq --GRCh38 -n Human_test -o results
/usr/local/lib/python2.7/dist-packages/matplotlib-2.0.0b3-py2.7-linux-x86_64.egg/matplotlib/font_manager.py:279: UserWarning: Matplotlib is building the font cache using fc-list. This may take a moment.
warnings.warn('Matplotlib is building the font cache using fc-list. This may take a moment.')
[samopen] SAM header is present: 1 sequences.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 1 sequences.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 1 sequences.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 1 sequences.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 1 sequences.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 1 sequences.
[samopen] SAM header is present: 1 sequences.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 1 sequences.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 1 sequences.
[samopen] SAM header is present: 1 sequences.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 625 sequences.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 625 sequences.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 195 sequences.
[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 195 sequences.
Traceback (most recent call last):
File "/usr/local/bin/fasticlip", line 9, in <module>
load_entry_point('fasticlip==0.9.3', 'console_scripts', 'fasticlip')()
File "/usr/local/lib/python2.7/dist-packages/fasticlip-0.9.3-py2.7.egg/fasticlip/fasticlip.py", line 331, in main
geneCounts_pc = get_gene_counts(proteinCodingReads_centered)
File "/usr/local/lib/python2.7/dist-packages/fasticlip-0.9.3-py2.7.egg/fasticlip/helper.py", line 570, in get_gene_counts
bf=pd.DataFrame(pd.read_table(bedFile,header=None))
File "/usr/local/lib/python2.7/dist-packages/pandas-0.18.1_345.g4a80521-py2.7-linux-x86_64.egg/pandas/io/parsers.py", line 647, in parser_f
return _read(filepath_or_buffer, kwds)
File "/usr/local/lib/python2.7/dist-packages/pandas-0.18.1_345.g4a80521-py2.7-linux-x86_64.egg/pandas/io/parsers.py", line 390, in _read
parser = TextFileReader(filepath_or_buffer, **kwds)
File "/usr/local/lib/python2.7/dist-packages/pandas-0.18.1_345.g4a80521-py2.7-linux-x86_64.egg/pandas/io/parsers.py", line 731, in __init__
self._make_engine(self.engine)
File "/usr/local/lib/python2.7/dist-packages/pandas-0.18.1_345.g4a80521-py2.7-linux-x86_64.egg/pandas/io/parsers.py", line 910, in _make_engine
self._engine = CParserWrapper(self.f, **self.options)
File "/usr/local/lib/python2.7/dist-packages/pandas-0.18.1_345.g4a80521-py2.7-linux-x86_64.egg/pandas/io/parsers.py", line 1376, in __init__
self._reader = _parser.TextReader(src, **kwds)
File "parser.pyx", line 538, in pandas.parser.TextReader.__cinit__ (pandas/parser.c:5517)
pandas.io.common.EmptyDataError: No columns to parse from file
Hello all, I tried to run the example but it was failed (problem is attached). Would you please advise me on how to proceed ?
Amir