Closed matwgmkw closed 1 year ago
I got the "regionsorter.txt" file with some data in it, but that is the only file I got after running RepEnrich2.
Another question is if I downloaded the setup for the human genome from the dropbox link (https://www.dropbox.com/sh/ovf1t2gn5lvmyla/AADEPGgGMg-vjj8Gk-s76i6Ja?dl=0), do I still have to do the setup in step 2?
Hi there,
The setup folder is meant to save some time and should let you skip the setup step, but it seems like something might have been changed inside repnames.bed - maybe an extra tab or some white space was added to the line in question? If it still doesn't work after checking this, I would advise just starting from the first step and attaining fresh annotation files from UCSC table browser and running the setup yourself as opposed to using the dropbox folder. That should give you a fresh setup folder that should work with whatever annotation you provide it.
Best, Nick
Hi. I got stuck while using RepEnrich2. Here is the error message I got. How can I fix it?
Software versions I used: py-biopython/1.70_py27 bedtools/2.27.1 bowtie2/2.3.4.1 samtools 1.16.1 Python 2.7.13
Thanks. Matthew
Preparing for analysis using RepEnrich2... Conducting region sorting on unique mapping reads.... ***** WARNING: File ./RepEnrich2_setup_hg38/repnames.bed has inconsistent naming convention for record: chr1 10469 11447 TAR1
***** WARNING: File ./RepEnrich2_setup_hg38/repnames.bed has inconsistent naming convention for record: chr1 10469 11447 TAR1
Identified 0 unique reads that mapped to repeats. Processing repeat psuedogenomes... sorting and combining the output for both read pairs... completed ... Writing and processing intermediate files... Traceback (most recent call last): File "RepEnrich2.py", line 269, in
for data in importtext(sorted + os.path.sep + rep + '.txt', '\t'):
File "RepEnrich2.py", line 64, in import_text
skipinitialspace=True):
_csv.Error: line contains NULL byte